Samsung HD203WI Raw Read Error Rate
Is it normal for the SMART attribute "Raw Read Error Rate" to display more than 0? I've never seen anything above 0 before but on my file server drive I noticed it was at 2 after formatting and moving a bit of data but it's now at 8, it's only been in use for about 3-4 days. I know SMART attributes are different on different drives but I'm a little concerned nonetheless, does this mean the drive is faulty?
Thanks. :)
Re: Samsung HD203WI Raw Read Error Rate
Definately maybe :)
http://en.wikipedia.org/wiki/S.M.A.R.T.
Not clear cut - as this article shows - and if you Google raw data error rate you get conflicting views. It also depends on how the drive mfr defines the raw data error rate.
More info here:
http://opensource-archive.org/showthread.php?t=105940
It is probably something to monitor, periodically, say daily for another week to see if the count increases - but it is a rate - and that implies errors/unit of time, rather than an absolute value. with some drives a lower count is worse than a high count - just depends on how the mfr reports them and how the SMART tool interprets that drives SMART data.
Probably the best way to determine the drive's health is to use Samsungs drive utility.
http://www.samsung.com/global/busine...ort_HUTIL.html
Re: Samsung HD203WI Raw Read Error Rate
I'm currently running a backup of everything on the drive and it's just gone up to 9. SMART status is 'PASSED' but I'll run the drive utility again once this backup completes.
Edit: Just refreshed and it's now 10. :(
Re: Samsung HD203WI Raw Read Error Rate
From a WD drive
Code:
SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE
1 Raw_Read_Error_Rate 0x002f 200 200 051 Pre-fail Always - 0
3 Spin_Up_Time 0x0027 180 179 021 Pre-fail Always - 5958
4 Start_Stop_Count 0x0032 100 100 000 Old_age Always - 118
5 Reallocated_Sector_Ct 0x0033 200 200 140 Pre-fail Always - 0
7 Seek_Error_Rate 0x002e 100 253 000 Old_age Always - 0
9 Power_On_Hours 0x0032 099 099 000 Old_age Always - 1415
From a Samsung drive
Code:
SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE
1 Raw_Read_Error_Rate 0x000f 100 100 051 Pre-fail Always - 0
3 Spin_Up_Time 0x0007 087 087 011 Pre-fail Always - 4780
4 Start_Stop_Count 0x0032 100 100 000 Old_age Always - 36
5 Reallocated_Sector_Ct 0x0033 100 100 010 Pre-fail Always - 0
7 Seek_Error_Rate 0x000f 253 253 051 Pre-fail Always - 0
8 Seek_Time_Performance 0x0025 100 100 015 Pre-fail Offline - 0
9 Power_On_Hours 0x0032 100 100 000 Old_age Always - 747
The drives are in a fileserver (with one partition on each in a RAID1 array) Nothing there is giving me concern (sorry about the formatting Edit - formatting sorted :) )
Re: Samsung HD203WI Raw Read Error Rate
I've just run the drive utility (skipped the full ~6 hour scan but I did run that when I first got the drive) and it passed:
-------------------------------------------------
Test Loop Number : 1/1
-------------------------------------------------
>> READ DMA : Testing... Pass !
>> CHECK S.M.A.R.T. : Testing... Pass !
>> SPIN DOWN / UP : Testing... Pass !
>> SIMPLE SURFACE SCAN : Testing... Pass !
>> WRITE VERIFY : Testing... Pass !
>> RANDOM SURFACE SCAN : Testing... Pass !
>> READ SURFACE SCAN : Testing... Skip !
>> RAM VERIFY TEST : Testing... Pass !
>> MULTIPLE READ TEST : Testing... Pass !
Service Code => SJ25 : Test OK.
Total Elapsed Time : 00:01:58
Test was completed.
=================================================
What are the SMART attributes on your Samsungs?
Edit: You answered my question before I posted!! :laugh:
Even though the test passed I'm still a little concerned about that non-0 value, and I've never seen it on any of my other drives.
Re: Samsung HD203WI Raw Read Error Rate
Is that the raw_value or the value field (200 and 100 on the drive attributes I posted)
Re: Samsung HD203WI Raw Read Error Rate
It's the raw value. And worryingly it's now at 11.
Re: Samsung HD203WI Raw Read Error Rate
What are the other fields (value, worst and thresh)?
Re: Samsung HD203WI Raw Read Error Rate
Value=100, Worst=100, Thresh=51.
Re: Samsung HD203WI Raw Read Error Rate
Same as my Samsung drive. Probably something to be a little uneasy about, but not panic worthy. Defintely a case of watch and wait - but if the drive isn't in a RAID array, I'd be reviewing my backup strategy, remembering that catastrophic failure is unlikely (though not impossible) - but if the count continues increasing then I would start getting more concerned.
Re: Samsung HD203WI Raw Read Error Rate
It's not RAIDed but I make regular backups to another drive. When would you consider RMAing the drive? I've contacted Samsung who replied quickly saying basically what you've said, SMART data is hard to interpret, run some tests, keep an eye on the drive and consider an RMA if the value keeps increasing. I might email Scan and see what they say but I don't want to RMA if there's nothing wrong and considering it passes the drive utility they'll probably just return it.
Re: Samsung HD203WI Raw Read Error Rate
No - not yet - but I would keep an eye on it.
Re: Samsung HD203WI Raw Read Error Rate
Thanks I will do, but what should I look out for, a change in the 'Vaule' field or something? Also any idea if this can mean actual filesystem data loss or is it still corrected?
Re: Samsung HD203WI Raw Read Error Rate
Changes in raw_value. These are raw reads, so the decoding and ECC should take care of them. You if you are running logwatch, you should get an abstract of kernel messages that will show read/write data errors to and from the drive. (or look at the messages log - possibly in /var/log or wherever your distro writes its log files.)
Re: Samsung HD203WI Raw Read Error Rate
Re: Samsung HD203WI Raw Read Error Rate
It's still at 11 so I gather it only rises under load, which makes sense given the attribute name. I've left logging as defaults, any idea what log I should be looking in and what text I should be looking for? I'm guessing /var/log/messages and searching warn, doing that I found a few non-related warnings, mainly something about the ntp daemon.
Thanks for all your help lately BTW! :)
Edit: running fsck took it up to 13 so I'm guessing it's going to keep rising when the disk is being read.