Hi , I replace this this if your data are important
02 02 Throughput Performance Overall (general) throughput
performance of a hard disk drive. If the value of this attribute is
decreasing there is a high probability that there is a problem with
your disk.
Review:
http://en.wikipedia.org/wiki/Self-Monitoring%2C_Analysis%2C_and_Reporting_Technology
high probability :-D
On 30/01/07, Volker Kuhlmann <list0570@...> wrote:
> > Since yesterday I'm getting messages from smartd about /dev/hda failing the
> > SMART self-check, and smartctl tells me a drive failure is expected in less
> > than 24 hours. But when looking at the smartctl output I see only the
> > Throughput_Performance failing, and some manual self-tests didn't yet turn
> > up any errors,
>
> Hm, I don't know, but I thought one can usually trust the disk saying
> about itself that it's had it. If it's still under warranty get it
> replaced.
>
> Volker
>
> --
> Volker Kuhlmann is list0570 with the domain in header
> http://volker.dnsalias.net/ Please do not CC list postings to me.
>
> -------------------------------------------------------------------------
> Take Surveys. Earn Cash. Influence the Future of IT
> Join SourceForge.net's Techsay panel and you'll get the chance to share your
> opinions on IT & business topics through brief surveys - and earn cash
> http://www.techsay.com/default.php?page=join.php&p=sourceforge&CID=DEVDEV
> _______________________________________________
> Smartmontools-support mailing list
> Smartmontools-support@...
> https://lists.sourceforge.net/lists/listinfo/smartmontools-support
>

> Since yesterday I'm getting messages from smartd about /dev/hda failing the
> SMART self-check, and smartctl tells me a drive failure is expected in less
> than 24 hours. But when looking at the smartctl output I see only the
> Throughput_Performance failing, and some manual self-tests didn't yet turn
> up any errors,
Hm, I don't know, but I thought one can usually trust the disk saying
about itself that it's had it. If it's still under warranty get it
replaced.
Volker
--
Volker Kuhlmann is list0570 with the domain in header
http://volker.dnsalias.net/ Please do not CC list postings to me.

I have a disk which shows a lot of badblocks when scanned with
"badblocks" command. Syslog indicates that the disk is broken:
hdb: dma_intr: status=0x51 { DriveReady SeekComplete Error }
hdb: dma_intr: error=0x40 { UncorrectableError }, LBAsect=39114674,
high=2, low=5560242, sector=39114624
ide: failed opcode was: unknown
end_request: I/O error, dev hdb, sector 39114624
Also, smartctl is not very happy about it:
SMART Self-test log structure revision number 1
Num Test_Description Status Remaining
LifeTime(hours) LBA_of_first_error
# 1 Short offline Completed: read failure 90% 14923
39114674
However, according to smartctl manual:
"To test within the shell for whether or not the different bits
are turned on or off, you can use the following type of con-
struction (this is bash syntax):
smartstat=$(($? & 8))
This looks at only at bit 3 of the exit status $? (since
8=2^3). The shell variable $smartstat will be nonzero if SMART
status check returned "disk failing" and zero otherwise."
For that broken disk, smartctl exits with code 192.
So, according to the manual:
# echo $((192 & 8))
0
exit code is 0, so the disk is failing according to badblocks and
kernel, but according to smartctl, the disk is fine?
Or do I have problems understanding the manual?
I'm running smartmontools 5.37.
--
Tomasz Chmielewski
http://wpkg.org

When I run smartmontools, I see that under "SMART support is:" it says:
Available - device has SMART capability.
Enabled status cached by OS, trying SMART RETURN STATUS cmd.
What does the last line mean? Is the ON/OFF status saved by the OS? When I turn SMART Off using smartmontools and reboot the SMART is back on. I was trying to figure out why and how to fix this? Does this 2nd line explain it why? And what is this "trying SMART RETURN STATUS cmd" mean?
-----Original Message-----
From: bmwrph@...
To: smartmontools-support@...
Sent: Thu, 18 Jan 2007 1:16 PM
Subject: [smartmontools-support] SMART settings are NOT preserved over power-cycling. Help needed.
I installed the smartmontools, turned off SMART and after rebooting, SMART is back ON (even though it was set to OFF). I have Windows 2000. Is the ON setting cached by the OS and that is why it does this? How can I prevent this?
I understand that SMART settings are preserved over power-cycling. If not, I see that it might be necessary that the command '-s off' be placed in a start-up script in rc.local or rc.sysinit. Before I would work on this script, is there anything else that I could try to fix this automatic revert to ON? If not, can anybody help me with this start-up script. I never done this. Thanks.
________________________________________________________________________
Check out the new AOL. Most comprehensive set of free safety and security tools, free access to millions of high-quality videos from across the web, free AOL Mail and more.

2007/1/18, Jon Steel <jon.steel@...>:
> When I executed smartd start, it gave me this message:
>
> Currently the smartmontools package has no init script for
> the Unknown OS/distribution. If you can provide one or this
> one works after removing some ifdefs, please contact
> smartmontools-support@...
>
> Im using OpenBSD 4.0. Would you mind giving me a hand?
Add this to /etc/rc.local:
--------------
# smartctl
if [ -x /usr/local/sbin/smartctl ]; then
echo -n ' smartctl '
/usr/local/sbin/smartctl -s on /dev/wd0c
fi
# smartd
if [ -x /usr/local/sbin/smartd ]; then
echo -n ' smartd '
/usr/local/sbin/smartd --pidfile=/var/run/smartd.pid
fi
--------------
Btw: smartmontools are in ports.
Best
Martin

Volker Kuhlmann wrote:
>> I have a smartd running on different machines.
>> It starts when the machine starts with:
>>
>> DEVICESCAN -a -o on -S on -s (S/../.././02|L/../../6/03)
>
>> Is there a way to check which drives is smartd monitoring?
>
> It's logged in syslog, other than that I don't think there is anything.
> Try the mail notification feature in test mode. Or scan syslog 20
> seconds after starting smartd?
That's exactly the problem.
1. I don't want email notification for several reasons: most of machines
I run smartd on can't send mail (no mail server installed, firewalled),
and I receive too much mail already.
2. I want to have the monitoring done via a monitoring tool like nagios.
Right now, I'm scanning the syslog to look for the drives smartd is
monitoring using the following logic:
* send HUP to smartd (it rereads the configuration, and prints
everything to syslog again)
* check the last lines of syslog, and check for the drives
killall -HUP smartd
IFS=" "
DRIVES=$(tail -100 /var/log/syslog | awk '/is SMART capable. Adding
to/{sub(/,/,"",$7);print $7}' | sort | uniq)
# then, start smartctl for each drive in $DRIVES
The problem is, it's not reliable.
- I have to send HUP to smartd each time
- right now I check the last 100 lines of syslog (after sending HUP to
smartd) - what if the machine is a firewall with lots of logging? I know
I could check the date, but it complicates the script a lot more.
So, if any smartmontools developer is listening, I'd ask for a new flag
to smartd:
-s --status
Print the drives smartd is monitoring and exit (smartd must be running).
--
Tomasz Chmielewski
http://wpkg.org

Hi,
my system has kernel source installed.
I m interesting to try this trick.
Where can i find this patch ?
Thanks in advance !
Regards,
Nicolas Lienard
On Sun, January 21, 2007 22:54, Sergey Svishchev wrote:
> On Tue, Jan 16, 2007 at 04:28:34PM -0500, Eduard Martinescu wrote:
>
>> Nicolas,
>>
>>
>> Currently, there is no support for the CCISS card under FreeBSD. I
>> don't have access to any cards of that type myself, so I won't be able=
to
>> add support for it in the near future. If someone else would like to
>> step up to do it, that would be great.
>
> There's experimental patch that uses Linux code (it's in archives). It
> requires FreeBSD kernel source, since relevant headers are not installe=
d
> into /usr/include.
>
> I tested it on 5.2.1 with SmartArray 5i.
>
>
> --
> Sergey Svishchev
> -----------------------------------------------------------------------=
--
> Take Surveys. Earn Cash. Influence the Future of IT
> Join SourceForge.net's Techsay panel and you'll get the chance to share
> your opinions on IT & business topics through brief surveys - and earn
> cash
> http://www.techsay.com/default.php?page=3Djoin.php&p=3Dsourceforge&CID=3D=
DEVDEV
> _______________________________________________
> Smartmontools-support mailing list
> Smartmontools-support@...
> https://lists.sourceforge.net/lists/listinfo/smartmontools-support
>
>

On Tue, Jan 16, 2007 at 04:28:34PM -0500, Eduard Martinescu wrote:
>Nicolas,
>
>Currently, there is no support for the CCISS card under FreeBSD. I
>don't have access to any cards of that type myself, so I won't be able
>to add support for it in the near future. If someone else would like to
>step up to do it, that would be great.
There's experimental patch that uses Linux code (it's in archives). It
requires FreeBSD kernel source, since relevant headers are not installed
into /usr/include.
I tested it on 5.2.1 with SmartArray 5i.
--=20
Sergey Svishchev

> I have a smartd running on different machines.
> It starts when the machine starts with:
>
> DEVICESCAN -a -o on -S on -s (S/../.././02|L/../../6/03)
> Is there a way to check which drives is smartd monitoring?
It's logged in syslog, other than that I don't think there is anything.
Try the mail notification feature in test mode. Or scan syslog 20
seconds after starting smartd?
Volker
--
Volker Kuhlmann is list0570 with the domain in header
http://volker.dnsalias.net/ Please do not CC list postings to me.

It is probably either the BIOS or Windows that is turning on SMART. Wy do
you want to turn it off?
On Thu, 18 Jan 2007, bmwrph@... wrote:
> I installed the smartmontools, turned off SMART and after rebooting, SMART is back ON (even though it was set to OFF). I have Windows 2000. Is the ON setting cached by the OS and that is why it does this? How can I prevent this?
>
> I understand that SMART settings are preserved over power-cycling. If not, I see that it might be necessary that the command '-s off' be placed in a start-up script in rc.local or rc.sysinit. Before I would work on this script, is there anything else that I could try to fix this automatic revert to ON? If not, can anybody help me with this start-up script. I never done this. Thanks.
>
> ________________________________________________________________________
> Check out the new AOL. Most comprehensive set of free safety and security tools, free access to millions of high-quality videos from across the web, free AOL Mail and more.
>

On Fri, Jan 19, 2007 at 10:50:11 -0300,
Eduardo Grosclaude <eduardo.grosclaude@...> wrote:
> Hello,
> Not SMART-related, please forgive if OT. I want to build a HA cluster
> and would love to test some odd failure modes. I'm interested in how
> to simulate a (SATA) disk failure. Can I take it offline while
> running, or otherwise make it appear as suddenly dead? Perhaps with
> ATA commands? Of course I don't want to break it! :)
> Thank you in advance
Very recent Linux kernels are supposed to have a way to introduce faults
for testing purposes.

Eduardo Grosclaude wrote:
> Hello,
> Not SMART-related, please forgive if OT. I want to build a HA cluster
> and would love to test some odd failure modes. I'm interested in how
> to simulate a (SATA) disk failure. Can I take it offline while
> running, or otherwise make it appear as suddenly dead? Perhaps with
> ATA commands? Of course I don't want to break it! :)
> Thank you in advance
>
>
I think you can use some echo in the /proc/scsi/scsi like
echo "*scsi remove-single*-device h c i l" >/proc/scsi/scsi
Where h is the host, c the channel, i the id, l the lun
cat /proc/scsi/scsi will show you the values you are looking at:
Host: scsi7 Channel: 00 Id: 06 Lun: 00
Vendor: CERTANCE Model: ULTRIUM 2 Rev: 1785
Type: Sequential-Access ANSI SCSI revision: 03
In this example, h=7 c=0 i=6 l=0..
Hope it will helps you,

Hello,
Not SMART-related, please forgive if OT. I want to build a HA cluster
and would love to test some odd failure modes. I'm interested in how
to simulate a (SATA) disk failure. Can I take it offline while
running, or otherwise make it appear as suddenly dead? Perhaps with
ATA commands? Of course I don't want to break it! :)
Thank you in advance
--
Eduardo Grosclaude
Universidad Nacional del Comahue
Neuquen, Argentina

I have a smartd running on different machines.
It starts when the machine starts with:
DEVICESCAN -a -o on -S on -s (S/../.././02|L/../../6/03)
in /etc/smartd.conf.
The machines have different drives; sometimes only /dev/hda or /dev/sda,
or sometimes /dev/sd[a-b] etc.
Is there a way to check which drives is smartd monitoring?
I noticed smartd adds some info when it starts to the syslog, but
parsing system logs doesn't seem to be a great idea for me (logs can be
rotated).
--
Tomasz Chmielewski
http://wpkg.org

Tomasz Chmielewski wrote:
> What does error code 64 mean?
>
> "RETURN VALUE" section of the manual is not very clear to me.
>
> According to it:
>
> smartstat=$(($? & 8))
>
> gives 0, so, is the disk healthy? Why the error code 64 then?
I've posted this before - it's a script that I run every day to check
smartmontools settings other than just a simple pass/fail (which I would
be notified about through smartd). It does some parsing of the
returncode from smartctl, so may be of interest.
http://jeremy.publication.org.uk/checkdisks [*]
64 (at least, a 1 in the 7th bit) is an indication that there are items
in the Error Log. In your case it's probably ok (I'm not sure of the
true nature of ABRT messages), but worth watching the disk just in case
(particularly things like the Reallocated Sector count).
Best wishes,
Jeremy
[*] Note: Python has some oddness about returncodes, using the low 8
bits to indicate the code the subprocess was killed with, hence the
bit-shift required to get the real returncode. This wouldn't be
necessary if you were dealing with it in bash or similar.

Erwan Velu wrote:
> Tomasz Chmielewski wrote:
>> I want to check the state of disks on multiple servers running smartd.
>>
>> The only information I need is if the disk is failing or not.
>>
>> After reading smartctl manual, I came to the conclusion that the below
>> test should be sufficient for IDE disks:
>>
>> smartctl -a /dev/disk &>/dev/null
>> echo $?
>>
>>
>> If the exit code is different than 0, we should suspect a failing disk
>> and look at the things more carefully.
>>
>> Am I correct?
>>
> Regarding the man page, it seems yes ;)
Hmmz.
I have a disk which gives the following results:
# smartctl -a /dev/hda &>/dev/null
# echo $?
64
What does error code 64 mean?
"RETURN VALUE" section of the manual is not very clear to me.
According to it:
smartstat=$(($? & 8))
gives 0, so, is the disk healthy? Why the error code 64 then?
--
Tomasz Chmielewski
http://wpkg.org

Hi
First off, I had an error with make which i fixed by removing the
section for smartd.conf.5. But I dont think this would have an effect on
the program.
When I executed smartd start, it gave me this message:
Currently the smartmontools package has no init script for
the Unknown OS/distribution. If you can provide one or this
one works after removing some ifdefs, please contact
smartmontools-support@...
Im using OpenBSD 4.0. Would you mind giving me a hand?
Thank You
Jonathan Steel

I installed the smartmontools, turned off SMART and after rebooting, SMART is back ON (even though it was set to OFF). I have Windows 2000. Is the ON setting cached by the OS and that is why it does this? How can I prevent this?
I understand that SMART settings are preserved over power-cycling. If not, I see that it might be necessary that the command '-s off' be placed in a start-up script in rc.local or rc.sysinit. Before I would work on this script, is there anything else that I could try to fix this automatic revert to ON? If not, can anybody help me with this start-up script. I never done this. Thanks.
________________________________________________________________________
Check out the new AOL. Most comprehensive set of free safety and security tools, free access to millions of high-quality videos from across the web, free AOL Mail and more.

Tomasz Chmielewski wrote:
> I want to check the state of disks on multiple servers running smartd.
>
> The only information I need is if the disk is failing or not.
>
> After reading smartctl manual, I came to the conclusion that the below
> test should be sufficient for IDE disks:
>
> smartctl -a /dev/disk &>/dev/null
> echo $?
>
>
> If the exit code is different than 0, we should suspect a failing disk
> and look at the things more carefully.
>
> Am I correct?
>
Regarding the man page, it seems yes ;)
>
> Does the smartd run some tests from time to time, or perhaps I should do
> it myself, too (i.e., smartctl -t long /dev/disk, via crontab once a day)?
>
First, you must activate the off line tests using :
smartctl --smart=on --offlineauto=on --saveauto=on /dev/disk
Then if you like to do some tests from time to time you just have to specify
a rule in /etc/smartd and restart the smartd service (/etc/init.d/smartd
restart)
For my own use I did the following :
/dev/sda -a -d ata -o on -S on -s (S/../.././02/L/../../06/03) -m
mymail@... -M exec /bin/mail
Every day I do a short test and one long every sunday, it something
occurs it sends me an email.
I find this very usefull.
Regards,

I want to check the state of disks on multiple servers running smartd.
The only information I need is if the disk is failing or not.
After reading smartctl manual, I came to the conclusion that the below
test should be sufficient for IDE disks:
smartctl -a /dev/disk &>/dev/null
echo $?
If the exit code is different than 0, we should suspect a failing disk
and look at the things more carefully.
Am I correct?
Does the smartd run some tests from time to time, or perhaps I should do
it myself, too (i.e., smartctl -t long /dev/disk, via crontab once a day)?
--
Tomasz Chmielewski
http://wpkg.org