Linux disk health

5/10/2023

Linux disk health

Read Now

Linux / UNIX: Smartctl Check Hard Disk Behind 3Ware RAID Card.Test If Linux Server SCSI / SATA / SSD Hard Disk Going Bad.This entry is 4 of 9 in the smartctl (smartd) Tutorial series. Read man page of smartd and nf for configuration help.More information on the smarttool see official home page.(c) Testing hard disk (short or long test):Ĭaution smartd is a monitoring tool not a backup solution. You can put following directives in Smart Configuration file: Start/Stop smart: /usr/local/etc/rc.d/smartd.sh start | stop.Enable smart by editing /etc/rc.conf file (add line smartd_enable=â€YES”).Start/Stop smart: /etc/init.d/smartd start | stop.Start/Stop smart: /etc/init.d/smartmontools start | stop.Enable smart by editing /etc/default/smartmontools file.Indicates that it is SMART capable and it is enabled. SMART support is: Available – device has SMART capability. You can configure the smartd daemon by editing the file /etc/nf. SMART support is: Available - device has SMART capability. Output: smartctl version 5.34 Copyright (C) 2002-5 Bruce Allenĭevice is: In smartctl database Following are distribution specific steps to install smartd:īefore configuring hard disk for SMART monitoring make sure your hard disk is SMART capable: However, smartd is not installed by default. With spinning disks, there are 2 important lines which are ‘Reallocated_Sector_Ct’ and ‘Current_Pending_Sector’ count, the reallocated sector count means you have bad sectors but the disk has dealt with them fine, this is not terrible if the count is either low <1000 or static and not growing over time if however you have Current_Pending_Sectors for more than 24 hours that means the disk has issues and it cannot deal with them itself, this is critical.Smartd works with following operating systems: You can then check the disk health with: #smatctl -a /dev/sda With spinning disks let us again assume that the disk is /dev/sda you can run a short or a long test as follows: #smartctl -t short -a /dev/sda (This is a short self test) To see the overall health you can check the ‘Wear_leveling_count’ line, this starts at 100 and reduces down to 0, again this is simply a vendor estimate of the % of the remaining life of the disk if you are getting filesystem issues and slow performance then the disk may be at the end of its best days.

You are then looking for any specific errors logged at the end which will be very obvious and titled as errors, if you find these please copy and paste them into a ticket for us to review. #smartctl -t long -a /dev/sda (This is a long self test)

With SSD’s you can use the regular smart tests, assuming that the disk is /dev/sda you can run a short or a long test as follows: #smartctl -t short -a /dev/sda (This is a short self test) It is however VERY important to keep in mind that you can be at 100% and still see no issues with the disk, this is simply a vendor estimate of the % of the remaining life of the disk if you are getting filesystem issues and slow performance then the disk may be at the end of its best days. The closer to 100% you get the more your disk is close to the end of its predicted life. Now we know the disk is nvme0 we can check its health: Node SN Model Namespace Usage Format FW Rev You will see a list of NVMe disks: # nvme list If you do not have this installed use your package manager to install ‘nvme-cli’, you will also need ‘smartmontools’ It is a good idea to do regular checks of your server disks, however with a number of disk types now, Spinning HDD’s SSD’s and NVMe disks there are different ways to check them, this guide is intended to give you indications only it is not black and white between good and bad disks.

0 Comments

Linux disk health

Leave a Reply.

Author

Archives

Categories