$ sudo apt-get install smartmontools
$ sudo smartctl -d cciss,0 -a /dev/cciss/c0d0
smartctl version 5.38 [x86_64-unknown-linux-gnu] Copyright (C) 2002-8 Bruce Allen
Home page is http://smartmontools.sourceforge.net/

Serial number: 9SF0NFRR
Device type: disk
Local Time is: Tue Nov 23 13:07:05 2010 CET
Device supports SMART and is Enabled
Temperature Warning Disabled or Not Supported
Log Sense failed, IE page [scsi response fails sanity test]
Read defect list: asked for grown list but didn't get it
Error Counter logging not supported
Device does not support Self Test logging
$ sudo smartctl -s on -a /dev/hda
If the following message appears:
In Linux, SATA disks accessed via libata are only supported by smartmontools for kernel versions 2.6.15 and above. Try an additional '-d ata' argument.
you need to run:
$ sudo smartctl -a -d ata /dev/sda
Otherwise, you can simply run:
$ sudo smartctl -a /dev/sda
smartctl version 5.38 [i686-pc-linux-gnu] Copyright (C) 2002-8 Bruce Allen
Home page is http://smartmontools.sourceforge.net/

=== START OF INFORMATION SECTION ===
Device Model:     ST3320418AS
Serial Number:    6VM71J1C
Firmware Version: CC44
User Capacity:    320,072,933,376 bytes
Device is:        Not in smartctl database [for details use: -P showall]
ATA Version is:   8
ATA Standard is:  ATA-8-ACS revision 4
Local Time is:    Tue Nov 23 12:50:32 2010 CET
SMART support is: Available - device has SMART capability.
SMART support is: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status:  (0x82) Offline data collection activity
                                        was completed without error.
                                        Auto Offline Data Collection: Enabled.
Self-test execution status:      (   0) The previous self-test routine completed
                                        without error or no self-test has ever
                                        been run.
Total time to complete Offline
data collection:                 ( 617) seconds.
Offline data collection
capabilities:                    (0x7b) SMART execute Offline immediate.
                                        Auto Offline data collection on/off support.
                                        Suspend Offline collection upon new
                                        command.
                                        Offline surface scan supported.
                                        Self-test supported.
                                        Conveyance Self-test supported.
                                        Selective Self-test supported.
SMART capabilities:            (0x0003) Saves SMART data before entering
                                        power-saving mode.
                                        Supports SMART auto save timer.
Error logging capability:        (0x01) Error logging supported.
                                        General Purpose Logging supported.
Short self-test routine
recommended polling time:        (   1) minutes.
Extended self-test routine
recommended polling time:        (  74) minutes.
Conveyance self-test routine
recommended polling time:        (   2) minutes.
SCT capabilities:              (0x103f) SCT Status supported.
                                        SCT Feature Control supported.
                                        SCT Data Table supported.

SMART Attributes Data Structure revision number: 10
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x000f   117   099   006    Pre-fail  Always       -       129162381
  3 Spin_Up_Time            0x0003   097   097   000    Pre-fail  Always       -       0
  4 Start_Stop_Count        0x0032   100   100   020    Old_age   Always       -       149
  5 Reallocated_Sector_Ct   0x0033   100   100   036    Pre-fail  Always       -       0
  7 Seek_Error_Rate         0x000f   051   050   030    Pre-fail  Always       -       94492314816
  9 Power_On_Hours          0x0032   100   100   000    Old_age   Always       -       264
 10 Spin_Retry_Count        0x0013   100   100   097    Pre-fail  Always       -       0
 12 Power_Cycle_Count       0x0032   100   100   020    Old_age   Always       -       150
183 Unknown_Attribute       0x0000   100   100   000    Old_age   Offline      -       0
184 Unknown_Attribute       0x0032   100   100   099    Old_age   Always       -       0
187 Reported_Uncorrect      0x0032   100   100   000    Old_age   Always       -       0
188 Unknown_Attribute       0x0032   100   097   000    Old_age   Always       -       171
189 High_Fly_Writes         0x003a   100   100   000    Old_age   Always       -       0
190 Airflow_Temperature_Cel 0x0022   067   059   045    Old_age   Always       -       33 (Lifetime Min/Max 24/33)
194 Temperature_Celsius     0x0022   033   041   000    Old_age   Always       -       33 (0 18 0 0)
195 Hardware_ECC_Recovered  0x001a   048   040   000    Old_age   Always       -       129162381
197 Current_Pending_Sector  0x0012   100   100   000    Old_age   Always       -       0
198 Offline_Uncorrectable   0x0010   100   100   000    Old_age   Offline      -       0
199 UDMA_CRC_Error_Count    0x003e   200   200   000    Old_age   Always       -       0
240 Head_Flying_Hours       0x0000   100   253   000    Old_age   Offline      -       238357800026759
241 Unknown_Attribute       0x0000   100   253   000    Old_age   Offline      -       534473308
242 Unknown_Attribute       0x0000   100   253   000    Old_age   Offline      -       379862196

SMART Error Log Version: 1
No Errors Logged

SMART Self-test log structure revision number 1
No self-tests have been logged.  [To run self-tests, use: smartctl -t]

SMART Selective self-test log data structure revision number 1
 SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
    1        0        0  Not_testing
    2        0        0  Not_testing
    3        0        0  Not_testing
    4        0        0  Not_testing
    5        0        0  Not_testing
Selective self-test flags (0x0):
  After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.
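The attribute table in the output above is whitespace-separated, with RAW_VALUE as the tenth column, so a single value can be pulled out with awk. The following is a minimal sketch, not part of smartmontools: the function name smart_attr_raw is hypothetical, and it assumes the ten-column layout shown above.

```shell
#!/bin/sh
# Hypothetical helper (not shipped with smartmontools).
# Reads a smartctl attribute table on stdin and prints the first
# token of RAW_VALUE (column 10) for the attribute named in $1.
# Assumes the column layout shown in the sample output:
# ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE
smart_attr_raw() {
    awk -v name="$1" '$2 == name { print $10; exit }'
}
```

A possible invocation would be: sudo smartctl -A /dev/sda | smart_attr_raw Reallocated_Sector_Ct. Nothing is printed if the attribute is not present in the table.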
To check directly whether the disk is healthy or not:
$ sudo smartctl -a -d ata /dev/sdc | grep "overall-health"
SMART overall-health self-assessment test result: PASSED
The above is an example of a healthy disk. Here is an example of a failing one:
$ sudo smartctl -a -d ata /dev/sda | grep "overall-health"
SMART overall-health self-assessment test result: FAILED!
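For use in scripts, the grep above can be turned into an exit status. This is a minimal sketch under stated assumptions: the function name smart_health is hypothetical, and it only recognises the "overall-health" line format shown in the two examples above.

```shell
#!/bin/sh
# Hypothetical helper (not shipped with smartmontools).
# Reads smartctl output on stdin, prints PASSED, FAILED or UNKNOWN,
# and returns 0 only when the health line reports PASSED.
smart_health() {
    # Take the text after ": " on the overall-health line, if any.
    result=$(awk -F': ' '/overall-health/ { print $2; exit }')
    case "$result" in
        PASSED*) echo PASSED;  return 0 ;;
        FAILED*) echo FAILED;  return 1 ;;
        *)       echo UNKNOWN; return 2 ;;
    esac
}
```

A possible invocation would be: sudo smartctl -H /dev/sda | smart_health || echo "check /dev/sda". The -H option asks smartctl for the health assessment only, so the full -a report is not needed here.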
smartd is a daemon that actively monitors hard disks, so you do not need to run smartctl manually.
Edit the smartd configuration file:
$ sudo joe /etc/smartd.conf
NOTE: The default options will most likely suit you, and you will not need to change anything in this file.
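If you do want to go beyond the defaults, an explicit per-device entry might look like the fragment below. It is an illustrative sketch modelled on the examples in the smartd.conf documentation, not a recommended setting: the device name, the self-test schedule and the mail recipient are all assumptions to adapt.

```
# Assumption: any DEVICESCAN line is commented out first, because
# smartd ignores the entries that follow it.
# -a      monitor all SMART properties
# -o on   enable automatic offline testing
# -S on   enable attribute autosave
# -s ...  short self-test every day at 02:00, long self-test on Saturdays at 03:00
# -m root mail warnings to root
/dev/sda -a -o on -S on -s (S/../.././02|L/../../6/03) -m root
```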
Finally, the daemon must be set to start by default. Edit the file /etc/default/smartmontools:
$ sudo joe /etc/default/smartmontools
And uncomment the lines:
start_smartd=yes
and
smartd_opts="--interval=1800"
Now apply the changes by restarting the smartd service.
You can check that everything is working in the syslog:
$ sudo grep smartd /var/log/syslog
Aug 18 13:17:59 proxmox01 smartd[798170]: smartd 5.40 2010-07-12 r3124 [x86_64-unknown-linux-gnu] (local build)#012Copyright (C) 2002-10 by Bruce Allen, http://smartmontools.sourceforge.net#012
Aug 18 13:17:59 proxmox01 smartd[798170]: Opened configuration file /etc/smartd.conf
Aug 18 13:17:59 proxmox01 smartd[798170]: Drive: DEVICESCAN, implied '-a' Directive on line 22 of file /etc/smartd.conf
Aug 18 13:17:59 proxmox01 smartd[798170]: Configuration file /etc/smartd.conf was parsed, found DEVICESCAN, scanning devices
Aug 18 13:17:59 proxmox01 smartd[798170]: Device: /dev/sda, type changed from 'scsi' to 'sat'
Aug 18 13:17:59 proxmox01 smartd[798170]: Device: /dev/sda [SAT], opened
Aug 18 13:17:59 proxmox01 smartd[798170]: Device: /dev/sda [SAT], not found in smartd database.
Aug 18 13:17:59 proxmox01 smartd[798170]: Device: /dev/sda [SAT], can't monitor Current Pending Sector count - no Attribute 197
Aug 18 13:17:59 proxmox01 smartd[798170]: Device: /dev/sda [SAT], can't monitor Offline Uncorrectable Sector count - no Attribute 198
Aug 18 13:17:59 proxmox01 smartd[798170]: Device: /dev/sda [SAT], is SMART capable. Adding to "monitor" list.
Aug 18 13:17:59 proxmox01 smartd[798170]: Device: /dev/sdb, type changed from 'scsi' to 'sat'
Aug 18 13:17:59 proxmox01 smartd[798170]: Device: /dev/sdb [SAT], opened
Aug 18 13:17:59 proxmox01 smartd[798170]: Device: /dev/sdb [SAT], not found in smartd database.
Aug 18 13:17:59 proxmox01 smartd[798170]: Device: /dev/sdb [SAT], is SMART capable. Adding to "monitor" list.
Aug 18 13:17:59 proxmox01 smartd[798170]: Device: /dev/sdc, type changed from 'scsi' to 'sat'
Aug 18 13:17:59 proxmox01 smartd[798170]: Device: /dev/sdc [SAT], opened
Aug 18 13:17:59 proxmox01 smartd[798170]: Device: /dev/sdc [SAT], not found in smartd database.
Aug 18 13:18:00 proxmox01 smartd[798170]: Device: /dev/sdc [SAT], is SMART capable. Adding to "monitor" list.
Aug 18 13:18:00 proxmox01 smartd[798170]: Monitoring 3 ATA and 0 SCSI devices
Aug 18 13:18:00 proxmox01 smartd[798170]: Device: /dev/sda [SAT], state written to /var/lib/smartmontools/smartd.INTEL_SSDSC2CT120A3-CVMP215500V9120BGN.ata.state
Aug 18 13:18:00 proxmox01 smartd[798170]: Device: /dev/sdb [SAT], state written to /var/lib/smartmontools/smartd.WDC_WD2003FYYS_02W0B1-WD_WMAY04708779.ata.state
Aug 18 13:18:00 proxmox01 smartd[798170]: Device: /dev/sdc [SAT], state written to /var/lib/smartmontools/smartd.WDC_WD2003FYYS_02W0B1-WD_WMAY04728754.ata.state
Aug 18 13:18:00 proxmox01 smartd[798172]: smartd has fork()ed into background mode. New PID=798172.
Aug 18 13:18:00 proxmox01 smartd[798172]: file /var/run/smartd.pid written containing PID 798172
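In a long log it can be quicker to list only the devices that smartd accepted for monitoring. The sketch below is one way to do that, assuming the 'Adding to "monitor" list' message format shown in the log above; the function name smartd_monitored is hypothetical.

```shell
#!/bin/sh
# Hypothetical helper (not shipped with smartmontools).
# Reads syslog text on stdin and prints, one per line, each device
# that smartd reported adding to its monitor list.
smartd_monitored() {
    # Capture the device path that follows "Device: " on matching lines.
    sed -n 's/.*Device: \([^ ,]*\).*Adding to "monitor" list.*/\1/p' | sort -u
}
```

A possible invocation would be: sudo grep smartd /var/log/syslog | smartd_monitored. For the log above this should list /dev/sda, /dev/sdb and /dev/sdc, matching the "Monitoring 3 ATA and 0 SCSI devices" line.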