Custom Query (1386 matches)

Results (67 - 69 of 1386)

Ticket Resolution Summary Owner Reporter
#793 invalid SMART error (FailedReadSmartSelfTestLog) detected on host: xxx valisann
Description

Hi,

I need your help with an issue related to smartmontools. I have a Linux server (Debian GNU/Linux 8 (jessie)) installed on a Supermicro 2U 6027R-E1R12L with an LSI RAID controller.

root@pm03:~# lspci | egrep -i 'raid|adaptec'
03:00.0 RAID bus controller: LSI Logic / Symbios Logic MegaRAID SAS 2108 [Liberator] (rev 05)

On the LSI controller I have 2 RAID configurations:

  • RAID 1 - 2 SSD - Intel SSD DC S3500 Series 120GB
  • RAID 10 - 7 HDD (6 + 1 spare) - Seagate Constellation ES.3 3.5 2TB SAS

root@pm03:~# smartctl --scan
/dev/sda -d scsi # /dev/sda, SCSI device
/dev/sdb -d scsi # /dev/sdb, SCSI device
/dev/bus/0 -d megaraid,0 # /dev/bus/0 [megaraid_disk_00], SCSI device
/dev/bus/0 -d megaraid,1 # /dev/bus/0 [megaraid_disk_01], SCSI device
/dev/bus/0 -d megaraid,2 # /dev/bus/0 [megaraid_disk_02], SCSI device
/dev/bus/0 -d megaraid,3 # /dev/bus/0 [megaraid_disk_03], SCSI device
/dev/bus/0 -d megaraid,4 # /dev/bus/0 [megaraid_disk_04], SCSI device
/dev/bus/0 -d megaraid,5 # /dev/bus/0 [megaraid_disk_05], SCSI device
/dev/bus/0 -d megaraid,6 # /dev/bus/0 [megaraid_disk_06], SCSI device
/dev/bus/0 -d megaraid,7 # /dev/bus/0 [megaraid_disk_07], SCSI device
/dev/bus/0 -d megaraid,15 # /dev/bus/0 [megaraid_disk_15], SCSI device spare
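
The scan listing above pairs each device path with the `-d` type needed to reach a disk behind the controller. As a rough sketch (not part of the original report), the lines could be turned into per-disk `smartctl -a` invocations like this. The helper names `parse_scan_line` and `smartctl_commands` are hypothetical, and the sample text is copied from the listing:

```python
import shlex


def parse_scan_line(line):
    """Split one `smartctl --scan` line into (device, type), or return None."""
    # Everything after '#' is a human-readable comment.
    command_part = line.split("#", 1)[0]
    tokens = shlex.split(command_part)
    if len(tokens) == 3 and tokens[1] == "-d":
        return tokens[0], tokens[2]
    return None


def smartctl_commands(scan_output):
    """Build one `smartctl -a` argument list per scanned device."""
    commands = []
    for line in scan_output.splitlines():
        parsed = parse_scan_line(line)
        if parsed is not None:
            device, dev_type = parsed
            commands.append(["smartctl", "-a", "-d", dev_type, device])
    return commands


sample = """\
/dev/sda -d scsi # /dev/sda, SCSI device
/dev/bus/0 -d megaraid,15 # /dev/bus/0 [megaraid_disk_15], SCSI device
"""

for cmd in smartctl_commands(sample):
    print(" ".join(cmd))
```

The argument lists are only printed here. Running them (for example with `subprocess.run`) would need root privileges on the affected host.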

Issue: I received an error report, "SMART error (FailedReadSmartSelfTestLog) detected on host: xxx" (see the message logs below).

  • this was a spare disk from the RAID 10 array, with ID 08
This message was generated by the smartd daemon running on:

   host name:  XXXX
   DNS domain: mydomain.int

The following warning/error was logged by the smartd daemon:

Device: /dev/bus/0 [megaraid_disk_08], Read SMART Self-Test Log Failed

Device info:
[SEAGATE  ST2000NM0023     0004], lu id: 0x5000c500628f5dcf, S/N: Z1Y2C8G90000C5124838, 2.00 TB

For details see host's SYSLOG.

You can also use the smartctl utility for further investigation.
Another message will be sent in 24 hours if the problem persists.
  • I contacted the vendor and replaced the disk. After that, the SMART test on the new disk (with ID 15) was OK
  • after 1 day the error was back :(
This message was generated by the smartd daemon running on:

   host name:  pm03
   DNS domain: mercury.int

The following warning/error was logged by the smartd daemon:

Device: /dev/bus/0 [megaraid_disk_15], Read SMART Self-Test Log Failed

Device info:
[SEAGATE  ST2000NM0023     0004], lu id: 0x5000c500845a1abf, S/N: Z1X5VF2R0000C6095M9C, 2.00 TB

For details see host's SYSLOG.

You can also use the smartctl utility for further investigation.
Another message will be sent in 24 hours if the problem persists.

Another problem I noticed after detecting this: I cannot see the spare disk in the "-- Disk information --" section when running megaclisas-status && megasasctl

before: https://drive.google.com/open?id=0B5sUwIIchHUgUlA4UGwyZTZaSjQ

after: https://drive.google.com/open?id=0B5sUwIIchHUgMXBDRzExaXhZOE0

  • the rest of the disks from the RAID 10 array have no SMART issues
  • another log, in case it helps
root@pm03:~# smartctl -a /dev/bus/0 -d megaraid,15
smartctl 6.6 2016-05-31 r4324 [x86_64-linux-4.4.35-1-pve] (local build)
Copyright (C) 2002-16, Bruce Allen, Christian Franke, www.smartmontools.org

=== START OF INFORMATION SECTION ===
Vendor:               SEAGATE
Product:              ST2000NM0023
Revision:             0004
Compliance:           SPC-4
User Capacity:        2,000,398,934,016 bytes [2.00 TB]
Logical block size:   512 bytes
LU is fully provisioned
Rotation Rate:        7200 rpm
Form Factor:          3.5 inches
Logical Unit id:      0x5000c500845a1abf
Serial number:        Z1X5VF2R0000C6095M9C
Device type:          disk

Transport protocol:   SAS (SPL-3)
Local Time is:        Fri Jan 13 13:54:06 2017 EET
SMART support is:     Unavailable - device lacks SMART capability.

=== START OF READ SMART DATA SECTION ===
Current Drive Temperature:     0 C
Drive Trip Temperature:        0 C

Error Counter logging not supported

Device does not support Self Test logging
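
In the output above, smartctl reports "SMART support is: Unavailable" for the disk behind `megaraid,15`, along with zeroed temperatures and no self-test logging. A monitoring script could check for that status line before expecting a self-test log. The following is a minimal sketch with hypothetical helper names. It only inspects text such as the output shown here and does not establish why the controller reports the disk this way:

```python
def smart_support_lines(smartctl_output):
    """Return the values of all 'SMART support is:' lines, in order."""
    prefix = "SMART support is:"
    return [
        line[len(prefix):].strip()
        for line in smartctl_output.splitlines()
        if line.startswith(prefix)
    ]


def smart_unavailable(smartctl_output):
    """True if any 'SMART support is:' line reports 'Unavailable'."""
    return any(
        value.startswith("Unavailable")
        for value in smart_support_lines(smartctl_output)
    )


# Sample lines copied from the smartctl -a output in this ticket.
sample = """\
Transport protocol:   SAS (SPL-3)
SMART support is:     Unavailable - device lacks SMART capability.
Device does not support Self Test logging
"""

if smart_unavailable(sample):
    print("SMART reported unavailable; self-test log reads will likely fail")
```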

megacli            8.07.14-1                 amd64  LSI Logic MegaRAID SAS MegaCLI
megaclisas-status  0.15                      all    get RAID status out of LSI MegaRAID SAS HW RAID controllers
megactl            0.4.1+svn20090725.r6-5    amd64  LSI MegaRAID SCSI/SAS reporting tool
megaraid-status    0.12                      all    get RAID status out of LSI MegaRAID SCSI/SAS HW RAID controllers

Thanks

#22 fixed smartctl segfaults on 3ware Christian Franke valharis
Description

I'm running smartctl from SVN head (revision 2985) on an AMD64 system with Linux kernel 2.6.30-2 (Debian unstable).

Calling smartctl -d 3ware,0 -a /dev/sda results in a Segmentation fault.

Backtrace:
#0 0x000000000042e2de in os_linux::linux_escalade_device::ata_pass_through (this=0x674010, in=..., out=...) at os_linux.cpp:1610
#1 0x000000000040cb99 in smartcommandhandler (device=0x674010, command=IDENTIFY, select=0, data=0x662d40 "\036") at atacmds.cpp:569
#2 0x000000000040e82d in ataReadHDIdentity (device=0x674010, buf=0x662d40) at atacmds.cpp:774
#3 0x0000000000413891 in ataPrintMain (device=0x674010, options=...) at ataprint.cpp:1758
#4 0x000000000040600d in main_worker (argc=5, argv=0x7fffffffe4e8) at smartctl.cpp:949
#5 0x000000000040614c in main (argc=5, argv=0x7fffffffe4e8) at smartctl.cpp:967

Debugging reveals:

passthru is initialized at os_linux.cpp:1471, in the (m_escalade_type==AMCC_3WARE_678K) section. It is then reset to NULL at os_linux.cpp:1576. The subsequent access at os_linux.cpp:1610 dereferences the NULL pointer, which causes the segfault.

#1569 fixed add to drivedb: Sandisk SD9SN8W (SD9SN8W128G1020) Gabriele Pohl Vadmin
Description

Could you add the following disk to the smartctl drive database? Here is the smartctl -a output:

=== START OF INFORMATION SECTION ===
Device Model:     SanDisk SD9SN8W128G1020
Serial Number:    ....
LU WWN Device Id: 5 001b44 8b994b732
Firmware Version: X6101020
User Capacity:    128,035,676,160 bytes [128 GB]
Sector Size:      512 bytes logical/physical
Rotation Rate:    Solid State Device
Form Factor:      M.2
Device is:        Not in smartctl database [for details use: -P showall]
ATA Version is:   ACS-4 T13/BSR INCITS 529 revision 5
SATA Version is:  SATA 3.3, 6.0 Gb/s (current: 6.0 Gb/s)
Local Time is:    Tue Feb 15 01:21:47 2022 PST
SMART support is: Available - device has SMART capability.
SMART support is: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status:  (0x00)	Offline data collection activity
					was never started.
					Auto Offline Data Collection: Disabled.
Self-test execution status:      (   0)	The previous self-test routine completed
					without error or no self-test has ever
					been run.
Total time to complete Offline
data collection: 		(    0) seconds.
Offline data collection
capabilities: 			 (0x11) SMART execute Offline immediate.
					No Auto Offline data collection support.
					Suspend Offline collection upon new
					command.
					No Offline surface scan supported.
					Self-test supported.
					No Conveyance Self-test supported.
					No Selective Self-test supported.
SMART capabilities:            (0x0003)	Saves SMART data before entering
					power-saving mode.
					Supports SMART auto save timer.
Error logging capability:        (0x01)	Error logging supported.
					General Purpose Logging supported.
Short self-test routine
recommended polling time: 	 (   2) minutes.
Extended self-test routine
recommended polling time: 	 (  10) minutes.

SMART Attributes Data Structure revision number: 4
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  5 Reallocated_Sector_Ct   0x0032   100   100   ---    Old_age   Always       -       0
  9 Power_On_Hours          0x0032   100   100   ---    Old_age   Always       -       26536
 12 Power_Cycle_Count       0x0032   100   100   ---    Old_age   Always       -       25
165 Unknown_Attribute       0x0032   100   100   ---    Old_age   Always       -       1356418753
166 Unknown_Attribute       0x0032   100   100   ---    Old_age   Always       -       104
167 Unknown_Attribute       0x0032   100   100   ---    Old_age   Always       -       34
168 Unknown_Attribute       0x0032   100   100   ---    Old_age   Always       -       172
169 Unknown_Attribute       0x0032   100   100   ---    Old_age   Always       -       102
170 Unknown_Attribute       0x0032   100   100   ---    Old_age   Always       -       0
171 Unknown_Attribute       0x0032   100   100   ---    Old_age   Always       -       0
172 Unknown_Attribute       0x0032   100   100   ---    Old_age   Always       -       0
173 Unknown_Attribute       0x0032   100   100   ---    Old_age   Always       -       145
174 Unknown_Attribute       0x0032   100   100   ---    Old_age   Always       -       5
184 End-to-End_Error        0x0032   100   100   ---    Old_age   Always       -       0
187 Reported_Uncorrect      0x0032   100   100   ---    Old_age   Always       -       0
188 Command_Timeout         0x0032   100   100   ---    Old_age   Always       -       0
194 Temperature_Celsius     0x0022   064   070   ---    Old_age   Always       -       36 (Min/Max 27/70)
199 UDMA_CRC_Error_Count    0x0032   100   100   ---    Old_age   Always       -       0
230 Unknown_SSD_Attribute   0x0032   100   100   ---    Old_age   Always       -       44118142232608
232 Available_Reservd_Space 0x0033   100   100   004    Pre-fail  Always       -       100
233 Media_Wearout_Indicator 0x0032   100   100   ---    Old_age   Always       -       18443
234 Unknown_Attribute       0x0032   100   100   ---    Old_age   Always       -       56778
241 Total_LBAs_Written      0x0030   253   253   ---    Old_age   Offline      -       40598
242 Total_LBAs_Read         0x0030   253   253   ---    Old_age   Offline      -       39691
244 Unknown_Attribute       0x0032   000   100   ---    Old_age   Always       -       0

SMART Error Log Version: 1
No Errors Logged

SMART Self-test log structure revision number 1
No self-tests have been logged.  [To run self-tests, use: smartctl -t]

Selective Self-tests/Logging not supported
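
For a drivedb request like this one, it can help to list which attribute IDs smartctl currently labels as unknown. The sketch below is illustrative only (the function names are hypothetical). It splits each row of the attribute table shown above into its ten columns and collects the IDs whose name starts with "Unknown". It does not guess what those attributes mean:

```python
def parse_attributes(smartctl_output):
    """Parse rows of the 'Vendor Specific SMART Attributes' table."""
    rows = []
    for line in smartctl_output.splitlines():
        # Ten columns; the last (RAW_VALUE) may itself contain spaces.
        fields = line.split(None, 9)
        if len(fields) == 10 and fields[0].isdigit() and fields[2].startswith("0x"):
            rows.append(
                {"id": int(fields[0]), "name": fields[1], "raw": fields[9].strip()}
            )
    return rows


def unknown_attribute_ids(smartctl_output):
    """IDs of attributes that smartctl printed with an 'Unknown...' name."""
    return [
        row["id"]
        for row in parse_attributes(smartctl_output)
        if row["name"].startswith("Unknown")
    ]


# A few rows copied from the table in this ticket.
sample = """\
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  5 Reallocated_Sector_Ct   0x0032   100   100   ---    Old_age   Always       -       0
165 Unknown_Attribute       0x0032   100   100   ---    Old_age   Always       -       1356418753
194 Temperature_Celsius     0x0022   064   070   ---    Old_age   Always       -       36 (Min/Max 27/70)
230 Unknown_SSD_Attribute   0x0032   100   100   ---    Old_age   Always       -       44118142232608
"""

print(unknown_attribute_ids(sample))
```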