Custom Query (1365 matches)

Results (193 - 195 of 1365)

Ticket Owner Reporter Resolution Summary
#793 valisann invalid SMART error (FailedReadSmartSelfTestLog) detected on host: xxx
Description

Hi,

I need your help with an issue involving smartmontools. I have a Linux server (Debian GNU/Linux 8 "jessie") installed on a Supermicro 2U 6027R-E1R12L with an LSI RAID controller.

root@pm03:~# lspci | egrep -i 'raid|adaptec'
03:00.0 RAID bus controller: LSI Logic / Symbios Logic MegaRAID SAS 2108 [Liberator] (rev 05)

On the LSI controller I have two RAID configurations:

  • RAID 1 - 2 SSD - Intel SSD DC S3500 Series 120GB
  • RAID 10 - 7 HDD (6 + 1 spare) - Seagate Constellation ES.3 3.5 2TB SAS

root@pm03:~# smartctl --scan
/dev/sda -d scsi # /dev/sda, SCSI device
/dev/sdb -d scsi # /dev/sdb, SCSI device
/dev/bus/0 -d megaraid,0 # /dev/bus/0 [megaraid_disk_00], SCSI device
/dev/bus/0 -d megaraid,1 # /dev/bus/0 [megaraid_disk_01], SCSI device
/dev/bus/0 -d megaraid,2 # /dev/bus/0 [megaraid_disk_02], SCSI device
/dev/bus/0 -d megaraid,3 # /dev/bus/0 [megaraid_disk_03], SCSI device
/dev/bus/0 -d megaraid,4 # /dev/bus/0 [megaraid_disk_04], SCSI device
/dev/bus/0 -d megaraid,5 # /dev/bus/0 [megaraid_disk_05], SCSI device
/dev/bus/0 -d megaraid,6 # /dev/bus/0 [megaraid_disk_06], SCSI device
/dev/bus/0 -d megaraid,7 # /dev/bus/0 [megaraid_disk_07], SCSI device
/dev/bus/0 -d megaraid,15 # /dev/bus/0 [megaraid_disk_15], SCSI device spare
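
When checking which controller slots smartctl can address, it can help to reduce the scan listing above to a plain list of megaraid IDs. The following is a minimal Python sketch, not part of smartmontools; the helper names parse_scan and megaraid_ids are made up for illustration, and the sample text is a shortened copy of the listing above.

```python
import re

# Matches scan lines such as:
#   /dev/bus/0 -d megaraid,15 # /dev/bus/0 [megaraid_disk_15], SCSI device
SCAN_LINE = re.compile(r"^(?P<dev>\S+)\s+-d\s+(?P<type>\S+)\s+#")


def parse_scan(text):
    """Return (device, type) pairs found in `smartctl --scan` output."""
    pairs = []
    for line in text.splitlines():
        m = SCAN_LINE.match(line.strip())
        if m:
            pairs.append((m.group("dev"), m.group("type")))
    return pairs


def megaraid_ids(pairs):
    """Extract the numeric IDs of all `megaraid,N` device types."""
    ids = []
    for _dev, dtype in pairs:
        if dtype.startswith("megaraid,"):
            ids.append(int(dtype.split(",", 1)[1]))
    return ids


# Shortened copy of the listing above (illustrative sample only).
sample = """\
/dev/sda -d scsi # /dev/sda, SCSI device
/dev/bus/0 -d megaraid,0 # /dev/bus/0 [megaraid_disk_00], SCSI device
/dev/bus/0 -d megaraid,15 # /dev/bus/0 [megaraid_disk_15], SCSI device
"""

if __name__ == "__main__":
    print(megaraid_ids(parse_scan(sample)))
```

Applied to the full listing above, this would yield IDs 0 through 7 plus 15; ID 08 does not appear in that listing.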

Issue: I received the error "SMART error (FailedReadSmartSelfTestLog) detected on host: xxx" (see the message logs below).

  • this was a spare disk from the RAID 10, with ID 08
This message was generated by the smartd daemon running on:

   host name:  XXXX
   DNS domain: mydomain.int

The following warning/error was logged by the smartd daemon:

Device: /dev/bus/0 [megaraid_disk_08], Read SMART Self-Test Log Failed

Device info:
[SEAGATE  ST2000NM0023     0004], lu id: 0x5000c500628f5dcf, S/N: Z1Y2C8G90000C5124838, 2.00 TB

For details see host's SYSLOG.

You can also use the smartctl utility for further investigation.
Another message will be sent in 24 hours if the problem persists.
  • I contacted the vendor and replaced the disk. After that, the SMART test on the new disk (with ID 15) was OK
  • after 1 day the error is back :(
This message was generated by the smartd daemon running on:

   host name:  pm03
   DNS domain: mercury.int

The following warning/error was logged by the smartd daemon:

Device: /dev/bus/0 [megaraid_disk_15], Read SMART Self-Test Log Failed

Device info:
[SEAGATE  ST2000NM0023     0004], lu id: 0x5000c500845a1abf, S/N: Z1X5VF2R0000C6095M9C, 2.00 TB

For details see host's SYSLOG.

You can also use the smartctl utility for further investigation.
Another message will be sent in 24 hours if the problem persists.

Another problem I noticed after detecting this is that I cannot see the spare disk under -- Disk information -- when running megaclisas-status && megasasctl

before https://drive.google.com/open?id=0B5sUwIIchHUgUlA4UGwyZTZaSjQ

after https://drive.google.com/open?id=0B5sUwIIchHUgMXBDRzExaXhZOE0

  • the rest of the disks from the RAID 10 don't have SMART issues
  • another log, in case it helps
root@pm03:~# smartctl -a /dev/bus/0 -d megaraid,15
smartctl 6.6 2016-05-31 r4324 [x86_64-linux-4.4.35-1-pve] (local build)
Copyright (C) 2002-16, Bruce Allen, Christian Franke, www.smartmontools.org

=== START OF INFORMATION SECTION ===
Vendor:               SEAGATE
Product:              ST2000NM0023
Revision:             0004
Compliance:           SPC-4
User Capacity:        2,000,398,934,016 bytes [2.00 TB]
Logical block size:   512 bytes
LU is fully provisioned
Rotation Rate:        7200 rpm
Form Factor:          3.5 inches
Logical Unit id:      0x5000c500845a1abf
Serial number:        Z1X5VF2R0000C6095M9C
Device type:          disk

Transport protocol:   SAS (SPL-3)
Local Time is:        Fri Jan 13 13:54:06 2017 EET
SMART support is:     Unavailable - device lacks SMART capability.

=== START OF READ SMART DATA SECTION ===
Current Drive Temperature:     0 C
Drive Trip Temperature:        0 C

Error Counter logging not supported

Device does not support Self Test logging
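
A monitoring script could flag the condition shown above, where smartctl reports "SMART support is: Unavailable" for a disk behind the controller. The sketch below is a hypothetical Python helper (parse_info and smart_unavailable are illustrative names, not smartmontools functions) that only inspects text already printed by smartctl -a; the sample is a shortened copy of the output above.

```python
def parse_info(text):
    """Collect 'Key: value' lines from smartctl output into a dict.

    Values are stored in lists because some keys, such as
    'SMART support is', can appear more than once.
    """
    info = {}
    for line in text.splitlines():
        if line.startswith("==="):
            continue  # skip section banners
        key, sep, value = line.partition(":")
        if sep and value.strip():
            info.setdefault(key.strip(), []).append(value.strip())
    return info


def smart_unavailable(info):
    """True if any 'SMART support is' line starts with 'Unavailable'."""
    values = info.get("SMART support is", [])
    return any(v.startswith("Unavailable") for v in values)


# Shortened copy of the output above (illustrative sample only).
sample = """\
=== START OF INFORMATION SECTION ===
Vendor:               SEAGATE
Product:              ST2000NM0023
Local Time is:        Fri Jan 13 13:54:06 2017 EET
SMART support is:     Unavailable - device lacks SMART capability.
"""

if __name__ == "__main__":
    info = parse_info(sample)
    print(info["Product"], smart_unavailable(info))
```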

megacli 8.07.14-1 amd64 LSI Logic MegaRAID SAS MegaCLI
megaclisas-status 0.15 all get RAID status out of LSI MegaRAID SAS HW RAID controllers
megactl 0.4.1+svn20090725.r6-5 amd64 LSI MegaRAID SCSI/SAS reporting tool
megaraid-status 0.12 all get RAID status out of LSI MegaRAID SCSI/SAS HW RAID controllers

Thanks

#800 Simone Giordano wontfix "can't get bus number" issue with MegaRAID on ESXi
Description

There is an issue using smartctl on ESXi to monitor disks behind the RAID. Example:

smartctl -a /dev/disks/naa.6c81f660d2aeab001fd4153f9ba416c5 -d sat+megaraid,12

Smartctl open device: /dev/disks/naa.6c81f660d2aeab001fd4153f9ba416c5 [megaraid_disk_12] [SAT] failed: can't get bus number

I've compiled a static version of smartctl from updated sources (6.6 r4384) and the issue still exists. Because ESXi differs from a normal Linux distribution, I've tried patching os_linux.cpp to force linux_megaraid_device::open to use the right device:

  if ((m_fd = ::open("/dev/megaraid_sas_ioctl", O_RDWR)) >= 0) {
    m_hba = 1;  // ?
    pt_cmd = &linux_megaraid_device::megasas_cmd;
    set_fd(m_fd);
    return true; 
  }

After this patch, the device is opened but I get "INQUIRY FAILED".

On ESXi the MegaCli utility works correctly, so I think there are no issues with driver or ioctl support.

I can run any test you want or apply a particular patch.

It is important to monitor disks behind the RAID controller because the SMART indicators reported by the controller are very poor.

Thank you. Simone

#804 Christian Franke Zulkefli Aris fixed CSMI ports != 0 no longer accessible after an update of IRST driver to 15.2
Description
C:\>smartctl --scan
/dev/sda -d scsi # /dev/sda, SCSI device
/dev/csmi2,0 -d ata # /dev/csmi2,0, ATA device

C:\>smartctl -a  /dev/sda
smartctl 6.5 2016-05-07 r4318 [i686-w64-mingw32-win7(64)-sp1] (sf-6.5-1)
Copyright (C) 2002-16, Bruce Allen, Christian Franke, www.smartmontools.org

=== START OF INFORMATION SECTION ===
Vendor:               Intel
Product:              Raid 1 Volume
Revision:             1.0.
User Capacity:        500,105,547,264 bytes [500 GB]
Logical block size:   512 bytes
LU is fully provisioned
Rotation Rate:        5400 rpm
Logical Unit id:      0x785a2d5700000000001517ffff0aeb84
Serial number:        OutletSVR
Device type:          disk
Local Time is:        Wed Feb 08 09:05:58 2017 MPST
SMART support is:     Available - device has SMART capability.
SMART support is:     Enabled
Temperature Warning:  Disabled or Not Supported

=== START OF READ SMART DATA SECTION ===
SMART Health Status: OK
Current Drive Temperature:     0 C
Drive Trip Temperature:        0 C

Error Counter logging not supported

Device does not support Self Test logging


C:\>smartctl -a /dev/csmi2,0 -d ata
smartctl 6.5 2016-05-07 r4318 [i686-w64-mingw32-win7(64)-sp1] (sf-6.5-1)
Copyright (C) 2002-16, Bruce Allen, Christian Franke, www.smartmontools.org

=== START OF INFORMATION SECTION ===
Model Family:     Toshiba 2.5" HDD MQ01ABF...
Device Model:     TOSHIBA MQ01ABF050
Serial Number:    66R8P3FJT
LU WWN Device Id: 5 000039 71250ad18
Firmware Version: AM0P2A
User Capacity:    500,107,862,016 bytes [500 GB]
Sector Sizes:     512 bytes logical, 4096 bytes physical
Rotation Rate:    5400 rpm
Form Factor:      2.5 inches
Device is:        In smartctl database [for details use: -P show]
ATA Version is:   ATA8-ACS (minor revision not indicated)
SATA Version is:  SATA 3.0, 6.0 Gb/s (current: 6.0 Gb/s)
Local Time is:    Wed Feb 08 09:07:19 2017 MPST
SMART support is: Available - device has SMART capability.
SMART support is: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status:  (0x00) Offline data collection activity
                                        was never started.
                                        Auto Offline Data Collection: Disabled.
Self-test execution status:      (   0) The previous self-test routine completed
                                        without error or no self-test has ever
                                        been run.
Total time to complete Offline
data collection:                (  120) seconds.
Offline data collection
capabilities:                    (0x5b) SMART execute Offline immediate.
                                        Auto Offline data collection on/off support.
                                        Suspend Offline collection upon new
                                        command.
                                        Offline surface scan supported.
                                        Self-test supported.
                                        No Conveyance Self-test supported.
                                        Selective Self-test supported.
SMART capabilities:            (0x0003) Saves SMART data before entering
                                        power-saving mode.
                                        Supports SMART auto save timer.
Error logging capability:        (0x01) Error logging supported.
                                        General Purpose Logging supported.
Short self-test routine
recommended polling time:        (   2) minutes.
Extended self-test routine
recommended polling time:        ( 123) minutes.
SCT capabilities:              (0x003d) SCT Status supported.
                                        SCT Error Recovery Control supported.
                                        SCT Feature Control supported.
                                        SCT Data Table supported.

SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x000b   100   100   050    Pre-fail  Always       -       0
  2 Throughput_Performance  0x0005   100   100   050    Pre-fail  Offline      -       0
  3 Spin_Up_Time            0x0027   100   100   001    Pre-fail  Always       -       1394
  4 Start_Stop_Count        0x0032   100   100   000    Old_age   Always       -       17
  5 Reallocated_Sector_Ct   0x0033   100   100   050    Pre-fail  Always       -       0
  7 Seek_Error_Rate         0x000b   100   100   050    Pre-fail  Always       -       0
  8 Seek_Time_Performance   0x0005   100   100   050    Pre-fail  Offline      -       0
  9 Power_On_Hours          0x0032   100   100   000    Old_age   Always       -       14
 10 Spin_Retry_Count        0x0033   100   100   030    Pre-fail  Always       -       0
 12 Power_Cycle_Count       0x0032   100   100   000    Old_age   Always       -       16
191 G-Sense_Error_Rate      0x0032   100   100   000    Old_age   Always       -       0
192 Power-Off_Retract_Count 0x0032   100   100   000    Old_age   Always       -       4
193 Load_Cycle_Count        0x0032   100   100   000    Old_age   Always       -       164
194 Temperature_Celsius     0x0022   100   100   000    Old_age   Always       -       28 (Min/Max 21/33)
196 Reallocated_Event_Count 0x0032   100   100   000    Old_age   Always       -       0
197 Current_Pending_Sector  0x0032   100   100   000    Old_age   Always       -       0
198 Offline_Uncorrectable   0x0030   100   100   000    Old_age   Offline      -       0
199 UDMA_CRC_Error_Count    0x0032   200   200   000    Old_age   Always       -       0
220 Disk_Shift              0x0002   100   100   000    Old_age   Always       -       0
222 Loaded_Hours            0x0032   100   100   000    Old_age   Always       -       13
223 Load_Retry_Count        0x0032   100   100   000    Old_age   Always       -       0
224 Load_Friction           0x0022   100   100   000    Old_age   Always       -       0
226 Load-in_Time            0x0026   100   100   000    Old_age   Always       -       250
240 Head_Flying_Hours       0x0001   100   100   001    Pre-fail  Offline      -       0

SMART Error Log Version: 1
No Errors Logged

SMART Self-test log structure revision number 1
No self-tests have been logged.  [To run self-tests, use: smartctl -t]

SMART Selective self-test log data structure revision number 1
 SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
    1        0        0  Not_testing
    2        0        0  Not_testing
    3        0        0  Not_testing
    4        0        0  Not_testing
    5        0        0  Not_testing
Selective self-test flags (0x0):
  After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.


C:\>smartctl -r ioctl,2 -d csmi --scan
 \\.\Scsi0:: successfully opened
  IOCTL_SCSI_MINIPORT(CC_CSMI_1) failed, Error=1
 \\.\Scsi1:: successfully opened
  IOCTL_SCSI_MINIPORT(CC_CSMI_1) failed, Error=1
 \\.\Scsi2:: successfully opened
  IOCTL_SCSI_MINIPORT(CC_CSMI_1) succeeded, bytes returned: 204
CSMI_SAS_DRIVER_INFO:
  Name:        "iaStorA"
  Description: "Intel(R) Rapid Storage Technology "
  Revision:    15.2
  IOCTL_SCSI_MINIPORT(CC_CSMI_20) succeeded, bytes returned: 2080
CSMI_SAS_PHY_INFO: NumberOfPhys=6
Phy[0] Port:   0x00
  Type:        0x10, 0x10
  InitProto:   0x01, 0x00
  TargetProto: 0x00, 0x01
  PhyIdent:    0x00, 0x00
  SASAddress:  00 00 00 00 00 00 00 00, 00 00 00 00 00 00 00 00
Phy[1] Port:   0x00
  Type:        0x10, 0x10
  InitProto:   0x01, 0x00
  TargetProto: 0x00, 0x01
  PhyIdent:    0x01, 0x01
  SASAddress:  00 00 00 00 00 00 00 00, 00 01 00 00 00 00 00 00
Phy[2] Port:   0x00
  Type:        0x10, 0x00
  InitProto:   0x01, 0x00
  TargetProto: 0x00, 0x01
  PhyIdent:    0x02, 0x02
  SASAddress:  00 00 00 00 00 00 00 00, 00 02 00 00 00 00 00 00
Phy[3] Port:   0x00
  Type:        0x10, 0x00
  InitProto:   0x01, 0x00
  TargetProto: 0x00, 0x01
  PhyIdent:    0x03, 0x03
  SASAddress:  00 00 00 00 00 00 00 00, 00 03 00 00 00 00 00 00
Phy[4] Port:   0x00
  Type:        0x10, 0x00
  InitProto:   0x01, 0x00
  TargetProto: 0x00, 0x01
  PhyIdent:    0x04, 0x04
  SASAddress:  00 00 00 00 00 00 00 00, 00 04 00 00 00 00 00 00
Phy[5] Port:   0x00
  Type:        0x10, 0x00
  InitProto:   0x01, 0x00
  TargetProto: 0x00, 0x01
  PhyIdent:    0x05, 0x05
  SASAddress:  00 00 00 00 00 00 00 00, 00 05 00 00 00 00 00 00
/dev/csmi2,0 -d ata # /dev/csmi2,0, ATA device
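
The ticket title refers to CSMI ports other than 0, while every Phy entry in the debug output above reports Port: 0x00. One way to see this at a glance is to group phys by their reported port. The sketch below is a hypothetical Python helper (phys_by_port is not a smartmontools function) that only reads the text printed by -r ioctl,2 and does not query the driver; the sample is a shortened copy of the output above.

```python
import re

# Matches header lines such as: "Phy[3] Port:   0x00"
PHY_LINE = re.compile(r"^Phy\[(\d+)\]\s+Port:\s+(0x[0-9a-fA-F]+)")


def phys_by_port(text):
    """Group phy indices by the port number printed in the debug dump."""
    groups = {}
    for line in text.splitlines():
        m = PHY_LINE.match(line.strip())
        if m:
            phy = int(m.group(1))
            port = int(m.group(2), 16)
            groups.setdefault(port, []).append(phy)
    return groups


# Shortened copy of the dump above (illustrative sample only).
sample = """\
CSMI_SAS_PHY_INFO: NumberOfPhys=6
Phy[0] Port:   0x00
  Type:        0x10, 0x10
Phy[1] Port:   0x00
  Type:        0x10, 0x10
Phy[2] Port:   0x00
  Type:        0x10, 0x00
"""

if __name__ == "__main__":
    for port, phys in sorted(phys_by_port(sample).items()):
        print(f"port {port}: phys {phys}")
```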