Opened 5 years ago

Closed 5 years ago

Last modified 5 years ago

#1134 closed defect (worksforme)

Read NVMe SMART/Health Information failed: NVMe Status 0x6002

Reported by: Sider Owned by:
Priority: major Milestone: Release 7.0
Component: all Version: 6.6
Keywords: nvme Cc:

Description (last modified by Christian Franke)

Hello,

There is a problem with reading SMART data from NVMe drive Toshiba RC100. Is there a workaround? Web search didn't bring any result, even didn't description of the error code.
Here is full output:

sider@nas:~$ sudo smartctl -x /dev/nvme0n1
smartctl 6.6 2016-05-31 r4324 [x86_64-linux-4.15.0-39-generic] (local build)
Copyright (C) 2002-16, Bruce Allen, Christian Franke, www.smartmontools.org

=== START OF INFORMATION SECTION ===
Model Number:                       TOSHIBA-RC100
Serial Number:                      48APA0S7PW6S
Firmware Version:                   ADRA0101
PCI Vendor/Subsystem ID:            0x1179
IEEE OUI Identifier:                0x00080d
Controller ID:                      0
Number of Namespaces:               1
Namespace 1 Size/Capacity:          240 057 409 536 [240 GB]
Namespace 1 Formatted LBA Size:     512
Local Time is:                      Wed Nov 21 11:55:13 2018 MSK
Firmware Updates (0x12):            1 Slot, no Reset required
Optional Admin Commands (0x0017):   Security Format Frmw_DL *Other*
Optional NVM Commands (0x0017):     Comp Wr_Unc DS_Mngmt Sav/Sel_Feat
Maximum Data Transfer Size:         512 Pages
Warning  Comp. Temp. Threshold:     82 Celsius
Critical Comp. Temp. Threshold:     85 Celsius

Supported Power States
St Op     Max   Active     Idle   RL RT WL WT  Ent_Lat  Ex_Lat
 0 +     3.30W       -        -    0  0  0  0        0       0
 1 +     2.70W       -        -    1  1  1  1        0       0
 2 +     2.30W       -        -    2  2  2  2        0       0
 3 -   0.0500W       -        -    4  4  4  4    10000   45000
 4 -   0.0050W       -        -    4  4  4  4    10000   50000

Supported LBA Sizes (NSID 0x1)
Id Fmt  Data  Metadt  Rel_Perf
 0 -    4096       0         0
 1 +     512       0         3

=== START OF SMART DATA SECTION ===
Read NVMe SMART/Health Information failed: NVMe Status 0x6002

Change History (11)

comment:1 by Sider, 5 years ago

Keywords: nvme added

comment:2 by Christian Franke, 5 years ago

Description: modified (diff)

comment:3 by Christian Franke, 5 years ago

Milestone: undecided

Please test: smartctl -x /dev/nvme0

comment:4 by Sider, 5 years ago

Aha, much better, thanks. The problem now remains only in WEBMIN, which recognizes this drive as /dev/nvme0n1 and therefore doesnt show it's SMART data.

sider@nas:~$ sudo smartctl -x /dev/nvme0
smartctl 6.6 2016-05-31 r4324 [x86_64-linux-4.15.0-39-generic] (local build)
Copyright (C) 2002-16, Bruce Allen, Christian Franke, www.smartmontools.org

=== START OF INFORMATION SECTION ===
Model Number:                       TOSHIBA-RC100
Serial Number:                      48APA0S7PW6S
Firmware Version:                   ADRA0101
PCI Vendor/Subsystem ID:            0x1179
IEEE OUI Identifier:                0x00080d
Controller ID:                      0
Number of Namespaces:               1
Namespace 1 Size/Capacity:          240 057 409 536 [240 GB]
Namespace 1 Formatted LBA Size:     512
Local Time is:                      Wed Nov 21 15:53:03 2018 MSK
Firmware Updates (0x12):            1 Slot, no Reset required
Optional Admin Commands (0x0017):   Security Format Frmw_DL *Other*
Optional NVM Commands (0x0017):     Comp Wr_Unc DS_Mngmt Sav/Sel_Feat
Maximum Data Transfer Size:         512 Pages
Warning  Comp. Temp. Threshold:     82 Celsius
Critical Comp. Temp. Threshold:     85 Celsius

Supported Power States
St Op     Max   Active     Idle   RL RT WL WT  Ent_Lat  Ex_Lat
 0 +     3.30W       -        -    0  0  0  0        0       0
 1 +     2.70W       -        -    1  1  1  1        0       0
 2 +     2.30W       -        -    2  2  2  2        0       0
 3 -   0.0500W       -        -    4  4  4  4    10000   45000
 4 -   0.0050W       -        -    4  4  4  4    10000   50000

Supported LBA Sizes (NSID 0x1)
Id Fmt  Data  Metadt  Rel_Perf
 0 -    4096       0         0
 1 +     512       0         3

=== START OF SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

SMART/Health Information (NVMe Log 0x02, NSID 0xffffffff)
Critical Warning:                   0x00
Temperature:                        48 Celsius
Available Spare:                    100%
Available Spare Threshold:          10%
Percentage Used:                    0%
Data Units Read:                    3 623 [1,85 GB]
Data Units Written:                 24 595 [12,5 GB]
Host Read Commands:                 93 171
Host Write Commands:                170 034
Controller Busy Time:               1
Power Cycles:                       11
Power On Hours:                     7
Unsafe Shutdowns:                   10
Media and Data Integrity Errors:    0
Error Information Log Entries:      293
Warning  Comp. Temperature Time:    0
Critical Comp. Temperature Time:    0
Temperature Sensor 1:               48 Celsius

Error Information (NVMe Log 0x01, max 64 entries)
Num   ErrCount  SQId   CmdId  Status  PELoc          LBA  NSID    VS
  0        293     0  0x0005  0xc005  0x004            -     1     -
  1        292     0  0x0005  0xc005  0x004            -     1     -

comment:5 by Christian Franke, 5 years ago

Please test smartctl -d nvme,0xffffffff -x /dev/nvme0n1.

If this works, the problem is that this drive requires that the broadcast namespace is specified if SMART/Health and Error Information logs are requested. This issue was unspecified in early revisions of the NVMe standard.

Option -d nvme,0xffffffff should no longer be necessary with smartctl >= r4671.

comment:6 by Sider, 5 years ago

It works, thanks. I thought I have the latest version :)
How do I upgrade to smartctl >= r4671?
The changes in the link are for source code, as far as I understand. Is there a compiled version for Ubuntu 18.04 server?

comment:7 by Sider, 5 years ago

One more thing:
these strings are missing in all outputs:

SMART support is: Available - device has SMART capability.
SMART support is: Enabled

What might be the reason?

in reply to:  6 comment:8 by Christian Franke, 5 years ago

How do I upgrade to smartctl >= r4671?

Build from source or use a recent CircleCI build.

Is there a compiled version for Ubuntu 18.04 server?

No, the official repository still provides only this post-6.5 r4324 snapshot release. The real 6.6 release (r4594) is only available for 19.04. The Ubuntu package follows the Debian package.

comment:9 by Christian Franke, 5 years ago

Component: smartctlall
Milestone: undecidedRelease 6.7
Resolution: worksforme
Status: newclosed

Already fixed in r4671.

in reply to:  7 comment:10 by Christian Franke, 5 years ago

these strings are missing in all outputs:

SMART support is: Available - device has SMART capability.
SMART support is: Enabled

What might be the reason?

Unlike ATA/SATA, NVMe drives have no optional SMART support which could be unavailable or disabled. The NVMe SMART/Health Information log is mandatory.

In general, diagnostic info from ATA/SATA, SCSI/SAS and NVMe differ considerably and so do smartctl outputs.

comment:11 by Christian Franke, 5 years ago

Milestone: Release 6.7Release 7.0

Milestone renamed

Note: See TracTickets for help on using tickets.