Custom Query (1383 matches)
Results (19 - 21 of 1383)
Ticket | Resolution | Summary | Owner | Reporter |
---|---|---|---|---|
#1828 | duplicate | smartctl -x creates new NVMe errors | ||
Description |
I was given a drive to check the condition, but errors kept increasing. After a while I actually noticed that the errors increased by 1 by each smartctl -x call (I was repeatedly checking the state of the ssd while ext4 lazy inode init was doing its job after formatting the partition). Since the errors are not about failure, but rather a malformed command, could be done something in smartctl in order to prevent such things happening? My expectation: an error checking tool shall not generate errors while doing so. I see multiple reasons why to fix this - if somebody calls smartctl repetitively, the errors increase but its not actually representing any error condition of the drive. If somebody would call smartctl too aggressively, then actual errors might get lost. (if the nvme error log is a circular buffer which afaik is, 64 entries on mine). There was a #1222 ticket about something similar? Smartd should ignore non-error entries from NVMe Error Information log But hiding is one thing.. while making actual errors is something NOT desireable. So the drive in question is a WD SN640 nvme: === START OF INFORMATION SECTION === Model Number: WUS4BB076D7P3E3 Serial Number: ******** Firmware Version: R111000L PCI Vendor/Subsystem ID: 0x1b96 IEEE OUI Identifier: 0x0014ee Total NVM Capacity: 7,681,501,126,656 [7.68 TB] Unallocated NVM Capacity: 0 Controller ID: 0 NVMe Version: 1.3 Number of Namespaces: 1 Namespace 1 Size/Capacity: 7,681,501,126,656 [7.68 TB] Namespace 1 Formatted LBA Size: 4096 Namespace 1 IEEE EUI-64: 0014ee 83066cbb80 Local Time is: Tue Apr 30 20:30:15 2024 CEST Firmware Updates (0x19): 4 Slots, Slot 1 R/O, no Reset required Optional Admin Commands (0x001f): Security Format Frmw_DL NS_Mngmt Self_Test Optional NVM Commands (0x005e): Wr_Unc DS_Mngmt Wr_Zero Sav/Sel_Feat Timestmp Log Page Attributes (0x03): S/H_per_NS Cmd_Eff_Lg Warning Comp. Temp. Threshold: 70 Celsius Critical Comp. Temp. Threshold: 80 Celsius Namespace 1 Features (0x02): NA_Fields Current state at end of smartctl -x call: Error Information (NVMe Log 0x01, 16 of 256 entries) Num ErrCount SQId CmdId Status PELoc LBA NSID VS Message 0 12 0 0xd009 0xc004 - 0 1 - Invalid Field in Command 1 11 0 0xc008 0xc004 - 0 1 - Invalid Field in Command 2 10 0 0xa00b 0xc004 - 0 1 - Invalid Field in Command 3 9 0 0x900a 0xc004 - 0 1 - Invalid Field in Command 4 8 0 0x8009 0xc004 - 0 1 - Invalid Field in Command 5 7 0 0xa00e 0xc004 - 0 1 - Invalid Field in Command 6 6 0 0x900d 0xc004 - 0 1 - Invalid Field in Command 7 5 0 0x7008 0xc004 - 0 1 - Invalid Field in Command 8 4 0 0x800c 0xc004 - 0 1 - Invalid Field in Command 9 3 0 0x600c 0xc004 - 0 1 - Invalid Field in Command 10 2 0 0x100a 0xc004 - 0 1 - Invalid Field in Command 11 1 0 0x300e 0xc004 0x028 0 0 - Invalid Field in Command And using nvme-cli error-log, i see that all these errors (except the oldest) are of this kind: ................. Entry[ 0] ................. error_count : 12 sqid : 0 cmdid : 0xd009 status_field : 0x6002(Invalid Field in Command: A reserved coded value or an unsupported value in a defined field) phase_tag : 0 parm_err_loc : 0xffff lba : 0 nsid : 0x1 vs : 0 trtype : The transport type is not indicated or the error is not transport related. csi : 0 opcode : 0 cs : 0 trtype_spec_info: 0 log_page_version: 0 How can we trace this to an exact query which smartctl does? |
|||
#1827 | duplicate | Read Self-test Log failed: Invalid Field in Command (0x4002) | ||
Description |
Sorry if it's already been reported but I've not been able to find any similar reports. My device is:
|
|||
#1824 | wontfix | Change the capability name of "Offline surface scan" to "Off-line Read Scanning" | ||
Description |
This is listed in the capabilities section when using smartctl --capabilities command. It is called Off-line Read Scanning in this draft of ACS-3, page 264, Table 133, Bit 3: https://people.freebsd.org/~imp/asiabsdcon2015/works/d2161r5-ATAATAPI_Command_Set_-_3.pdf And I can find a few manuals of drives that use the name Off-line Read Scanning: https://documents.westerndigital.com/content/dam/doc-library/en_us/assets/public/western-digital/product/data-center-drives/ultrastar-dc-hc500-series/product-manual-ultrastar-dc-hc550-sata-oem-spec.pdf page 306, 12.53.2.6 Off-Line Data Collection Capability, Bit 3 https://ia804502.us.archive.org/14/items/manualzilla-id-7035037/7035037.pdf page 149, 14.39.2.7 Off-Line Data Collection Capability, Bit 3 http://datasheet.elcodis.com/pdf2/99/55/995536/mmdoe28gxmsp-0va.pdf page 27, 7.3.2.7 Off-line data collection capability, Bit 3 Using the same name as these documents will make searching for information easier. |