SSD 的读取错误率“糟糕”?

SSD 的读取错误率“糟糕”?

我的 Windows 7 笔记本电脑报告

Windows detected a hard disk problem. 
Back up your files immediately to prevent information loss, 
and then contact computer manufacturer to determine 
if you need to repair or replace the disk.

我使用的是 Micron SSD MTFDDAK512MAM-1K1,CrystalDiskInfo 报告一切正常,除了“原始读取错误率”,指出

  Attribute Name:           Current:    Worst:   Threshold:     Raw Values:
Raw Read Error Rate            1          1          50         0000000001C9

同时,Windows“磁盘管理”实用程序也报告该磁盘的状态为“健康”。经过一番挖掘,我发现Micron C400 手册将此属性解释为

SMART ID 1: Raw Read Error Rate

Attribute Flags (0x32)
• Warranty = 0
• Offline = 1
• Performance = 0
• Error Rate = 0
• Event Count = 1
• Self-Preservation = 1

Current Value (8 bits)
This value is the total number of correctable and uncorrectable ECC error events divided by the total host pages read over the life of the drive. 
Note that ECC errors occurring while reading non-user data will still contribute to this rate.
The Current Value will not be calculated and remains as 0x64 until the host read page count is not less than (100,000 × total block count ÷ 2).

Worst Value (8 bits)
The worst value of this field is the lowest value of the Current Value field ever calculated over the life of the drive, always between 1% and 100% (0x01 to 0x64).

Raw Data (48 bits)
This data field holds the raw sum of correctable and uncorrectable ECC error events over the life of the drive. If this ever exceeds 0xFFFFFFFFFFFF, this value will wrap around.

Reserved/Threshold (8 bits)
The threshold for this attribute is set to 0x32 (50%).

所以我想知道,

  1. 为什么“原始读取错误率”为 1 就被认为是不好的?读取错误率越低,读取效果不是越好(错误越少)吗?
  2. 这是否意味着我的 SSD 即将出现故障?自从几年前我买了笔记本电脑以来,它一直运行良好……

如能得到任何帮助/解释都将不胜感激!!

**编辑:CrystalDiskInfo 的完整智能报告如下: 在此处输入图片描述 作为

----------------------------------------------------------------------------
CrystalDiskInfo 8.12.0 (C) 2008-2021 hiyohiyo
                                Crystal Dew World: https://crystalmark.info/
----------------------------------------------------------------------------

    OS : Windows 7 Professional SP1 [6.1 Build 7601] (x64)
  Date : 2021/05/24 17:40:26

-- Controller Map ----------------------------------------------------------
 - ATA Channel 5 (5) [ATA]
 + ATA Channel 0 (0) [ATA]
   - MTFDDAK512MAM-1K1 ATA Device
 + Intel(R) 6 Series/C200 Series Chipset Family 6 Port SATA AHCI Controller - 1C03 [ATA]
   - ATA Channel 0 (0)
   + ATA Channel 1 (1)
     - MATSHITA DVD+-RW UJ8A2 ATA Device
   - ATA Channel 3 (3)
   - ATA Channel 4 (4)
   - ATA Channel 5 (5)
 - WinCDEmu Virtual Bus [SCSI]
 - O2Micro Integrated MMC/SD controller [SCSI]
 - O2Micro Integrated MS/MSPRO controller [SCSI]

-- Disk List ---------------------------------------------------------------
 (01) MTFDDAK512MAM-1K1 : 512.1 GB [0/0/0, pd1] - m2

----------------------------------------------------------------------------
 (01) MTFDDAK512MAM-1K1
----------------------------------------------------------------------------
           Model : MTFDDAK512MAM-1K1
        Firmware : 0909
   Serial Number : ************
       Disk Size : 512.1 GB (8.4/137.4/512.1/512.1)
     Buffer Size : Unknown
     Queue Depth : 32
    # of Sectors : 1000215216
   Rotation Rate : ---- (SSD)
       Interface : Serial ATA
   Major Version : ACS-2
   Minor Version : ATA8-ACS version 6
   Transfer Mode : SATA/600 | SATA/600
  Power On Hours : 19490 hours
  Power On Count : 4201 count
     Temperature : Unknown
   Health Status : Bad (98 %)
        Features : S.M.A.R.T., APM, NCQ, TRIM
       APM Level : 0000h [ON]
       AAM Level : ----
    Drive Letter : C:

-- S.M.A.R.T. --------------------------------------------------------------
ID Cur Wor Thr RawValues(6) Attribute Name
01 __1 __1 _50 0000000001C9 Raw Read Error Rate
05 100 100 _10 000000009000 Reallocated Sectors Count
09 100 100 __0 000000004C22 Power-On Hours
0C 100 100 __0 000000001069 Power Cycle Count
AA 100 100 _10 000000000009 Reserved Block Count
AB 100 100 __0 000000000000 Program Fail Count
AC 100 100 __0 000000000000 Erase Fail Count
AD _98 _98 _10 00000000004A Average Block-Erase Count
AE 100 100 __0 000000000001 Unexpected Power Loss Count
B5 100 100 __0 02C001E700D8 Program Fail Count
B7 100 100 __0 000000000000 SATA Interface Downshift
B8 100 100 _50 000000000000 Error Correction Count
BB 100 100 __0 0000000001B3 Reported Uncorrectable Errors
BC 100 100 __0 000000000000 Command Timeout Count
BD 100 100 __0 0000000000B1 Factory Bad Block Count
C2 100 100 __0 000000000000 Temperature
C3 100 100 __0 00000000180B Cumulative ECC Bit Correction Count
C4 100 100 __0 000000000009 Reallocation Event Count
C5 100 100 __0 000000000000 Current Pending Sector Count
C6 100 100 __0 000000000000 Smart Off-line Scan Uncorrectable Error Count
C7 100 100 __0 000000000003 Ultra DMA CRC Error Rate
CA _98 _98 __0 000000000002 Percent Lifetime Used
CE 100 100 __0 000000000000 Write Error Rate

-- IDENTIFY_DEVICE ---------------------------------------------------------
        0    1    2    3    4    5    6    7    8    9
000: 0440 3FFF C837 0010 0000 0000 003F 0000 0000 0000
010: FFFF FFFF FFFF FFFF FFFF FFFF FFFF FFFF FFFF FFFF
020: 0000 0000 0004 3039 3039 2020 2020 4D54 4644 4441
030: 4B35 3132 4D41 4D2D 314B 3120 2020 2020 2020 2020
040: 2020 2020 2020 2020 2020 2020 2020 8010 4000 2F00
050: 4001 0000 0000 0007 3FFF 0010 003F FC10 00FB 0110
060: FFFF 0FFF 0000 0007 0003 0078 0078 0078 0078 4020
070: 0000 0000 0000 0000 0000 001F 050E 0006 004C 0040
080: 03F8 0028 706B 7C09 4123 7069 BC09 4123 003F 0001
090: 0001 0000 FFFE 0000 0000 0000 0000 0000 0000 0000
100: 12B0 3B9E 0000 0000 0000 0008 4000 0000 500A 0751
110: 0945 AEDD 0000 0000 0000 0000 0000 0000 0000 401C
120: 401C 0000 0000 0000 0000 0000 0000 0000 0029 3039
130: 3039 2E30 412E 3032 0000 0000 0000 3730 3031 2020
140: 2020 3441 4C37 3636 3633 2020 2020 0000 0000 0000
150: 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000
160: 0000 0000 0000 0000 0000 0000 0000 0000 0003 0001
170: 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000
180: 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000
190: 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000
200: 0000 0000 0000 0000 0000 0000 003D 0000 0000 4000
210: 0000 0000 0000 0000 0000 0000 0000 0001 0000 0000
220: 0000 0000 103F 0000 0000 0000 0000 0000 0000 0000
230: 0000 0000 0000 0000 0001 00FF 0000 0000 0000 0000
240: 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000
250: 0000 0000 0000 0000 0000 7BA5

-- SMART_READ_DATA ---------------------------------------------------------
     +0 +1 +2 +3 +4 +5 +6 +7 +8 +9 +A +B +C +D +E +F
000: 10 00 01 2F 00 01 01 C9 01 00 00 00 00 00 05 33
010: 00 64 64 00 90 00 00 00 00 00 09 32 00 64 64 22
020: 4C 00 00 00 00 00 0C 32 00 64 64 69 10 00 00 00
030: 00 00 AA 33 00 64 64 09 00 00 00 00 00 00 AB 32
040: 00 64 64 00 00 00 00 00 00 00 AC 32 00 64 64 00
050: 00 00 00 00 00 00 AD 33 00 62 62 4A 00 00 00 00
060: 00 00 AE 32 00 64 64 01 00 00 00 00 00 00 B5 22
070: 00 64 64 D8 00 E7 01 C0 02 00 B7 32 00 64 64 00
080: 00 00 00 00 00 00 B8 33 00 64 64 00 00 00 00 00
090: 00 00 BB 32 00 64 64 B3 01 00 00 00 00 00 BC 32
0A0: 00 64 64 00 00 00 00 00 00 00 BD 0E 00 64 64 B1
0B0: 00 00 00 00 00 00 C2 22 00 64 64 00 00 00 00 00
0C0: 00 00 C3 3A 00 64 64 0B 18 00 00 00 00 00 C4 32
0D0: 00 64 64 09 00 00 00 00 00 00 C5 32 00 64 64 00
0E0: 00 00 00 00 00 00 C6 30 00 64 64 00 00 00 00 00
0F0: 00 00 C7 36 00 64 64 03 00 00 00 00 00 00 CA 18
100: 00 62 62 02 00 00 00 00 00 00 CE 0E 00 64 64 00
110: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
120: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
130: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
140: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
150: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
160: 00 00 00 00 00 00 00 00 00 00 00 00 4C 09 00 7B
170: 03 00 01 00 02 27 03 00 00 00 00 00 00 00 00 00
180: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
190: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
1A0: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
1B0: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
1C0: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
1D0: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
1E0: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
1F0: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 10

-- SMART_READ_THRESHOLD ----------------------------------------------------
     +0 +1 +2 +3 +4 +5 +6 +7 +8 +9 +A +B +C +D +E +F
000: 10 00 01 32 00 00 00 00 00 00 00 00 00 00 05 0A
010: 00 00 00 00 00 00 00 00 00 00 09 00 00 00 00 00
020: 00 00 00 00 00 00 0C 00 00 00 00 00 00 00 00 00
030: 00 00 AA 0A 00 00 00 00 00 00 00 00 00 00 AB 00
040: 00 00 00 00 00 00 00 00 00 00 AC 00 00 00 00 00
050: 00 00 00 00 00 00 AD 0A 00 00 00 00 00 00 00 00
060: 00 00 AE 00 00 00 00 00 00 00 00 00 00 00 B5 00
070: 00 00 00 00 00 00 00 00 00 00 B7 00 00 00 00 00
080: 00 00 00 00 00 00 B8 32 00 00 00 00 00 00 00 00
090: 00 00 BB 00 00 00 00 00 00 00 00 00 00 00 BC 00
0A0: 00 00 00 00 00 00 00 00 00 00 BD 00 00 00 00 00
0B0: 00 00 00 00 00 00 C2 00 00 00 00 00 00 00 00 00
0C0: 00 00 C3 00 00 00 00 00 00 00 00 00 00 00 C4 00
0D0: 00 00 00 00 00 00 00 00 00 00 C5 00 00 00 00 00
0E0: 00 00 00 00 00 00 C6 00 00 00 00 00 00 00 00 00
0F0: 00 00 C7 00 00 00 00 00 00 00 00 00 00 00 CA 00
100: 00 00 00 00 00 00 00 00 00 00 CE 00 00 00 00 00
110: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
120: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
130: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
140: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
150: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
160: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
170: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
180: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
190: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
1A0: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
1B0: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
1C0: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
1D0: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
1E0: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
1F0: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 6C

**编辑#2:事实证明,美光甚至有自己的存储设备分析/报告工具https://www.micron.com/products/ssd/storage-executive-software,报告一切正常: 在此处输入图片描述

答案1

为什么“原始读取错误率”为 1 就被认为是不好的?读取错误率越低,读取效果不是越好(错误越少)吗?

您的研究发现,这Raw Read Error Rate是源于“可纠正和不可纠正的 ECC 错误事件总数”。该数字被标准化并以百分比形式处理,因此当前值代表 1%,即 1% 的读取操作出现了问题。

现代 NAND 芯片明确要求具备 ECC 功能,因为在正常运行期间偶尔会出现读取位错误。该要求将指定每个 NAND 页面读取中可能出错且需要纠正的位数。

换句话说,读取操作有时可能会引发可纠正错误,因此这不是即将发生故障的指示。
发生无法纠正读取错误可能会带来问题。理论上,如果某个扇区/页面/块(持续)产生无法纠正的读取错误,则集成驱动器控制器应识别该扇区/页面/块,将其标记为坏块,并停止使用。

并不Raw Read Error Rate无法纠正
读取错误(现在可以在您附加的 SMART 报告中找到)。无法纠正读取错误似乎显示为Reported Uncorrectable Errors0x1B3 或 435。
与总读取错误 0x1C9 或 457 相比,这表明只有 22 个(良性)可纠正的读取错误(假设没有回绕),但总数中的 95% 是令人担忧的无法纠正读取错误。

这是否意味着我的 SSD 即将出现故障?自从几年前我买了笔记本电脑以来,它一直运行良好……

如果你认为驱动器“工作正常”,则可能意味着驱动器能够通过成功重试和/或重新映射成功从这些错误中恢复。 (请注意,SMART 报告表明在此驱动器的使用寿命内,迄今为止已有 9 个块被淘汰。)
至少您可以备份该驱动器中的数据,并定期监控 SMART 报告中的更改。

由于使用了近 20,000 小时,因此无法确定这些错误何时发生。
但您可以尝试通过扫描整个驱动器来生成新的读取错误,方法是使用 SMART 长/扩展测试或使用 Linux 命令(例如)sudo dd if=/dev/sdX of=/dev/null。第一个测试要快得多,但只会增加 SMART 统计数据,而后面的测试也可能因读取错误而中止,从而提供问题区域的 LBA。
如果您没有遇到更多读取错误,那么这可以令人放心。

请注意,SMART 报告显示当前值为 98%,表示Percent Lifetime Used仅使用了预期使用寿命的 2%。原始值为 2,表示两个显著的​​使用寿命终止指标(平均块磨损和可用备用块)均没有问题。

答案2

屏幕截图稍微澄清了一些问题。“当前”和“最差”是标准化值(越高越好)。“原始值”字段可能是十六进制的原始读取错误率(因此十进制为 457)。我预计原始读取错误率为 0 是理想的,并且可能对应于“当前”值 100。您的标准化“当前”值 (1) 远低于标准化阈值 (50),这表明 Micron 预计您的 SSD 很快就会出现故障。

每个属性都有一个 1 字节的规范化值(范围从 1 到 253)和一个对应的 1 字节阈值。如果一个或多个规范化属性值小于或等于其对应的阈值,则磁盘预计会在 24 小时内发生故障,或者已经超出其设计或使用寿命。

https://www.linuxjournal.com/article/6983

答案3

这看起来相当于 MX500,但使用寿命评级存在缺陷,因为它没有计算 FTL 页数,而 FTL 页数比所提供的屏幕截图中的主机页数值高出几个倍。

mx 500 中的控制器以及外观上的 micron c400 存在严重的写入放大问题,这将比预期更快地损坏驱动器。

没有 74 次擦除周期,在正常情况下仅使用 ssd 寿命的 2%

这个驱动器有缺陷,请更换或退款,然后购买没有此放大缺陷的 SSD。

相关内容