背景
今天早上,当我打开双启动机器并尝试进入 Windows 时,它没有工作。Windows 的自我诊断工具不起作用,两者都不起作用chkdsk
或ntfsfix
两者都说分区无法修复。幸运的是,我备份了大部分内容,Ubuntu 仍然可以工作(在同一个驱动器上)。我不知道这是由 Windows 更新或类似原因造成的,还是某种环境影响(我冒着倾盆大雨走回家,笔记本电脑放在防水背包的保护套里),或者是驱动器有故障。
问题
我如何判断 SSD 是否有故障?这是 INTEL SSDPEKKW256G7,也称为英特尔 SSD 600p 系列 (256GB)。英特尔有一个工具来评估驱动器的健康状况,但它只能在 Windows 上运行。
我尝试过的方法
chkdsk
(失败unable to determine volumen version and state
)并ntfsfix
尝试修复驱动器。我试图smartctl
弄清楚是否值得在驱动器上重新安装 Windows 或购买新的驱动器:
$ sudo smartctl -t long -a /dev/nvme0
smartctl 6.6 2016-05-31 r4324 [x86_64-linux-4.15.0-20-generic] (local build)
Copyright (C) 2002-16, Bruce Allen, Christian Franke, www.smartmontools.org
=== START OF INFORMATION SECTION ===
Model Number: INTEL SSDPEKKW256G7
Serial Number: BTPY63520PCF256D
Firmware Version: PSF100C
PCI Vendor/Subsystem ID: 0x8086
IEEE OUI Identifier: 0x5cd2e4
Controller ID: 1
Number of Namespaces: 1
Namespace 1 Size/Capacity: 256,060,514,304 [256 GB]
Namespace 1 Formatted LBA Size: 512
Local Time is: Thu Nov 1 18:14:28 2018 GMT
Firmware Updates (0x12): 1 Slot, no Reset required
Optional Admin Commands (0x0006): Format Frmw_DL
Optional NVM Commands (0x001e): Wr_Unc DS_Mngmt Wr_Zero Sav/Sel_Feat
Maximum Data Transfer Size: 32 Pages
Warning Comp. Temp. Threshold: 70 Celsius
Critical Comp. Temp. Threshold: 80 Celsius
Supported Power States
St Op Max Active Idle RL RT WL WT Ent_Lat Ex_Lat
0 + 9.00W - - 0 0 0 0 5 5
1 + 4.60W - - 1 1 1 1 30 30
2 + 3.80W - - 2 2 2 2 30 30
3 - 0.0700W - - 3 3 3 3 10000 300
4 - 0.0050W - - 4 4 4 4 2000 10000
Supported LBA Sizes (NSID 0x1)
Id Fmt Data Metadt Rel_Perf
0 + 512 0 0
=== START OF SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED
SMART/Health Information (NVMe Log 0x02, NSID 0xffffffff)
Critical Warning: 0x00
Temperature: 41 Celsius
Available Spare: 100%
Available Spare Threshold: 10%
Percentage Used: 8%
Data Units Read: 45,702,395 [23.3 TB]
Data Units Written: 38,274,889 [19.5 TB]
Host Read Commands: 1,028,294,677
Host Write Commands: 655,432,948
Controller Busy Time: 8,135
Power Cycles: 2,208
Power On Hours: 8,663
Unsafe Shutdowns: 71
Media and Data Integrity Errors: 1
Error Information Log Entries: 316
Warning Comp. Temperature Time: 24
Critical Comp. Temperature Time: 0
Error Information (NVMe Log 0x01, max 64 entries)
Num ErrCount SQId CmdId Status PELoc LBA NSID VS
0 316 4 0x005a 0x0004 0x018 0 1 -
1 315 4 0x005a 0x0004 0x018 0 1 -
2 314 4 0x005a 0x0004 0x018 0 1 -
3 313 4 0x005a 0x0004 0x018 0 1 -
4 312 4 0x005a 0x0004 0x018 0 1 -
5 311 4 0x005a 0x0004 0x018 0 1 -
6 310 4 0x005a 0x0004 0x018 0 1 -
7 309 4 0x005a 0x0004 0x018 0 1 -
8 308 4 0x005a 0x0004 0x018 0 1 -
9 307 4 0x005a 0x0004 0x018 0 1 -
10 306 4 0x005a 0x0004 0x018 0 1 -
11 305 4 0x005a 0x0004 0x018 0 1 -
12 304 4 0x005a 0x0004 0x018 0 1 -
13 303 4 0x005a 0x0004 0x018 0 1 -
14 302 4 0x005a 0x0004 0x018 0 1 -
15 301 4 0x005a 0x0004 0x018 0 1 -
... (48 entries not shown)
我可以看到 SMART 测试已通过,但我很难找到其他看起来相关的内容。