缓存设备崩溃后无法启动 LVM 卷

Question 1

我无法重新激活 LVM 卷，但我能够读取数据。

root@fileserver:~# lvchange -ay raid6-4T/r6_4T_files_corig --activationmode partial
  PARTIAL MODE. Incomplete logical volumes will be processed.
Do you want to activate component LV in read-only mode? [y/n]: y
  Allowing activation of component LV.
  Couldn't find device with uuid tOkG3t-aWGl-4PfO-DI3O-TMoG-ia1z-p4UQgP.
  Couldn't find device with uuid qmDOrk-0SRI-9Z1S-PzgI-GRI8-xLrP-kC5LHd.

root@fileserver:~# mount -o noload -r /dev/mapper/raid6--4T-r6_4T_files_corig /media/data/

root@fileserver:~# lvchange -ay raid6-4T/lvm-var_corig --activationmode partial
  PARTIAL MODE. Incomplete logical volumes will be processed.
Do you want to activate component LV in read-only mode? [y/n]: y
  Allowing activation of component LV.
  Couldn't find device with uuid tOkG3t-aWGl-4PfO-DI3O-TMoG-ia1z-p4UQgP.
  Couldn't find device with uuid qmDOrk-0SRI-9Z1S-PzgI-GRI8-xLrP-kC5LHd.

root@fileserver:~# mount -o noload -r /dev/mapper/raid6--4T-lvm--var_corig /media/var/

Answer

我无法重新激活 LVM 卷，但我能够读取数据。

root@fileserver:~# lvchange -ay raid6-4T/r6_4T_files_corig --activationmode partial
  PARTIAL MODE. Incomplete logical volumes will be processed.
Do you want to activate component LV in read-only mode? [y/n]: y
  Allowing activation of component LV.
  Couldn't find device with uuid tOkG3t-aWGl-4PfO-DI3O-TMoG-ia1z-p4UQgP.
  Couldn't find device with uuid qmDOrk-0SRI-9Z1S-PzgI-GRI8-xLrP-kC5LHd.

root@fileserver:~# mount -o noload -r /dev/mapper/raid6--4T-r6_4T_files_corig /media/data/

root@fileserver:~# lvchange -ay raid6-4T/lvm-var_corig --activationmode partial
  PARTIAL MODE. Incomplete logical volumes will be processed.
Do you want to activate component LV in read-only mode? [y/n]: y
  Allowing activation of component LV.
  Couldn't find device with uuid tOkG3t-aWGl-4PfO-DI3O-TMoG-ia1z-p4UQgP.
  Couldn't find device with uuid qmDOrk-0SRI-9Z1S-PzgI-GRI8-xLrP-kC5LHd.

root@fileserver:~# mount -o noload -r /dev/mapper/raid6--4T-lvm--var_corig /media/var/

Question 2

我今天遇到了同样的问题（缓存 SSD 死了），但最终运行vgreduce --removemissing --force并删除了关联的逻辑卷！一阵恐慌之后，我尝试使用 vgcfgrestore 从存档的 LVM 配置中恢复 LV，但由于物理卷丢失，这也一直失败。

对我来说，修复方法是编辑存档的 LVM 配置（在 /etc/lvm/archive/vgname.vg 中）并删除对有缺陷的 PV 和缓存设备/卷的任何引用。

在该文件中，您将找到对 PV（包括有缺陷的 PV）和 LV（在此示例中我们将其称为“lvname”）的引用。 LV 引用将有一个“缓存”类型的段。您还会发现一个名为“lvname_corig”的 LV，在数据 PV 上有多个段，指示非缓存数据的存储位置，以及用于缓存元数据等的更多 LV。

我解决这个问题的过程是：

备份原始 VG 配置文件。
从 lvname 复制“id”和“state”行，然后找到名为 lvname_corig 的卷并将它们粘贴到那里（注释掉该块中的 id 和 state 行）。
删除lvname块。
删除缓存的所有相关卷块（cachedatalvname_cpool、cachedatalvname_cpool_cmeta、cachedatalvname_cpool_cdata）。
将 lvname_corig 块重命名为 lvname。
删除引用有缺陷 PV 的任何其他卷（在我的例子中，有一个 lvol0_pmspare 卷）。
删除对有缺陷的 PV 的引用（在我的例子中，这是一个名为“pv3”的块）。

完成所有这些编辑后，用于vgcfgrestore -f filename vgname恢复 VG/LV 配置，而无需有缺陷的缓存 PV 或任何相关的缓存卷。

Answer