即使出现小型 SAN 网络问题，SBD 也会终止两个群集节点

Question

您需要检查各个层：

1：hba驱动程序参数

modinfo <module_name>

2：多路径超时和特殊方式配置参数no_path_retry = fail

multipath -v3

我从你的 sbd dump 中看到“watch timeout 10”，我认为多路径的超时时间不够

模式应采用以下方式（快速，无需重试）：

failed hba(report the down)-> linux scsi says (disks on that path are down) -> multipath says that disk is failed i don't retry there any io request and start to work the no failed path.

但是如果你使用默认的参数，你的 sbd 进程的 io 请求仍然处于挂起状态

Answer 1

您需要检查各个层：

1：hba驱动程序参数

modinfo <module_name>

2：多路径超时和特殊方式配置参数no_path_retry = fail

multipath -v3

我从你的 sbd dump 中看到“watch timeout 10”，我认为多路径的超时时间不够

模式应采用以下方式（快速，无需重试）：

failed hba(report the down)-> linux scsi says (disks on that path are down) -> multipath says that disk is failed i don't retry there any io request and start to work the no failed path.

但是如果你使用默认的参数，你的 sbd 进程的 io 请求仍然处于挂起状态

即使出现小型 SAN 网络问题，SBD 也会终止两个群集节点

答案1

相关内容