“我们在一台运行 Ubuntu 20.04 并配备 NVIDIA 4090 GPU 的机器上遇到了意外关机。该系统使用 NVIDIA 驱动程序 NVIDIA-SMI 535.171.04 和内核版本 5.15.0-1052-intel-iotg。机器运行一段时间后就意外关机,并且不会自动重启。我们在日志中注意到了 ACPI 错误和热错误。”
请查看以下错误日志:
ACPI: thermal: Thermal Zone [TZ00] (28 C)
[ 0.993513] ACPI: video: Video Device [GFX0] (multi-head: yes rom: no post: no)
[ 3.596622] ACPI Warning: \_SB.PC00.PEG1.PEGP._DSM: Argument #4 type mismatch - Found [Buffer], ACPI requires [Package] (20210730/nsarguments-61)
[ 3.778368] ACPI BIOS Error (bug): Failure creating named object [\_SB.PC00.PEG1.PEGP._DSM.USRG], AE_ALREADY_EXISTS (20210730/dsfield-184)
[ 3.778391] ACPI Error: AE_ALREADY_EXISTS, CreateBufferField failure (20210730/dswload2-477)
[ 3.778415] ACPI Error: Aborting method \_SB.PC00.PEG1.PEGP._DSM due to previous error (AE_ALREADY_EXISTS) (20210730/psparse-529)
[ 3.778537] ACPI BIOS Error (bug): Failure creating named object [\_SB.PC00.PEG1.PEGP._DSM.USRG], AE_ALREADY_EXISTS (20210730/dsfield-184)
[ 3.778556] ACPI Error: AE_ALREADY_EXISTS, CreateBufferField failure (20210730/dswload2-477)"
ACPI: thermal: Thermal Zone [TZ00] (28 C)
[ 3.568606] thermal thermal_zone1: failed to read out thermal zone (-61)
nvidia: module verification failed: signature and/or required key missing - tainting kernel