Skip to content

Nvidia GPU passthrough issue #10036

@luxnr

Description

@luxnr

Qubes OS release

Qubes OS 4.2

Brief summary

Trying to setup GPU passthrough on my newly built desktop computer (motherboard : MSI B650, GPU: MSI RTX 5070). The qube boots and the GPU is listed in lspci, but nvidia-smi shows "no devices were found" error.

I've checked that the passthrough works with KVM (same guest, Fedora host).

Steps to reproduce

  1. Install Qubes 4.2.4 and update dom0
  2. Hide GPU and associated audio device from dom0
  3. Create a fedora HVM and attach both devices (with permissive=True and no-strict-reset=True)
  4. Install akmod-nvidia and xorg-x11-drv-nvidia-cuda in HVM
  5. Reboot
  6. Run nvidia-smi

Expected behavior

Print GPU informations

Actual behavior

"No devices were found"

Additional information

In dom0

$ cat /proc/cmdline
placeholder root=/dev/mapper/qubes_dom0-root ro rd.luks.uuid=luks-21e9460e-7bff-4bf9-8b96-541bacb7b6eb rd.lvm.lv=qubes_dom0/root rd.lvm.lv=qubes_dom0/swap plymouth.ignore-serial-consoles 6.6.77-1.qubes.fc37.x86_64 x86_64 rhgb quiet usbcore.authorized_default=0 rd.qubes.hide_pci=01:00.0,01:00.1,02:00.0,11:00.3,11:00.4,12:00.0
$ lspci -nnk
01:00.0 VGA compatible controller [0300]: NVIDIA Corporation Device [10de:2f04] (rev a1)
	Subsystem: Micro-Star International Co., Ltd. [MSI] Device [1462:5323]
	Kernel driver in use: pciback
	Kernel modules: nouveau
01:00.1 Audio device [0403]: NVIDIA Corporation Device [10de:2f80] (rev a1)
	Subsystem: NVIDIA Corporation Device [10de:0000]
	Kernel driver in use: pciback
	Kernel modules: snd_hda_intel

In gpu qube

$ lspci -nnk
00:06.0 VGA compatible controller [0300]: NVIDIA Corporation GB205 [GeForce RTX 5070] [10de:2f04] (rev a1)
    Subsystem: Micro-Star International Co., Ltd. [MSI] Device [1462:5323]
    Kernel driver in use: nvidia
    Kernel modules: nouveau, nova_core, nvidia_drm, nvidia
00:07.0 Audio device [0403]: NVIDIA Corporation Device [10de:2f80] (rev a1)
    Subsystem: NVIDIA Corporation Device [10de:0000]
    Kernel driver in use: snd_hda_intel
    Kernel modules: nova_core, snd_hda_intel
$ nvidia-smi
No devices were found
$ sudo dmesg
[    4.639647] nvidia: loading out-of-tree module taints kernel.
[    4.639655] nvidia: module verification failed: signature and/or required key missing - tainting kernel
[    4.742397] vif vif-0 enX0: renamed from eth0
[    4.742708] nvidia-nvlink: Nvlink Core is being initialized, major device number 237

[    4.745428] nvidia 0000:00:06.0: vgaarb: VGA decodes changed: olddecodes=io+mem,decodes=none:owns=io+mem
[    4.809931] snd_hda_intel 0000:00:07.0: Disabling MSI
[    4.843360] input: HDA NVidia HDMI/DP,pcm=3 as /devices/pci0000:00/0000:00:07.0/sound/card0/input7
[    4.845393] input: HDA NVidia HDMI/DP,pcm=7 as /devices/pci0000:00/0000:00:07.0/sound/card0/input8
[    4.848111] input: HDA NVidia HDMI/DP,pcm=8 as /devices/pci0000:00/0000:00:07.0/sound/card0/input9
[    4.848885] input: HDA NVidia HDMI/DP,pcm=9 as /devices/pci0000:00/0000:00:07.0/sound/card0/input10
[    4.924466] EXT4-fs (xvda2): mounted filesystem b4503358-f7db-42e5-9e6f-5afcccdb7b61 r/w with ordered data mode. Quota mode: none.
[    5.684786] NET: Registered PF_QIPCRTR protocol family
[    6.026721] RPC: Registered named UNIX socket transport module.
[    6.026724] RPC: Registered udp transport module.
[    6.026726] RPC: Registered tcp transport module.
[    6.026727] RPC: Registered tcp-with-tls transport module.
[    6.026728] RPC: Registered tcp NFSv4.1 backchannel transport module.
[    7.656386] NVRM: loading NVIDIA UNIX Open Kernel Module for x86_64  575.64  Release Build  (dvs-builder@U22-I3-AF02-18-4)  Tue Jun 10 18:53:09 UTC 2025
[    7.815475] nvidia-modeset: Loading NVIDIA UNIX Open Kernel Mode Setting Driver for x86_64  575.64  Release Build  (dvs-builder@U22-I3-AF02-18-4)  Tue Jun 10 18:43:58 UTC 2025
[    8.029491] NVRM: kfspProcessCommandResponse_GH100: FSP response reported error. Task ID: 0x1 Command type: 0x14 Error code: 0x177
[    8.029503] NVRM: kfspSendBootCommands_GH100: Sent following content to FSP: 
[    8.029505] NVRM: kfspSendBootCommands_GH100: version=0x2, size=0x35c, gspFmcSysmemOffset=0x129a80000
[    8.029506] NVRM: kfspSendBootCommands_GH100: frtsSysmemOffset=0x0, frtsSysmemSize=0x0
[    8.029508] NVRM: kfspSendBootCommands_GH100: frtsVidmemOffset=0x1c00000, frtsVidmemSize=0x100000
[    8.029509] NVRM: kfspSendBootCommands_GH100: gspBootArgsSysmemOffset=0x1418b4000
[    8.029510] NVRM: kfspSendBootCommands_GH100: FSP boot cmds failed. RM cannot boot.
[    8.029513] NVRM: kfspDumpDebugState_GB202: FSP microcode v4.76
[    8.029515] NVRM: kfspDumpDebugState_GB202: GPU 0000:00:06
[    8.029518] NVRM: kfspDumpDebugState_GB202: NV_PFSP_FALCON_COMMON_SCRATCH_GROUP_2(0) = 0x177
[    8.029520] NVRM: kfspDumpDebugState_GB202: NV_PFSP_FALCON_COMMON_SCRATCH_GROUP_2(1) = 0x0
[    8.029523] NVRM: kfspDumpDebugState_GB202: NV_PFSP_FALCON_COMMON_SCRATCH_GROUP_2(2) = 0x0
[    8.029525] NVRM: kfspDumpDebugState_GB202: NV_PFSP_FALCON_COMMON_SCRATCH_GROUP_2(3) = 0xa
[    8.030705] NVRM: nvCheckOkFailedNoLog: Check failed: Failure: Generic Error [NV_ERR_GENERIC] (0x0000FFFF) returned from kfspSendBootCommands_HAL(pGpu, pKernelFsp) @ kernel_gsp_gh100.c:856
[    8.031121] NVRM: _kgspBootGspRm: unexpected WPR2 already up, cannot proceed with booting GSP
[    8.031124] NVRM: _kgspBootGspRm: (the GPU is likely in a bad state and may need to be reset)
[    8.031131] NVRM: gpuHandleSanityCheckRegReadError_GH100: Possible bad register read: addr: 0x110094,  regvalue: 0xbadf4100,  error code: Unknown SYS_PRI_ERROR_CODE
[    8.031147] NVRM: RmInitAdapter: Cannot initialize GSP firmware RM
[    8.032726] NVRM: GPU 0000:00:06.0: RmInitAdapter failed! (0x62:0x40:1941)
[    8.037384] NVRM: GPU 0000:00:06.0: rm_init_adapter failed, device minor number 0
[    8.048182] [drm] [nvidia-drm] [GPU ID 0x00000006] Loading driver
[    8.112082] NVRM: _kgspBootGspRm: unexpected WPR2 already up, cannot proceed with booting GSP
[    8.112088] NVRM: _kgspBootGspRm: (the GPU is likely in a bad state and may need to be reset)
[    8.112095] NVRM: gpuHandleSanityCheckRegReadError_GH100: Possible bad register read: addr: 0x110094,  regvalue: 0xbadf4100,  error code: Unknown SYS_PRI_ERROR_CODE
[    8.112105] NVRM: RmInitAdapter: Cannot initialize GSP firmware RM
[    8.113160] NVRM: GPU 0000:00:06.0: RmInitAdapter failed! (0x62:0x40:1941)
[    8.117994] NVRM: GPU 0000:00:06.0: rm_init_adapter failed, device minor number 0

(the last part repeats)

Metadata

Metadata

Assignees

No one assigned

    Labels

    C: otherNo other component ("C:") label applies to this issue, or the appropriate label is not yet known.P: defaultPriority: default. Default priority for new issues, to be replaced given sufficient information.affects-4.2This issue affects Qubes OS 4.2.hardware supportThis issue pertains to the hardware that Qubes OS supports.needs diagnosisRequires technical diagnosis from developer. Replace with "diagnosed" or remove if otherwise closed.

    Type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions