[vfio-users] Error RmInitAdapter failed! (0x24:0x1f:1127) from guest GPU

Kevin Vasko kvasko at gmail.com
Thu Jan 19 01:35:03 UTC 2017


I am having trouble getting the NVIDIA drivers to work properly in my guest
VM. I keep running into the same problem: the error in the subject line
shows up in the guest's dmesg.

What I do is...

sudo apt-get update
sudo apt-get install linux-image-generic
sudo shutdown -r now
sudo apt-get install build-essential linux-source linux-headers-`uname -r`
sudo ./NVIDIA-Linux-x86_64-367.27.run

After installing the NVIDIA 367.27 driver, I run nvidia-smi and get "No
devices were found", even though the device shows up without problems in
lspci -vnnn on the guest, which also shows the driver attached to the device.

$ sudo nvidia-smi
No devices were found

$ dmesg | grep NVRM
[  636.432822] NVRM: loading NVIDIA UNIX x86_64 Kernel Module  367.27  Thu Jun  9 18:53:27 PDT 2016
[  647.951682] NVRM: loading NVIDIA UNIX x86_64 Kernel Module  367.27  Thu Jun  9 18:53:27 PDT 2016
[  664.443838] NVRM: RmInitAdapter failed! (0x24:0x1f:1127)
[  664.443844] NVRM: rm_init_adapter failed for device bearing minor number 0
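
In case it helps anyone compare against their own logs, here is a minimal sketch for pulling just the code triple out of the NVRM lines. The helper name nvrm_init_failures is made up for illustration and is not part of the NVIDIA driver or any standard tool. It only assumes the message format shown above.

```shell
# Hypothetical helper (name is an assumption, not an NVIDIA tool):
# print the code triple from every "RmInitAdapter failed!" line.
# Reads kernel log text on stdin, so it works on live dmesg output
# or on a saved log file.
nvrm_init_failures() {
    sed -n 's/.*NVRM: RmInitAdapter failed! (\([^)]*\)).*/\1/p'
}

# Usage on a live guest (may need root if dmesg is restricted):
#   dmesg | nvrm_init_failures
```

Reading from stdin keeps the helper usable on logs copied off the guest as well.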


I have added the kvm=off flag, i.e. "-cpu Haswell-noTSX,kvm=off".

I can confirm that it is off by doing the following on the guest.

dmesg | grep Booting
[    0.000000] Booting paravirtualized kernel on bare hardware
[    0.088000] x86: Booting SMP configuration:
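
The check above can be scripted roughly as follows. This is only a sketch: pv_platform is a hypothetical helper name, and it relies on the "Booting paravirtualized kernel on ..." boot message shown above. As far as I know the kernel prints "KVM" there when it detects the KVM signature, so "bare hardware" suggests the guest kernel does not see it. It says nothing about what the NVIDIA driver itself detects.

```shell
# Hypothetical helper (name is an assumption): report the platform the
# guest kernel says it booted on, taken from the first matching
# "Booting paravirtualized kernel on ..." line on stdin.
pv_platform() {
    sed -n 's/.*Booting paravirtualized kernel on \(.*\)$/\1/p' | head -n 1
}

# Usage on the guest:
#   dmesg | pv_platform
```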


Host:

$uname -a
Linux node-81 3.19.0-79-generic #87~14.04.1-Ubuntu SMP Wed Dec 21 18:12:31 UTC 2016 x86_64 GNU/Linux


kvm -version
QEMU emulator version 2.3.0 (Debian 1:2.3+dfsg-5~u14.04+mos3)

$virsh -version
1.2.9.3

Guest:
$ uname -a
Linux safsfd 3.13.0-107-generic #154-Ubuntu SMP Tue Dec 20 09:57:27 UTC 2016 x86_64 x86_64 x86_64 GNU/Linux


# lspci -vnnn

00:05.0 VGA compatible controller: NVIDIA Corporation GM200 [GeForce GTX TITAN X] (rev a1) (prog-if 00 [VGA controller])
        Subsystem: NVIDIA Corporation Device 1132
        Physical Slot: 5
        Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx-
        Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx-
        Latency: 0, Cache Line Size: 32 bytes
        Interrupt: pin A routed to IRQ 11
        Region 0: Memory at fd000000 (32-bit, non-prefetchable) [size=16M]
        Region 1: Memory at e0000000 (64-bit, prefetchable) [size=256M]
        Region 3: Memory at f2000000 (64-bit, prefetchable) [size=32M]
        Region 5: I/O ports at c000 [size=128]
        Expansion ROM at fe000000 [disabled] [size=512K]
        Capabilities: [60] Power Management version 3
                Flags: PMEClk- DSI- D1- D2- AuxCurrent=0mA PME(D0-,D1-,D2-,D3hot-,D3cold-)
                Status: D0 NoSoftRst+ PME-Enable- DSel=0 DScale=0 PME-
        Capabilities: [68] MSI: Enable- Count=1/1 Maskable- 64bit+
                Address: 00000000fee3f00c  Data: 4191
        Capabilities: [78] Express (v2) Legacy Endpoint, MSI 00
                DevCap: MaxPayload 256 bytes, PhantFunc 0, Latency L0s unlimited, L1 <64us
                        ExtTag+ AttnBtn- AttnInd- PwrInd- RBE+ FLReset-
                DevCtl: Report errors: Correctable- Non-Fatal- Fatal- Unsupported-
                        RlxdOrd- ExtTag- PhantFunc- AuxPwr- NoSnoop+
                        MaxPayload 256 bytes, MaxReadReq 512 bytes
                DevSta: CorrErr- UncorrErr- FatalErr- UnsuppReq- AuxPwr- TransPend-
                LnkCap: Port #12, Speed 8GT/s, Width x16, ASPM not supported, Exit Latency L0s <1us, L1 <4us
                        ClockPM+ Surprise- LLActRep- BwNot-
                LnkCtl: ASPM Disabled; RCB 64 bytes Disabled- CommClk-
                        ExtSynch- ClockPM- AutWidDis- BWInt- AutBWInt-
                LnkSta: Speed 8GT/s, Width x16, TrErr- Train- SlotClk+ DLActive- BWMgmt- ABWMgmt-
                DevCap2: Completion Timeout: Range AB, TimeoutDis+, LTR+, OBFF Via message
                DevCtl2: Completion Timeout: 50us to 50ms, TimeoutDis-, LTR-, OBFF Disabled
                LnkSta2: Current De-emphasis Level: -6dB, EqualizationComplete+, EqualizationPhase1+
                         EqualizationPhase2+, EqualizationPhase3+, LinkEqualizationRequest-
        Kernel driver in use: nvidia
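
To confirm the driver binding without reading the whole dump, something like the sketch below could be used. bound_drivers is a hypothetical helper name, and it assumes the short slot form (bus:device.function, no PCI domain prefix) that lspci prints above.

```shell
# Hypothetical helper (name is an assumption): print "slot driver" for
# each device in lspci -v / lspci -k style text read from stdin.
# Only matches the short slot form such as 00:05.0.
bound_drivers() {
    awk '
        /^[0-9a-f][0-9a-f]:[0-9a-f][0-9a-f]\.[0-7] / { slot = $1 }
        /Kernel driver in use:/ { print slot, $NF }
    '
}

# Usage on the guest:
#   lspci -vnnn | bound_drivers
```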


$ cat /proc/driver/nvidia/
gpus/     params    patches/  registry  version   warnings/
ubuntu@safsfd:~$ cat /proc/driver/nvidia/gpus/0000\:00\:05.0/information
Model:           GeForce GTX TITAN X
IRQ:             45
GPU UUID:        GPU-????????-????-????-????-????????????
Video BIOS:      ??.??.??.??.??
Bus Type:        PCIe
DMA Size:        40 bits
DMA Mask:        0xffffffffff
Bus Location:    0000:00:05.0
Device Minor:    0


~$  modinfo nvidia
filename:       /lib/modules/3.13.0-107-generic/kernel/drivers/video/nvidia.ko
alias:          char-major-195-*
version:        367.27
supported:      external
license:        NVIDIA
srcversion:     513914449E3DDF807584E74
alias:          pci:v000010DEd00000E00sv*sd*bc04sc80i00*
alias:          pci:v000010DEd*sv*sd*bc03sc02i00*
alias:          pci:v000010DEd*sv*sd*bc03sc00i00*
depends:
vermagic:       3.13.0-107-generic SMP mod_unload modversions
parm:           NVreg_Mobile:int
parm:           NVreg_ResmanDebugLevel:int
parm:           NVreg_RmLogonRC:int
parm:           NVreg_ModifyDeviceFiles:int
parm:           NVreg_DeviceFileUID:int
parm:           NVreg_DeviceFileGID:int
parm:           NVreg_DeviceFileMode:int
parm:           NVreg_UpdateMemoryTypes:int
parm:           NVreg_InitializeSystemMemoryAllocations:int
parm:           NVreg_UsePageAttributeTable:int
parm:           NVreg_MapRegistersEarly:int
parm:           NVreg_RegisterForACPIEvents:int
parm:           NVreg_CheckPCIConfigSpace:int
parm:           NVreg_EnablePCIeGen3:int
parm:           NVreg_EnableMSI:int
parm:           NVreg_TCEBypassMode:int
parm:           NVreg_MemoryPoolSize:int
parm:           NVreg_RegistryDwords:charp
parm:           NVreg_RmMsg:charp
parm:           NVreg_AssignGpus:charp


~$ lsmod|grep nvidia
nvidia_drm             14357  0
nvidia_modeset        764225  1 nvidia_drm
nvidia              11070409  1 nvidia_modeset
drm                   303102  4 ttm,drm_kms_helper,cirrus,nvidia_drm

Any suggestions? The only things I haven't tried are a later kernel version
and/or newer versions of libvirt/qemu.

Any help on this would be appreciated.

Thanks,

-Kevin