[vfio-users] Error RmInitAdapter failed! (0x24:0x1f:1127) from guest GPU
Kevin Vasko
kvasko at gmail.com
Thu Jan 19 01:35:03 UTC 2017
I am having trouble getting the NVIDIA drivers to work properly on my
guest VM. I keep running into the error in the subject line, which shows
up in my dmesg output.
Here is what I do:
sudo apt-get update
sudo apt-get install linux-image-generic
sudo shutdown -r now
sudo apt-get install build-essential linux-source linux-headers-`uname -r`
sudo ./NVIDIA-Linux-x86_64-367.27.run
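By default the installer builds the kernel module against the headers of the running kernel, so after the reboot it may be worth confirming that headers for `uname -r` are actually present before running the .run file. A minimal sketch, assuming the usual Ubuntu layout where /lib/modules/<release>/build points at the headers; the function name `have_headers` and its optional root argument are made up for illustration:

```shell
#!/bin/sh
# Hypothetical helper: succeed if build headers for the given kernel release
# are present under the given root directory (default: the real root).
have_headers() {
    release=$1
    root=${2:-}
    # -e follows symlinks, so a dangling "build" link counts as missing.
    [ -e "$root/lib/modules/$release/build" ]
}

# Usage on the guest:
#   have_headers "$(uname -r)" && echo "headers present" || echo "headers missing"
```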
After installing the NVIDIA 367.27 driver, I run nvidia-smi and get "No
devices were found", even though the device shows up without problems in
lspci -vnnn on the guest, which also shows the driver attached to the device.
$ sudo nvidia-smi
No devices were found
$ dmesg | grep NVRM
[ 636.432822] NVRM: loading NVIDIA UNIX x86_64 Kernel Module 367.27 Thu Jun 9 18:53:27 PDT 2016
[ 647.951682] NVRM: loading NVIDIA UNIX x86_64 Kernel Module 367.27 Thu Jun 9 18:53:27 PDT 2016
[ 664.443838] NVRM: RmInitAdapter failed! (0x24:0x1f:1127)
[ 664.443844] NVRM: rm_init_adapter failed for device bearing minor number 0
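When comparing against other reports of the same failure, it can help to pull the three-part error code out of the kernel log. A minimal sketch, assuming the message format shown in the dmesg output above; the function name `nvrm_init_codes` is made up for illustration, and no meaning is assumed for the individual fields:

```shell
#!/bin/sh
# Hypothetical helper: read kernel log text on stdin and print the three
# fields of every "RmInitAdapter failed!" line, one triple per line.
nvrm_init_codes() {
    sed -n 's/.*RmInitAdapter failed! (\(0x[0-9a-fA-F]*\):\(0x[0-9a-fA-F]*\):\([0-9]*\)).*/\1 \2 \3/p'
}

# Usage on the guest:
#   dmesg | nvrm_init_codes
```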
I have added the kvm=off flag, using "-cpu Haswell-noTSX,kvm=off".
I can confirm that it takes effect by running the following on the guest:
dmesg | grep Booting
[ 0.000000] Booting paravirtualized kernel on bare hardware
[ 0.088000] x86: Booting SMP configuration:
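The check above relies on the boot message naming the platform the guest kernel detected: "bare hardware" means the kernel did not find a paravirtualization interface, which is consistent with kvm=off hiding the KVM signature. A small sketch that extracts just that field, assuming the message wording shown above; the function name `boot_platform` is hypothetical:

```shell
#!/bin/sh
# Hypothetical helper: read kernel log text on stdin and print the platform
# named in the first "Booting paravirtualized kernel on ..." line.
boot_platform() {
    sed -n 's/.*Booting paravirtualized kernel on \(.*\)$/\1/p' | head -n 1
}

# Usage on the guest:
#   dmesg | boot_platform
```

A guest that still sees the KVM paravirt interface would be expected to report "KVM" here instead.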
Host:
$ uname -a
Linux node-81 3.19.0-79-generic #87~14.04.1-Ubuntu SMP Wed Dec 21 18:12:31 UTC 2016 x86_64 GNU/Linux
$ kvm -version
QEMU emulator version 2.3.0 (Debian 1:2.3+dfsg-5~u14.04+mos3)
$ virsh -version
1.2.9.3
Guest:
$ uname -a
Linux safsfd 3.13.0-107-generic #154-Ubuntu SMP Tue Dec 20 09:57:27 UTC 2016 x86_64 x86_64 x86_64 GNU/Linux
$ lspci -vnnn
00:05.0 VGA compatible controller: NVIDIA Corporation GM200 [GeForce GTX TITAN X] (rev a1) (prog-if 00 [VGA controller])
    Subsystem: NVIDIA Corporation Device 1132
    Physical Slot: 5
    Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx-
    Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx-
    Latency: 0, Cache Line Size: 32 bytes
    Interrupt: pin A routed to IRQ 11
    Region 0: Memory at fd000000 (32-bit, non-prefetchable) [size=16M]
    Region 1: Memory at e0000000 (64-bit, prefetchable) [size=256M]
    Region 3: Memory at f2000000 (64-bit, prefetchable) [size=32M]
    Region 5: I/O ports at c000 [size=128]
    Expansion ROM at fe000000 [disabled] [size=512K]
    Capabilities: [60] Power Management version 3
        Flags: PMEClk- DSI- D1- D2- AuxCurrent=0mA PME(D0-,D1-,D2-,D3hot-,D3cold-)
        Status: D0 NoSoftRst+ PME-Enable- DSel=0 DScale=0 PME-
    Capabilities: [68] MSI: Enable- Count=1/1 Maskable- 64bit+
        Address: 00000000fee3f00c Data: 4191
    Capabilities: [78] Express (v2) Legacy Endpoint, MSI 00
        DevCap: MaxPayload 256 bytes, PhantFunc 0, Latency L0s unlimited, L1 <64us
            ExtTag+ AttnBtn- AttnInd- PwrInd- RBE+ FLReset-
        DevCtl: Report errors: Correctable- Non-Fatal- Fatal- Unsupported-
            RlxdOrd- ExtTag- PhantFunc- AuxPwr- NoSnoop+
            MaxPayload 256 bytes, MaxReadReq 512 bytes
        DevSta: CorrErr- UncorrErr- FatalErr- UnsuppReq- AuxPwr- TransPend-
        LnkCap: Port #12, Speed 8GT/s, Width x16, ASPM not supported, Exit Latency L0s <1us, L1 <4us
            ClockPM+ Surprise- LLActRep- BwNot-
        LnkCtl: ASPM Disabled; RCB 64 bytes Disabled- CommClk-
            ExtSynch- ClockPM- AutWidDis- BWInt- AutBWInt-
        LnkSta: Speed 8GT/s, Width x16, TrErr- Train- SlotClk+ DLActive- BWMgmt- ABWMgmt-
        DevCap2: Completion Timeout: Range AB, TimeoutDis+, LTR+, OBFF Via message
        DevCtl2: Completion Timeout: 50us to 50ms, TimeoutDis-, LTR-, OBFF Disabled
        LnkSta2: Current De-emphasis Level: -6dB, EqualizationComplete+, EqualizationPhase1+
            EqualizationPhase2+, EqualizationPhase3+, LinkEqualizationRequest-
    Kernel driver in use: nvidia
$ cat /proc/driver/nvidia/
gpus/ params patches/ registry version warnings/
ubuntu@safsfd:~$ cat /proc/driver/nvidia/gpus/0000\:00\:05.0/information
Model: GeForce GTX TITAN X
IRQ: 45
GPU UUID: GPU-????????-????-????-????-????????????
Video BIOS: ??.??.??.??.??
Bus Type: PCIe
DMA Size: 40 bits
DMA Mask: 0xffffffffff
Bus Location: 0000:00:05.0
Device Minor: 0
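The question-mark values for GPU UUID and Video BIOS in the output above appear to be placeholders, which would fit a driver that bound to the device but could not finish initializing it. A minimal sketch for flagging such fields automatically, assuming the "Name: value" layout shown above; the function name `placeholder_fields` is made up for illustration:

```shell
#!/bin/sh
# Hypothetical helper: read an "information" file on stdin and print the names
# of fields whose value is made up only of '?' placeholders, optionally with a
# "GPU-" prefix and '-' or '.' separators.
placeholder_fields() {
    awk -F':[ \t]*' '$2 ~ /^(GPU-)?[?.-]+[ \t]*$/ && $2 ~ /\?/ { print $1 }'
}

# Usage on the guest:
#   placeholder_fields < /proc/driver/nvidia/gpus/0000:00:05.0/information
```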
~$ modinfo nvidia
filename: /lib/modules/3.13.0-107-generic/kernel/drivers/video/nvidia.ko
alias: char-major-195-*
version: 367.27
supported: external
license: NVIDIA
srcversion: 513914449E3DDF807584E74
alias: pci:v000010DEd00000E00sv*sd*bc04sc80i00*
alias: pci:v000010DEd*sv*sd*bc03sc02i00*
alias: pci:v000010DEd*sv*sd*bc03sc00i00*
depends:
vermagic: 3.13.0-107-generic SMP mod_unload modversions
parm: NVreg_Mobile:int
parm: NVreg_ResmanDebugLevel:int
parm: NVreg_RmLogonRC:int
parm: NVreg_ModifyDeviceFiles:int
parm: NVreg_DeviceFileUID:int
parm: NVreg_DeviceFileGID:int
parm: NVreg_DeviceFileMode:int
parm: NVreg_UpdateMemoryTypes:int
parm: NVreg_InitializeSystemMemoryAllocations:int
parm: NVreg_UsePageAttributeTable:int
parm: NVreg_MapRegistersEarly:int
parm: NVreg_RegisterForACPIEvents:int
parm: NVreg_CheckPCIConfigSpace:int
parm: NVreg_EnablePCIeGen3:int
parm: NVreg_EnableMSI:int
parm: NVreg_TCEBypassMode:int
parm: NVreg_MemoryPoolSize:int
parm: NVreg_RegistryDwords:charp
parm: NVreg_RmMsg:charp
parm: NVreg_AssignGpus:charp
~$ lsmod|grep nvidia
nvidia_drm 14357 0
nvidia_modeset 764225 1 nvidia_drm
nvidia 11070409 1 nvidia_modeset
drm 303102 4 ttm,drm_kms_helper,cirrus,nvidia_drm
Any suggestions? The only things I haven't tried are a later kernel
version and/or newer versions of libvirt/qemu.
Any help on this would be appreciated.
Thanks,
-Kevin