[vfio-users] vfio passthrough devices behind pcie switcher problem
Alex Williamson
alex.williamson at redhat.com
Thu Mar 9 13:49:48 UTC 2017
On Thu, 9 Mar 2017 11:47:32 +0800
rhett rhett <rhett.kernel at gmail.com> wrote:
> somebody can help me ?
I asked for VM commandline or XML, you haven't provided it. I asked
for lspci info, you haven't provided it. Help us help you.
> 2017-03-08 14:34 GMT+08:00 rhett rhett <rhett.kernel at gmail.com>:
>
> > here's some more error log from centos guest:
> >
> > Mar 7 05:38:07 localhost kernel: NVRM: loading NVIDIA UNIX x86_64 Kernel
> > Module 375.39 Tue Jan 31 20:47:00 PST 2017 (using threaded interrupts)
> > Mar 7 05:38:08 localhost kernel: nvidia-modeset: Loading NVIDIA Kernel
> > Mode Setting Driver for UNIX platforms 375.39 Tue Jan 31 19:41:48 PST 2017
> > Mar 7 05:39:27 localhost kernel: NVRM: RmInitAdapter failed!
> > (0x24:0x51:1060)
> > Mar 7 05:39:27 localhost kernel: NVRM: rm_init_adapter failed for device
> > bearing minor number 0
> > Mar 7 05:43:40 localhost kernel: NVRM: RmInitAdapter failed!
> > (0x24:0x51:1060)
> > Mar 7 05:43:40 localhost kernel: NVRM: rm_init_adapter failed for device
> > bearing minor number 0
> > Mar 8 05:07:47 localhost kernel: nvidia: module license 'NVIDIA' taints
> > kernel.
> > Mar 8 05:07:47 localhost kernel: NVRM: loading NVIDIA UNIX x86_64 Kernel
> > Module
> >
> > 2017-03-08 14:31 GMT+08:00 rhett rhett <rhett.kernel at gmail.com>:
> >
> >> i have two guest , a windows 2008 server and a centos 7.2 . in windows,
> >> the device manager said the gpu can't start ,error code 10.
> >> in centos, when i run nvidia-smi, it said no device found.
> >>
> >> no specil vm configurations, whit the same config, i can use gpu
> >> successfully in my two gpu server. the biggest different is , that server
> >> is no pcie switcher.
> >>
> >> 2017-03-08 11:55 GMT+08:00 Alex Williamson <alex.williamson at redhat.com>:
> >>
> >>> On Wed, 8 Mar 2017 11:26:17 +0800
> >>> rhett rhett <rhett.kernel at gmail.com> wrote:
> >>>
> >>> > two gpus share the same irq , i found the reason. because the msi be
> >>> > disabled later , so irq 140 is being reused.
> >>> >
> >>> > but i don't know why somebady calls vfio_pci_ioctl to disable the msi.
> >>>
> >>> vfio just does what the guest requests, but you're really providing
> >>> hardly any more information than when you asked off list. My wild
> >>> guess, is that maybe you're running a Windows guest and not configuring
> >>> the VM for a vCPU type where Windows supports MSI. For more
> >>> assistance, please provide basic information, like the QEMU command
> >>> line or VM XML, also the PCI information from the host (sudo lspci
> >>> -vvv), and of course any error codes in the guest or an actual
> >>> description of how the device doesn't work in the guest. Thanks,
> >>>
> >>> Alex
> >>>
> >>>
> >>> > 2017-03-08 10:55 GMT+08:00 rhett rhett <rhett.kernel at gmail.com>:
> >>> >
> >>> > > i have a question about vfio , here is my description.
> >>> > >
> >>> > > i have 8 gpus in my server machine , but they are all behind a pcie
> >>> > > bridge. when i make a vfio passthrough , i can't use the gpus in my
> >>> guest
> >>> > > os.
> >>> > > dmesg shows the following message
> >>> > >
> >>> > > [ 662.208072] vfio-pci 0000:87:00.0: irq 140 for MSI/MSI-X
> >>> > > [ 725.761623] vfio-pci 0000:04:00.0: irq 140 for MSI/MSI-X
> >>> > >
> >>> > > i started two vm , one use 87 and another use 04, dmesg shows that
> >>> they
> >>> > > share the same irq 140 . is this normal ?
> >>> > >
> >>> > > i also saw the iommu groups, each gpu stays in a separate group, and
> >>> with
> >>> > > no other device in group. so this means ACS works correctly ?
> >>> > >
> >>> > > hope to get your helps !
> >>> > >
> >>>
> >>>
> >>
> >
More information about the vfio-users
mailing list