[libvirt] [PATCHv2 11/11] qemu_driver: BaselineHypervisorCPU supports S390 using QEMU/QMP

Collin Walling walling at linux.ibm.com
Mon Jul 23 20:16:55 UTC 2018


On 07/21/2018 12:05 AM, Chris Venteicher wrote:
> Quoting David Hildenbrand (2018-07-18 02:26:24)
>> On 18.07.2018 00:39, Collin Walling wrote:
>>> On 07/17/2018 05:01 PM, David Hildenbrand wrote:
>>>> On 13.07.2018 18:00, Jiri Denemark wrote:
>>>>> On Mon, Jul 09, 2018 at 22:56:55 -0500, Chris Venteicher wrote:
>>>>>> Transient S390 configurations require using QEMU to compute CPU Model
>>>>>> Baseline and to do CPU Feature Expansion.
>>>>>>
>>>>>> Start and use a single QEMU instance to do both the baseline and
>>>>>> expansion transactions required by BaselineHypervisorCPU.
>>>>>>
>>>>>> CPU Feature Expansion uses true / false to indicate if property is/isn't
>>>>>> included in model. Baseline only returns property list where all
>>>>>> enumerated properties are included.
>>>>>
>>>>> So are you saying on s390 there's no chance there would be a CPU model
>>>>> with some feature which is included in the CPU model disabled for some
>>>>> reason? Sounds too good to be true :-) (This is the question I referred
>>>>> to in one of my replies to the other patches.)
>>>>
>>>> Giving some background information: When we expand/baseline CPU models,
>>>> we always expand them to the "-base" variants of our CPU models, which
>>>> contain some set of features we expect to be around in all sane
>>>> configurations ("minimal feature set").
>>>>
>>>> It is very unlikely that we ever have something like
>>>> "z14-base,featx=off", but it could happen
>>>>  - When using an emulator (TCG)
>>>>  - When running nested and the guest hypervisor is started with such a
>>>>    strange CPU model
>>>>  - When something in the HW is very wrong or eventually removed in the
>>>>    future (unlikely but possible)
>>>>
>>>> On some very weird inputs to a baseline request, such a strange model
>>>> can also be the result. But it is very unusual.
>>>>
>>>> I assume something like "baseline z14-base,featx=off with z14-base" will
>>>> result in "z14-base,featx=off", too.
>>>>
>>>>
>>>
>>> That's correct. A CPU model with a feature disabled that is baseline with a CPU 
>>> model with that same feature enabled will omit that feature in the QMP response.
>>>
>>> e.g. if z14-base has features x, y, z then
>>>
>>> "baseline z14-base,featx=off with z14-base" will result in "z14-base,featy=on,featz=on"
> 
> I am runing tests on both S390 and X86 (hacked QEMU to enable baseline).
> 
> I don't see a "false" property in the baseline response in any of the logs.

Right... baseline should not be returning any properties paired with false. It
constructs a third CPU model with properties that can run on both CPUs.

> 
> I did try to slip a "zpci":false into the query-cpu-model-baseline but I still 
> don't get a false in the response.
> 

Sending a property paired with "false" in the JSON object is telling QEMU "I want 
to turn off this feature." The feature will then be omitted from the QMP response.

> Here is the request/response for reference.
> 
> {"execute":"query-cpu-model-baseline",
>  "arguments":{"modela":{"name":"z14"},
>               "modelb":{"name":"z13","props":{"msa5":true,"exrl":true,"zpci":false}}},
>  "id":"libvirt-2"} 
> 
> {"return": {"model": {"name": "z13-base","props": {"aen": true, "aefsi": true, 
> "msa5": true, "msa4": true, "msa3": true, "msa2": true, "msa1": true, "sthyi": 
> true, "edat": true, "ri": true, "edat2": true, "vx": true, "ipter": true, 
> "esop": true, "cte": true, "sea_esop2": true, "te": true, "cmm": true}}}, "id": 
> "libvirt-2"}
> 
>>> Usually we try to not chose a model with stripped off base features ("we
>> try to produce a model that looks sane"), but instead fallback to some
>> very ancient CPU model. E.g.
>>
>> { "execute": "query-cpu-model-baseline", "arguments" : { "modela": {
>> "name": "z14-base", "props": {"msa" : false}}, "modelb": { "name": "z14"}} }
>>
>> -> {"return": {"model": {"name": "z800-base", "props": {"etf2": true,
>> "ldisp": true}}}}
>>
>> We might want to change that behavior in the future however (or maybe it
>> already is like this for some corner cases) - assume some base feature
>> gets dropped by HW in a new CPU generation. We don't always want to
>> fallback to a z900 or so when baselining. So one should assume that we
>> can have disabled features here.
>>
>> Especially as there is a BUG in QEMU I'll have to fix:
>>
>> { "execute": "query-cpu-model-baseline", "arguments" : { "modela": {
>> "name": "z14-base", "props": {"esan3" : false}}, "modelb": { "name":
>> "z14"}} }
>>
>> -> Segmentation fault
>>
>> This would have to produce a model with esan3 disabled (very very
>> unlikely to ever happen in real life :) )
>>
>> The result should be something like {"model": {"name": "z900-base",
>> "props": {"esan3": false}}} or an error that they cannot be baselined.
>>
> 
> Seems like were saying I should be filtering out or otherwise property excluding
> any false properties that are returned. Please correct if I have this wrong.> 
> I currently do filtering / exclusion on the result of expansion but seems like I
> should be doing filtering on baseline output too if we don't do the expansion 
> just in case baseline would return a false property for some reason.
> 
Filtering is not necessary. All properties in baseline or a static expansion response will 
either be paired with "true" or a feature will be absent. The only occurrence of a property 
paired with "false" is during a "full" expansion, where QEMU will return ALL features supported 
by the hypervisor and weather or not that feature is supported on the specified CPU model.

Looking back at your code, you'll need to make sure the baseline JSON function captures the
props field. If the libvirt command was used with --features flag, then the captured features
should be output. Otherwise just output the model name. (Additionally, the --migratable flag 
won't make a difference for s390).

>>
>>>
>>> Since baseline will just report a base cpu and its minimal feature set... I wonder if it
>>> makes sense for libvirt to just report the features resulting from the baseline command 
>>> instead of later calling expansion?
>>
>> Yes it does and the docs say:
>>
>> "Baseline two CPU models, creating a compatible third model. The created
>> model will always be a static, migration-safe CPU model (see "static"
>> CPU model expansion for details)"
>>
> 
> Here is my understanding: 
> 
> 1) query-cpu-model-baseline will only return migratable properties.
> 

Yes.

> 2) query-cpu-model-expansion on S390 only returns migratable properties.
> 

Yup.

> In fact, here is what happens if you ask S390 for non-migratable features too:
> 
> {"execute":"query-cpu-model-expansion",
>  "arguments":{"type":"static","model":{"name":"host","props":{"migratable":false}}},"id":"libvirt-45"} 
> 
> Line [{"id": "libvirt-45", "error": {"class": "GenericError", "desc": "Property '.migratable' not found"}}]
> 
> 3) There seem to be two usecases for expansion
> 
> A/ Enumrate migratable properties in the baseline model
> B/ Enumerate both migratable and nonmigratable props in baseline model.
>     
> B/ Doesn't work on S390 but A/ does.  Here is what A/ looks like:
> 
> {"execute":"query-cpu-model-baseline",
>  "arguments":{"modela":{"name":"z13-base"},"modelb":{"name":"z13-base"}},
>  "id":"libvirt-4"} 
> 
> Line [{"return": {"model": {"name": "z13-base"}}, "id": "libvirt-4"}]

This looks right. A "-base" model is a CPU model with a "minimum feature set,"
so it makes sense to not see any features listed here. 

> 
> ***
> 
> {"execute":"query-cpu-model-expansion",
>  "arguments":{"type":"full","model":{"name":"z13-base"}},"id":"libvirt-5"} 

AFAIU, a "full" expansion will show *all* features the hypervisor knows about and weather 
or not that feature can be enabled ("true") on the specified CPU model or not ("false"). All
features listed have migration support.

> 
> Line [{"return": {"model": {"name": "z13-base", "props": {"pfmfi": false, 
> "exrl": true, "stfle45": true, "cmma": false, "dateh2": true, "aen": false, 
> "gen13ptff": true, "dateh": true, "cmmnt": false, "iacc2": true, "parseh": true, 
> "csst": true, "idter": false, "idtes": true, "msa": true, "aefsi": false, 
> "hpma2": false, "csst2": true, "csske": true, "mepoch": false, "msa8": false, 
> "msa7": false, "msa6": false, "msa5": false, "msa4": false, "msa3": false, 
> "msa2": false, "msa1": false, "sthyi": false, "stckf": true, "stfle": true, 
> "edat": false, "etf3": true, "etf2": true, "hfpm": true, "ri": false, "edat2":
> false, "hfpue": true, "dfp": true, "mvcos": true, "sprogp": true, "sigpif": 
> false, "ldisphp": true, "vx": false, "ipter": false, "emon": true, "cei": false, 
> "cmpsceh": true, "ginste": true, "dfppc": true, "dfpzc": true, "dfphp": true, 
> "stfle49": true, "mepochptff": false, "opc": false, "asnlxr": true, "gpereh": 
> false, "minste2": false, "vxeh": false, "vxpd": false, "esop": false, "ectg": 
> true, "ib": false, "siif": false, "tsi": false, "tpei": false, "esan3": true, 
> "fpe": true, "ibs": false, "zarch": true, "stfle53": true, "sief2": false, 
> "eimm": true, "iep": false, "irbm": false, "srs": true, "kss": false, "cte": 
> false, "ais": false, "fpseh": true, "ltlbc": true, "ldisp": true, "bpb": false, 
> "64bscao": false, "ctop": false, "gs": false, "sema": false, "etf3eh": true, 
> "etf2eh": true, "eec": false, "ppa15": false, "zpci": false, "nonqks": true, 
> "sea_esop2": false, "pfpo": true, "te": false, "cmm": false, "tods": true, 
> "plo": true, "gsls": false, "skey": false}}}, "id": "libvirt-5"}]
> 
>>>
>>>>>
>>>>> Most of the code you added in this patch is indented by three spaces
>>>>> while we use four spaces in libvirt.
>>>>>
>>>>>> ---
>>>>>>  src/qemu/qemu_driver.c | 74 +++++++++++++++++++++++++++++++++++++-----
>>>>>>  1 file changed, 65 insertions(+), 9 deletions(-)
>>>>>>
>>>>>> diff --git a/src/qemu/qemu_driver.c b/src/qemu/qemu_driver.c
>>>>>> index 9a35e04a85..6c6107f077 100644
>>>>>> --- a/src/qemu/qemu_driver.c
>>>>>> +++ b/src/qemu/qemu_driver.c
>>>>>> @@ -13400,10 +13400,13 @@ qemuConnectBaselineHypervisorCPU(virConnectPtr conn,
>>>>>>      virArch arch;
>>>>>>      virDomainVirtType virttype;
>>>>>>      virDomainCapsCPUModelsPtr cpuModels;
>>>>>> -    bool migratable;
>>>>>> +    bool migratable_only;
>>>>>
>>>>> Why? The bool says the user specified
>>>>> VIR_CONNECT_BASELINE_CPU_MIGRATABLE which means they want a migratable
>>>>> CPU back. What does the "_only" part mean? This API does not return
>>>>> several CPUs, it only returns a single one and it's either migratable or
>>>>> not.
>>>>>
>>>>>>      virCPUDefPtr cpu = NULL;
>>>>>>      char *cpustr = NULL;
>>>>>>      char **features = NULL;
>>>>>> +    virQEMUCapsInitQMPCommandPtr cmd = NULL;
>>>>>> +    bool forceTCG = false;
>>>>>> +    qemuMonitorCPUModelInfoPtr modelInfo = NULL;
>>>>>>  
>>>>>>      virCheckFlags(VIR_CONNECT_BASELINE_CPU_EXPAND_FEATURES |
>>>>>>                    VIR_CONNECT_BASELINE_CPU_MIGRATABLE, NULL);
>>>>>> @@ -13411,8 +13414,6 @@ qemuConnectBaselineHypervisorCPU(virConnectPtr conn,
>>>>>>      if (virConnectBaselineHypervisorCPUEnsureACL(conn) < 0)
>>>>>>          goto cleanup;
>>>>>>  
>>>>>> -    migratable = !!(flags & VIR_CONNECT_BASELINE_CPU_MIGRATABLE);
>>>>>> -
>>>>>>      if (!(cpus = virCPUDefListParse(xmlCPUs, ncpus, VIR_CPU_TYPE_AUTO)))
>>>>>>          goto cleanup;
>>>>>>  
>>>>>> @@ -13425,6 +13426,19 @@ qemuConnectBaselineHypervisorCPU(virConnectPtr conn,
>>>>>>      if (!qemuCaps)
>>>>>>          goto cleanup;
>>>>>>  
>>>>>> +    /* QEMU can enumerate non-migratable cpu model features for some archs like x86
>>>>>> +     * migratable_only == true:  ask for and include only migratable features
>>>>>> +     * migratable_only == false: ask for and include all features
>>>>>> +     */
>>>>>> +    migratable_only = !!(flags & VIR_CONNECT_BASELINE_CPU_MIGRATABLE);
>>>>>> +
>>>>>> +    if (ARCH_IS_S390(arch)) {
>>>>>> +       /* QEMU for S390 arch only enumerates migratable features
>>>>>> +        * No reason to explicitly ask QEMU for or include non-migratable features
>>>>>> +        */
>>>>>> +       migratable_only = true;
>>>>>> +    }
>>>>>> +
>>>>>
>>>>> And what if they come up with some features which are not migratable in
>>>>> the future? I don't think there's any reason for this API to mess with
>>>>> the value. The code should just provide the same CPU in both cases for
>>>>> s390.
>>>>
>>>> s390x usually only provides features if they are migratable. Could it
>>>> happen it the future that we have something that cannot be migrated?
>>>> Possible but very very unlikely. We cared about migration (even for
>>>> nested support) right from the beginning. As of now, we have no features
>>>> that are supported by QEMU that cannot be migrated - in contrast to e.g.
>>>> x86 (e.g. VMX/SVM). Every new feature has to be whitelisted in QEMU -
>>>> and we only allow to do so if migration is in place for it.
>>>>
> A problem here is if I set migratable_only (or migratable) to false then
> property "migratable":false is added to query-cpu-model-expansion.
> 
> If X86 && model "name":"host" (limted of names) you get a successfull result.
> 
> For all other archs and specific model names like (z13, IvyBridge) you get this:
> Line [{"id": "libvirt-45","error": {"class": "GenericError", "desc": "Property '.migratable' not found"}}]
> 
> So unless something changes on the QEMU side you get nothing if you try to get
> query-cpu-model-expansion to enumerate non-migratable features outside the X86 +
> name: host type of usecases.
> >>>
>>> You're a gold mine on this kind of information.  I may have to pester you about some 
>>> CPU model related stuff in the future :)
>>
>> Sure, just CC or ask me and I'm happy to help.
>>
>>
>> -- 
>>
>> Thanks,
>>
>> David / dhildenb
> 


-- 
Respectfully,
- Collin Walling




More information about the libvir-list mailing list