[libvirt] [PATCH v3 06/20] backup: Document new XML for backups

Vladimir Sementsov-Ogievskiy vsementsov at virtuozzo.com
Wed Nov 7 14:49:01 UTC 2018


25.10.2018 22:20, Eric Blake wrote:
> Prepare for new checkpoint and backup APIs by describing the XML
> that will represent a checkpoint and backup.  The checkpoint XML
> is modeled heavily after virDomainSnapshotPtr, since both represent
> a point in time of the guest (however, a snapshot exists with the
> intent to roll back to that point, while a checkpoint exists to
> facilitate later incremental backups). Meanwhile, the backup XML
> has enough information to represent both push model (the hypervisor
> writes the backup file to a location of the user's choice) and the
> pull model (the hypervisor needs local temporary storage, and also
> creates an NBD server that the user can use to read the backup via
> a third-party client)..  But while a snapshot exists with the
> intent of rolling back to that state, a checkpoint instead makes it
> possible to create an incremental backup at a later time.

hm, the last sentence is a duplication of "(however," above

>
> Add testsuite coverage for some minimal uses of both XML.
>
> Ultimately, I'd love for push model backups to target a network
> driver rather than just a local file or block device; but doing
> that got hairy

got hairy = is complicated?

>   (while <domain> uses <source> as the description
> of a host or network resource, I picked <target> as the description
> of a push model backup target [defaults to qcow2 but can also be
> raw or any other format]

As I remember, it defaults to source disk format.

> , and <scratch> as the description
> of a pull model backup scratch space [must be qcow2]). The ideal
> refactoring would be a way to parameterize RNG to accept
> <disk type='FOO'>...</disk> so that the name of the subelement
> can be <source> for domain, or <target> or <scratch> as needed for
> backups. Future patches may improve this area of code.
>
> Signed-off-by: Eric Blake <eblake at redhat.com>
>
> ---
> v2: apply (some) wording changes from review
> ---
>   docs/docs.html.in                            |   3 +-
>   docs/domainstatecapture.html.in              |   4 +-
>   docs/format.html.in                          |   1 +
>   docs/formatcheckpoint.html.in                | 291 +++++++++++++++++++
>   docs/index.html.in                           |   3 +-
>   docs/schemas/domainbackup.rng                | 185 ++++++++++++
>   docs/schemas/domaincheckpoint.rng            |  94 ++++++
>   libvirt.spec.in                              |   2 +
>   mingw-libvirt.spec.in                        |   4 +
>   tests/Makefile.am                            |   6 +-
>   tests/domainbackupxml2xmlin/backup-pull.xml  |   9 +
>   tests/domainbackupxml2xmlin/backup-push.xml  |   9 +
>   tests/domainbackupxml2xmlin/empty.xml        |   1 +
>   tests/domainbackupxml2xmlout/backup-pull.xml |   9 +
>   tests/domainbackupxml2xmlout/backup-push.xml |   9 +
>   tests/domainbackupxml2xmlout/empty.xml       |   7 +
>   tests/domaincheckpointxml2xmlin/empty.xml    |   1 +
>   tests/domaincheckpointxml2xmlin/sample.xml   |   7 +
>   tests/domaincheckpointxml2xmlout/empty.xml   |  10 +
>   tests/domaincheckpointxml2xmlout/sample.xml  |  16 +
>   tests/virschematest.c                        |   4 +
>   21 files changed, 670 insertions(+), 5 deletions(-)
>   create mode 100644 docs/formatcheckpoint.html.in
>   create mode 100644 docs/schemas/domainbackup.rng
>   create mode 100644 docs/schemas/domaincheckpoint.rng
>   create mode 100644 tests/domainbackupxml2xmlin/backup-pull.xml
>   create mode 100644 tests/domainbackupxml2xmlin/backup-push.xml
>   create mode 100644 tests/domainbackupxml2xmlin/empty.xml
>   create mode 100644 tests/domainbackupxml2xmlout/backup-pull.xml
>   create mode 100644 tests/domainbackupxml2xmlout/backup-push.xml
>   create mode 100644 tests/domainbackupxml2xmlout/empty.xml
>   create mode 100644 tests/domaincheckpointxml2xmlin/empty.xml
>   create mode 100644 tests/domaincheckpointxml2xmlin/sample.xml
>   create mode 100644 tests/domaincheckpointxml2xmlout/empty.xml
>   create mode 100644 tests/domaincheckpointxml2xmlout/sample.xml
>
> diff --git a/docs/docs.html.in b/docs/docs.html.in
> index 4c46b74980..4914e7dbed 100644
> --- a/docs/docs.html.in
> +++ b/docs/docs.html.in
> @@ -79,7 +79,8 @@
>             <a href="formatdomaincaps.html">domain capabilities</a>,
>             <a href="formatnode.html">node devices</a>,
>             <a href="formatsecret.html">secrets</a>,
> -          <a href="formatsnapshot.html">snapshots</a></dd>
> +          <a href="formatsnapshot.html">snapshots</a>,
> +          <a href="formatcheckpoint.html">backups and checkpoints</a></dd>
>
>           <dt><a href="uri.html">URI format</a></dt>
>           <dd>The URI formats used for connecting to libvirt</dd>
> diff --git a/docs/domainstatecapture.html.in b/docs/domainstatecapture.html.in
> index f7f2fe0b98..9b890b4c0c 100644
> --- a/docs/domainstatecapture.html.in
> +++ b/docs/domainstatecapture.html.in
> @@ -259,9 +259,9 @@
>           a checkpoint as a side-effect of starting a new incremental
>           backup with <code>virDomainBackupBegin()</code>, since a
>           second incremental backup is most useful when using the
> -        checkpoint created during the first.  <!--See also
> +        checkpoint created during the first.  See also
>           the <a href="formatcheckpoint.html">XML details</a> used with
> -        this command.--></dd>
> +        this command.</dd>
>
>         <dt>virDomainBackupBegin(), virDomainBackupEnd()</dt>
>         <dd>This API wraps approaches for capturing the state of disks
> diff --git a/docs/format.html.in b/docs/format.html.in
> index 22b23e3fc7..8c4e15e079 100644
> --- a/docs/format.html.in
> +++ b/docs/format.html.in
> @@ -24,6 +24,7 @@
>         <li><a href="formatnode.html">Node devices</a></li>
>         <li><a href="formatsecret.html">Secrets</a></li>
>         <li><a href="formatsnapshot.html">Snapshots</a></li>
> +      <li><a href="formatcheckpoint.html">Backups and checkpoints</a></li>
>       </ul>
>
>       <h2>Command line validation</h2>
> diff --git a/docs/formatcheckpoint.html.in b/docs/formatcheckpoint.html.in
> new file mode 100644
> index 0000000000..6d66bd0511
> --- /dev/null
> +++ b/docs/formatcheckpoint.html.in
> @@ -0,0 +1,291 @@
> +<?xml version="1.0" encoding="UTF-8"?>
> +<!DOCTYPE html>
> +<html xmlns="http://www.w3.org/1999/xhtml">
> +  <body>
> +    <h1>Checkpoint and Backup XML format</h1>
> +
> +    <ul id="toc"></ul>
> +
> +    <h2><a id="CheckpointAttributes">Checkpoint XML</a></h2>
> +
> +    <p>
> +      One method of capturing domain disk backups is via the use of
> +      incremental backups.  Right now, incremental backups are only
> +      supported for the qemu hypervisor when using qcow2 disks at the
> +      active layer; if other disk formats are in use, capturing disk
> +      backups requires different libvirt APIs

and the reason for this limitation, is that we can store persistent 
dirty bitmaps only in qcow2..

> +      (see <a href="domainstatecapture.html">domain state capture</a>
> +      for a comparison between APIs).
> +    </p>
> +    <p>
> +      Libvirt is able to facilitate incremental backups by tracking
> +      disk checkpoints, which are points in time against which it is
> +      easy to compute which portion of the disk has changed.  Given a
> +      full backup (a backup created from the creation of the disk to a
> +      given point in time), coupled with the creation of a disk
> +      checkpoint at that time, and an incremental backup (a backup
> +      created from just the dirty portion of the disk between the
> +      first checkpoint and the second backup operation), it is
> +      possible to do an offline reconstruction of the state of the
> +      disk at the time of the second backup without having to copy as
> +      much data as a second full backup would require.  Most disk
> +      checkpoints are created in concert with a backup
> +      via <code>virDomainBackupBegin()</code>; however, libvirt also
> +      exposes enough support to create disk checkpoints independently
> +      from a backup operation
> +      via <code>virDomainCheckpointCreateXML()</code>.
> +    </p>
> +    <p>
> +      Attributes of libvirt checkpoints are stored as child elements
> +      of the <code>domaincheckpoint</code> element.  At checkpoint
> +      creation time, normally only
> +      the <code>name</code>, <code>description</code>,
> +      and <code>disks</code> elements are settable. The rest of the
> +      fields are ignored on creation and will be filled in by libvirt
> +      in for informational purposes
> +      by <code>virDomainCheckpointGetXMLDesc()</code>.  However, when
> +      redefining a checkpoint, with
> +      the <code>VIR_DOMAIN_CHECKPOINT_CREATE_REDEFINE</code> flag
> +      of <code>virDomainCheckpointCreateXML()</code>, all of the XML
> +      fields described here are relevant.
> +    </p>
> +    <p>
> +      Checkpoints are maintained in a hierarchy.  A domain can have a
> +      current checkpoint, which is the most recent checkpoint compared to
> +      the current state of the domain (although a domain might have
> +      checkpoints without a current checkpoint, if checkpoints have been
> +      deleted in the meantime).

hmm.. why? if we remove current checkpoint, should not we just set it's 
parent as current?

>    Creating or reverting to a checkpoint
> +      sets that checkpoint as current, and the prior current checkpoint is
> +      the parent of the new checkpoint.  Branches in the hierarchy can
> +      be formed by reverting to a checkpoint with a child, then creating
> +      another checkpoint.
> +    </p>
> +    <p>
> +      The top-level <code>domaincheckpoint</code> element may contain
> +      the following elements:
> +    </p>
> +    <dl>
> +      <dt><code>name</code></dt>
> +      <dd>The name for this checkpoint.  If the name is specified when
> +        initially creating the checkpoint, then the checkpoint will have
> +        that particular name.  If the name is omitted when initially
> +        creating the checkpoint, then libvirt will make up a name for
> +        the checkpoint, based on the time when it was created.
> +      </dd>
> +      <dt><code>description</code></dt>
> +      <dd>A human-readable description of the checkpoint.  If the
> +        description is omitted when initially creating the checkpoint,
> +        then this field will be empty.
> +      </dd>
> +      <dt><code>disks</code></dt>
> +      <dd>On input, this is an optional listing of specific
> +        instructions for disk checkpoints; it is needed when making a
> +        checkpoint on only a subset of the disks associated with a
> +        domain (in particular, since qemu checkpoints require qcow2
> +        disks, this element may be needed on input for excluding guest
> +        disks that are not in qcow2 format); if the entire element was
> +        omitted on input, then all disks participate in the
> +        checkpoint, but if individual disks were omitted from the
> +        element, they will not be part of the checkpoint.  On output,
> +        this is fully populated to show the state of each disk in the
> +        checkpoint.  This element has a list of <code>disk</code>
> +        sub-elements, describing anywhere from one to all of the disks
> +        associated with the domain.
> +        <dl>
> +          <dt><code>disk</code></dt>
> +          <dd>This sub-element describes the checkpoint properties of
> +            a specific disk.  The attribute <code>name</code> is
> +            mandatory, and must match either the <code><target
> +            dev='name'/></code> or an unambiguous <code><source
> +            file='name'/></code> of one of
> +            the <a href="formatdomain.html#elementsDisks">disk
> +            devices</a> specified for the domain at the time of the
> +            checkpoint.  The attribute <code>checkpoint</code> is
> +            optional on input; possible values are <code>no</code>
> +            when the disk does not participate in this checkpoint;
> +            or <code>bitmap</code> if the disk will track all changes
> +            since the creation of this checkpoint via a bitmap, in
> +            which case another attribute <code>bitmap</code> will be
> +            the name of the tracking bitmap (defaulting to the
> +            checkpoint name).  On output, an additional
> +            attribute <code>size</code> may be present if
> +            the <code>VIR_DOMAIN_CHECKPOINT_XML_SIZE</code> flag was
> +            used to perform a dynamic query of the estimated size in
> +            bytes of the changes made since the checkpoint was created.

from creation to what? to the next check-point? or to current 
checkpoint? But what if current is not a descendant of this checkpoint, 
and/or this checkpoint has several children?

> +          </dd>
> +        </dl>
> +      </dd>
> +      <dt><code>creationTime</code></dt>
> +      <dd>The time this checkpoint was created.  The time is specified
> +        in seconds since the Epoch, UTC (i.e. Unix time).  Readonly.
> +      </dd>
> +      <dt><code>parent</code></dt>
> +      <dd>The parent of this checkpoint.  If present, this element
> +        contains exactly one child element, name.  This specifies the
> +        name of the parent checkpoint of this one, and is used to
> +        represent trees of checkpoints.  Readonly.
> +      </dd>
> +      <dt><code>domain</code></dt>
> +      <dd>The inactive <a href="formatdomain.html">domain
> +        configuration</a> at the time the checkpoint was created.
> +        Readonly.
> +      </dd>
> +    </dl>
> +
> +    <h2><a id="BackupAttributes">Backup XML</a></h2>
> +
> +    <p>
> +      Creating a backup, whether full or incremental, is done
> +      via <code>virDomainBackupBegin()</code>, which takes an XML
> +      description of the actions to perform.  There are two general
> +      modes for backups: a push mode (where the hypervisor writes out
> +      the data to the destination file, which may be local or remote),
> +      and a pull mode (where the hypervisor creates an NBD server that
> +      a third-party client can then read as needed, and which requires
> +      the use of temporary storage, typically local, until the backup
> +      is complete).
> +    </p>
> +    <p>
> +      The instructions for beginning a backup job are provided as
> +      attributes and elements of the
> +      top-level <code>domainbackup</code> element.  This element
> +      includes an optional attribute <code>mode</code> which can be
> +      either "push" or "pull" (default push).  Where elements are
> +      optional on creation, <code>virDomainBackupGetXMLDesc()</code>
> +      can be used to see the actual values selected (for example,
> +      learning which port the NBD server is using in the pull model,
> +      or what file names libvirt generated when none were supplied).
> +      The following child elements are supported:
> +    </p>
> +    <dl>
> +      <dt><code>incremental</code></dt>
> +      <dd>Optional. If this element is present, it must name an
> +        existing checkpoint of the domain, which will be used to make
> +        this backup an incremental one (in the push model, only
> +        changes since the checkpoint are written to the destination;
> +        in the pull model, the NBD server uses the
> +        NBD_OPT_SET_META_CONTEXT extension to advertise to the client
> +        which portions of the export contain changes since the
> +        checkpoint).  If omitted, a full backup is performed.

I'm sure in future we'll want (at least as an option) make only changed 
data available for third software for incremental pull backup, returning 
EINVAL for not-dirty regions, to avoid extra copy-before-write 
operations, supporting this pull backup.

> +      </dd>
> +      <dt><code>server</code></dt>
> +      <dd>Present only for a pull mode backup.  Contains the same
> +        attributes as the <code>protocol</code> element of a disk
> +        attached via NBD in the domain (such as transport, socket,
> +        name, port, or tls), necessary to set up an NBD server that
> +        exposes the content of each disk at the time the backup
> +        started.
> +      </dd>
> +      <dt><code>disks</code></dt>
> +      <dd>This is an optional listing of instructions for disks
> +        participating in the backup (if omitted, all disks
> +        participate, and libvirt attempts to generate filenames by
> +        appending the current timestamp as a suffix). When provided on
> +        input, disks omitted from the list do not participate in the
> +        backup.  On output, the list is present but contains only the
> +        disks participating in the backup job.  This element has a
> +        list of <code>disk</code> sub-elements, describing anywhere
> +        from one to all of the disks associated with the domain.
> +        <dl>
> +          <dt><code>disk</code></dt>
> +          <dd>This sub-element describes the backup properties of
> +            a specific disk.  The attribute <code>name</code> is
> +            mandatory, and must match either the <code><target
> +            dev='name'/></code> or an unambiguous <code><source
> +            file='name'/></code> of one of
> +            the <a href="formatdomain.html#elementsDisks">disk
> +            devices</a> specified for the domain at the time of the
> +            checkpoint.  The optional attribute <code>type</code> can
> +            be <code>file</code>, <code>block</code>,
> +            or <code>network</code>, similar to a disk declaration for
> +            a domain, controls what additional sub-elements are needed
> +            to describe the destination (such as <code>protocol</code>
> +            for a network destination).  In push mode backups, the
> +            primary sub-element is <code>target</code>; in pull mode,
> +            the primary sub-element is <code>scratch</code>; but
> +            either way, the primary sub-element describes the file
> +            name to be used during the backup operation, similar to
> +            the <code>source</code> sub-element of a domain disk. In
> +            push mode, an optional sub-element <code>driver</code> can
> +            also be used, with an attribute <code>type</code> to
> +            specify a destination format different from
> +            qcow2. Additionally, if a push backup is not
> +            incremental, <code>target</code> may contain an optional
> +            attribute <code>shallow="on"</code> so that the
> +            destination file copies only the top-most source file in a
> +            backing chain, rather than collapsing the entire chain
> +            into the copy.
> +          </dd>
> +        </dl>
> +      </dd>
> +    </dl>
> +
> +    <h2><a id="example">Examples</a></h2>
> +
> +    <p>Using this XML to create a checkpoint of just vda on a qemu
> +      domain with two disks and a prior checkpoint:</p>
> +    <pre>
> +<domaincheckpoint>
> +  <description>Completion of updates after OS install</description>
> +  <disks>
> +    <disk name='vda' checkpoint='bitmap'/>
> +    <disk name='vdb' checkpoint='no'/>
> +  </disks>
> +</domaincheckpoint></pre>
> +
> +    <p>will result in XML similar to this from
> +      <code>virDomainCheckpointGetXMLDesc()</code>:</p>
> +    <pre>
> +<domaincheckpoint>
> +  <name>1525889631</name>
> +  <description>Completion of updates after OS install</description>
> +  <creationTime>1525889631</creationTime>
> +  <parent>
> +    <name>1525111885</name>
> +  </parent>
> +  <disks>
> +    <disk name='vda' checkpoint='bitmap' bitmap='1525889631'/>
> +    <disk name='vdb' checkpoint='no'/>
> +  </disks>
> +  <domain type='qemu'>
> +    <name>fedora</name>
> +    <uuid>93a5c045-6457-2c09-e56c-927cdf34e178</uuid>
> +    <memory>1048576</memory>
> +    ...
> +    <devices>
> +      <disk type='file' device='disk'>
> +        <driver name='qemu' type='qcow2'/>
> +        <source file='/path/to/file1'/>
> +        <target dev='vda' bus='virtio'/>
> +      </disk>
> +      <disk type='file' device='disk' snapshot='external'>
> +        <driver name='qemu' type='raw'/>
> +        <source file='/path/to/file2'/>
> +        <target dev='vdb' bus='virtio'/>
> +      </disk>
> +      ...
> +    </devices>
> +  </domain>
> +</domaincheckpoint></pre>
> +
> +    <p>With that checkpoint created, the qcow2 image is now tracking
> +      all changes that occur in the image since the checkpoint via
> +      the persistent bitmap named <code>1525889631</code>.  Now, we
> +      can make a subsequent call
> +      to <code>virDomainBackupBegin()</code> to perform an incremental
> +      backup of just this data, using the following XML to start a
> +      pull model NBD export of the vda disk:
> +    </p>
> +    <pre>
> +<domainbackup mode="pull">
> +  <incremental>1525889631</incremental>
> +  <server transport="unix" socket="/path/to/server"/>
> +  <disks/>
> +    <disk name='vda' type='file'>
> +      <scratch file='/path/to/file1.scratch'/>
> +    </disk>
> +  </disks/>
> +</domainbackup>
> +    </pre>
> +  </body>
> +</html>
>

[...]

-- 
Best regards,
Vladimir





More information about the libvir-list mailing list