CloudServerNAppCheckpointRestart

Differences between revisions 3 and 4
Revision 3 as of 2010-11-05 14:33:38
Size: 2247
Editor: cpe-66-68-83-102
Comment:
Revision 4 as of 2010-11-09 17:01:35
Size: 3227
Editor: 64
Comment:
Deletions are marked like this. Additions are marked like this.
Line 5: Line 5:
 * '''Contributors''':
 * '''Packages affected''':
 * '''Contributors''': serge-hallyn
 * '''Packages affected''': lxc, linux-kernel
Line 9: Line 9:

Application checkpoint/restart allows us to do live migration of
LXC containers, greatly increasing their usability as an alternative
cluster node type.
Line 16: Line 20:
Checkpoint, restart, and live migration of applications and containers
is now supported.
Line 18: Line 25:
A great deal of support for application c/r is already implemented in
out-of-upstream patchsets, and available at linux-cr.org. The purpose
of this blueprint is simply to publish a PPA with the kernel and
userspace packages needed for users to trivially test c/r, be it on
their host, in a VM, or on EC2.
Line 19: Line 32:

Mike is a user with a running mysql server. He wants to temporarily
migrate the mysql instance to another machine while he upgrades the
server's operating system.

John wants to run a cluster using containers. There are two long running
jobs in LXC containers on the same physical machine. He migrates one
container to another machine to increase performance for both jobs.

Summary

Application checkpoint/restart allows us to do live migration of LXC containers, greatly increasing their usability as an alternative cluster node type.

Release Note

Checkpoint, restart, and live migration of applications and containers is now supported.

Rationale

A great deal of support for application c/r is already implemented in out-of-upstream patchsets, and available at linux-cr.org. The purpose of this blueprint is simply to publish a PPA with the kernel and userspace packages needed for users to trivially test c/r, be it on their host, in a VM, or on EC2.

User stories

Mike is a user with a running mysql server. He wants to temporarily migrate the mysql instance to another machine while he upgrades the server's operating system.

John wants to run a cluster using containers. There are two long running jobs in LXC containers on the same physical machine. He migrates one container to another machine to increase performance for both jobs.

Assumptions

Design

Implementation

Test/Demo Plan

Unresolved issues

BoF agenda and discussion

UDS Natty discussion

== Application checkpoint/restart ==

Application checkpoint/restart in linux (linux-cr.org) provides the ability to checkpoint, restart, and migrate application and system containers. This provides a very lightweight mechanism for load-balancing in the cloud.

Actions:
 * Create a ppa with the kernel and userspace packages needed to experiment with c/r
   * Create a project in lp
   * When ppa is up, Gustavo will blog about how to use it


CategorySpec

CloudServerNAppCheckpointRestart (last edited 2010-11-09 17:01:35 by 64)