UbuntuOne

Revision 18 as of 2009-06-08 17:56:25

Clear message

Ubuntu One is a blanket name for what is destined to be a suite of online services from Canonical.

So far there is only one service, a Dropbox-like service for file exchange and mirroring.

Invitations

To get access to Ubuntu ONE(TM) go to https://ubuntuone.com and log in with your Launchpad account. This will generate an invitation request. Alternatively ask someone already with an account to share something with you.

The IRC channel at #ubuntuone on irc.freenode.net is a good place to ask questions.

Introduction

  • The log file is at ~/.cache/ubuntuone/log/syncdaemon.log

  • There is as yet no indicator of the remaining time/bytes.
  • The sync algorithm handles partial downloads of files and resumes from interruptions.
  • If a file changes the entire file is re-copied; deltas are planned but not yet implemented.
  • At the moment it will only sync files in the ~/Ubuntu One folder, but it could do more in future, perhaps synchronising /var/www from a server for example.
  • The plans are 2GB of free storage and a paid for 10GB option. More than 10GB is not currently available.
  • The data is stored on the Amazon S3 servers in both the UK and the USA.

Areas needing extensive testing

* Putting files in and out whilst synchronising * Interrupted synchronisations * Deletions * Copy the ~/Ubuntu One folder or some part of it from one computer to another via sftp or otherwise and let U1 pick up synchronisation * Conflicts ("simultaneous" modifications to the same file from different locations)

Reporting bugs

You may be requested to submit part of your log file. Please bear in mind this contains filenames when posting a public bug report or posting on IRC.

The command

  • tail -n 100 logfile | pastebinit

will put the last 100 lines of logfile on pastebin and give you a URL suitable for discussing on IRC.


Technical details

  • Details as viewed by an outsider. Blame sladen for E&OE and for the insistence on using cute names.

As of 2009-06-08, these are:

  • Storage: remote storage

Because the "Ubuntu One" branding is non-specific and because most of the components do not have succinct official names, we'll use shorter (unique) names to succinctly identify each of the components and layers. You can find a table mapping these simple names to download packages and URLs at the bottom. (See the "storagefs" section for just how long-winded and inconvenient these actual names can get).

"Ubunet" was the original codename for the online services initiative (the Canonical department itself is called "Online Services"), and is the name of the Launchpad project encompassing the backend components (there is also a "control tower" group which is used when landing branches).

"Storage"

This provides 2GB-10GB of remotely accessible disk space stored on Amazon S3. Rather than using an existing protocol (eg. FTP, SFTP, NFS, Webdav) access to this remote disk space is via a not-invented-here communication protocol called "u1storage".

The system is intented to be for mirroring of content between computers, opportunistically synchronizing those mirrors when the user is online. There is a delay because a file must be completely uploaded before the changes can start to be shared. The heavy duty backend storage is ultimately out-sourced to Amazon S3 (Amazon Simple Storage Service).

At a high-level, Storage has been compared to Dropbox. In June 2009, Mandriva have produced something similar called mdvbackup, (branded as "Mandriva Click'n'Backup").

During early development, the codename for this facility was "Hammertime".

General concepts

  • There is a backing database (currently on S3) which stores file objects, directory objects, and the BLOB objects (files) themselves.

  • On the user's computer, a Share is a directory that is actively being mirrored.

  • When a file is modifid, the complete new version is uploaded and saved in a BLOB. The file object is updated to point at the new BLOB (directories are not themselves changed for a simple overwrite operation).

  • A Subtree is a particular directory object, representing itself and all of its descendants.

  • A Subtree can be shared with other users. This is called a View.

  • The View given to another user can be either read-only, or read-write.

  • On connection, or reconnection, the u1sync-agent is responsible for spidering the full directory tree and identifying any changes on its own.

  • While connected, the u1sync-agent will receive unsolicited notifications of changed objects. Notifications are an optimisation. They allow for lower latency in synchronisation and they avoid unnecessary polling.

  • u1ftp is an example of a simple one-shot client that ignores unsolicited notifications.

updown

This is a backend librarian service. It listens for connections via HTTP and accepts the upload, or returns the download requested. It does not have a user-visible interface.

It does not use the "u1storage" protocol for access, talking instead to the backend database and is therefore effectively a raw HTTP<->S3 gateway.

There is one implementation, which is proprietary. Little is known, but an instance of it runs on an EC2 server at updown.ubuntu.com.

webstorage

Also known as "web ui". This is a pretty HTTP/web-browser interface to view directory listings and files held in the user's storage service (and other Shares which they have access to). It is universal and cross-platform and allows viewing of files, plus uploading and downloading.

The user's own files are displayed under "My Files" and incoming Shares from other users show up under "Shared With Me". An advantage is that it allows direct "cherry picking" of individual files, without waiting for a (potentially lengthy) synchronisation process to complete.

Going via the browser appears to be "closest to bare metal" interface.

There is one implementation, which is proprietary. Little is known, but an instance of it runs at https://www.ubuntuone.com/files/ (subscribers only).

u1storage

This protocol is built on top of Google Protocol Buffers (a sort of fast binary XMLish schema language). This abstraction is to allow extension of the u1storage protocol with breaking compatibility.

The intention of the protocol is to enabling synchronisation by mirroring of content. The protocol itself uses SHA-1 hashes as identifiers for directory objects and their contents (files). Requests are always initiated from the client end. The first versions does not support any kind of delta-optimisation and each object (file) must be sent in its entirety.

Currently there is only one implementation, it written in Python and called python-u1storage (AGPLv3+copyright assignment required). The .proto descriptions (AGPLv3+assignment) can be used as starting points for other language implementations.

storagefsd

This is the fuzzy term for a monolithic(?) deamon running on the server fs-1.ubuntuone.com (174.129.241.144, currently an Amazon EC2 virtual machine instance). It accepts incoming SSL/TLS wrapped u1storage connections on port 443. Note that although port 443 (HTTPS) is used to allow passage through firewalls, the protocol has nothing to do with HTTP.

Storagefsd sends a client connected via the u1storage protocol unsolicited updates of any hash reference that have been updated, so that the other end can re-download the changed objects.

Currently there is one implementation, which is proprietary. This links to python-u1storage. That implemetnation is started by /etc/init.d/ubuntuoneapiserver which probably executes something like /usr/bin/env python .../canonical/ubuntuone/storage/server/server.py. <small>(Doesn't exactly trip off the tongue...).</small> It seems to save the partially uploaded files to a temporary directory; then presumably indexes/moves them off to Amazon S3 where the real storage is.

u1sync-agent

This is a per-user daemon run on each desktop computer and has no user interface. It runs in the background and exposes a DBus API interface. The agent does the actual work of deciding what to sychronise in which direction and handles doing so. Automatic resumption and partial uploading/downloading is supported, these files will be visible as filename.partial until the transfer has completed, when they are required to the proper filename.

By default, one fold called ~/Ubuntu One/ is automatically created and configured for mirroring. Changes to this fold (and any others added) are watched using inotifiy. Synchronisation meta-data about which directories are being mirrored and what their is stored in ~/.cache/ubuntuone. When remote content has changed, the agent acts on the incoming unsolicated notification sent by storagefs and starts downloading.

Authenication is via OAuth. Passwords are securely stored via the gnome-keyring service.

There is one implementation, written in Python (GPLv3), this may also be referred to as chicharra. It is started by /usr/share/dbus-1/services/com.ubuntuone.SyncDaemon.service and exposes its DBus interface under com.ubuntuone.SyncDaemon. Logs files are kept in ~/.cache/ubuntuone/log/syncdaemon.log.

gnome-u1sync-applet

Tray icon to control the behaviour of the a running u1sync-agent instance, by communicating over the DBus interface. Provides a constant status update on "what's happening".

There is one implementation, which is written in Python (GPL+copyright assignment hassle status unknown).

u1sync-tool

Command line utility to control the behaviour of a running u1sync-agent instance. The tool does not perform any actions directly, but sends messages to the running agent.

There is one implementation, which is written in Python (GPLv3).

libnautilus-u1sync

An extension (plugin) for the default GNOME file-browser called Nautilus to adds a status icon/button. This can send connection and disconnection requests to a running u1sync-agent instance. Additional directories (Shares) can be nominated for sharing using a new right-click context menu.

There are two implementations. One currently being shipped, written in Python (GPLv3), and one written in C (GPLv3) under development as of 2009-06-08.

u1ftp

This is a command line and test utility tool more akin to bzr branch/push or git clone/push (version control systems). It is incompatible with the rest of the u1sync-agent infrastructure. Synchronisation metadata is stored in a hidden directory at the root folder of each share named $dir/.ubuntuone-sync.

Apparently, this utility has concepts borrowed from Amazon's s3sync tool.

There is one implementation, which is written in Python (GPLv3).

fuse-u1storage

Apparently there was a FUSE (Filesystem in User Space) implementation, but it was "a disaster", so probably got canned. No details on whether this worked via mirroring, or simply exposing/mounting the remote filesystem locally.

There are zero current implementations.

Name mappings

This document

Canonical name

Bugs

Source

updown

updown

ubunet

Python, n/a

webstorage

web ui

ubunet

??, n/a

storagefsd

ubuntuoneapiserver

ubunet

Python, n/a

u1storage

ubuntuone-storage-protocol

ubuntone-storage-protocol

Python, AGPLv3 u-s-p/trunk

u1sync-agent

ubuntuone-syncdaemon

ubuntone-client

Python, GPLv3

u1sync-tool

u1sdtool

ubuntone-client

Python, GPLv3

gnome-u1sync-applet

ubuntuone-client-applet

ubuntone-client

Python, GPLv3

libnautilus-u1sync

ubuntuone-nautilus

ubuntone-client

Python/C, GPLv3

u1ftp

u1sync

ubuntone-client

Python, GPLv3

fuse-u1storage

??

??

??, n/a