CleanupAudioJumble

Differences between revisions 15 and 16
Revision 15 as of 2006-11-24 11:33:33
Size: 8054
Editor: 87
Comment: pulse is in feisty, no need for external archive
Revision 16 as of 2006-11-24 13:10:02
Size: 8372
Editor: tri69-2-89-80-32-83
Comment:
Deletions are marked like this. Additions are marked like this.
Line 86: Line 86:
 * I have made trials in Edgy with the packages provided by Lennart : no gnome sound (miss post configuration), but every applications like Listen, Totem or Banshee are detected and works ; Xmms cna play music with the dedicated plug-in provided ; the jack-module is not detected, so impossible. [wiki:Ttoine ttoine]

Please check the status of this specification in Launchpad before editing it. If it is Approved, contact the Assignee or another knowledgeable person before making changes.

Summary

The idea is to make PulseAudio the default sound system on Ubuntu, replacing the Esound Sound Daemon (esd) and ALSA dmix. PulseAudio is a drop-in replacement for Esound, but adds new features, opening it for many entirely new areas.

Rationale

Apple managed to standardize on a single powerful sound system (CoreAudio) for MacOSX which makes almost all users happy, ranging from normal day-to-day desktop users to gamers, to professional audio people. We should be able to provide the same on Linux. PulseAudio can currently provide the functionality at least partially, with the only notable exception being pro audio. PulseAudio is a modular sound server, kind of an "application server" for audio. Beyond the obvious sound mixing functionality it offers advanced audio features like "desktop bling", hot-plug support, transparent network audio, hot moving of playback streams between audio devices, seperate volume adjustments for all playback or record streams, very low latency, very precise latency estimation (even over the network), a modern zero-copy memory management , a wide range of extension modules, availability for many operating system, compatibility with 90% of all currently available audio applications for Linux in one way or another.

In the future it is expected that PulseAudio will extend to professional audio stuff, entering JACK's current application area. This however is not relevant for the implementation of this spec, at least at this time.

Use cases

  • L. wants to play a video and a background music track at the same time, without any special setup and hassles but with lip-sync audio.
  • L. wants to transparently playback local audio on a remote machine.
  • L. wants to move the currently played back stream from the internal soundcard of his laptop to the USB headset he just plugged in, without any interruption in playback and with only minor clicking on the UI.
  • L. wants that the operation described in the previous item is done automatically by the sound system if he plugs in his USB headset.
  • L. wants to control the volume for each playback stream separately, selecting the right mixer track based on the song name.
  • L. wants to merge his two stereo sound cards into a single 4 channel surround sound card.
  • L. wants that his MP3 music is always played at half the volume but Ekiga's voice stream at the full volume level.
  • L. wants to browse for the audio devices on the network and use them much the same way he already uses the shared network printers.
  • L. wants to move the local audio stream which is played by his bedroom's computer without interruption to the computer in the kitchen.
  • L. wants to multicast audio from his laptop to all machines in his network.
  • L. wants mixed audio but still low enough latencies for voip.
  • L. wants proper audio on his LTSP thin clients.
  • L. is using an average-quality speaker set. It needs some equalization to sound right, however his audio player of choice (Rhythmbox :P) does not yet feature an EQ. He simply modifies the overall equalizer and everything that his PC plays now sounds good on his setup.

Scope

This specification changes the default sound daemon for Ubuntu. The same is immediately applicable to Xubuntu, if desired.

Design

Mode of operation

In order to provide the highest possible audio quality, use all features like hal support and dynamic stream handling, and not opening any potential attack vector, the upstream recommended mode of operation is to have a permanent pulse daemon running as the user, without automatic module unloading, and maintaining a permanent connection to the sound card hardware device. However, this mode breaks the following use cases:

  1. Two users are logged in at the same time, and either of them wants to use the sound card.
  2. Commercial and legacy applications often use OSS for sound output, which wants exclusive sound card access.

While 2. can be circumvented by calling the application with existing wrappers which redirect /dev/dsp access to the Pulse emulation, we recognize that we cannot enforce this at the moment. Thus the alsa sink driver needs to be modified to release the sound card file handle when it's inactive for 5 seconds.

Compatibility

PulseAudio emulates the OSS, ALSA, and esound API (amongst others), so that existing applications can be moved to Pulse without much effort. Just for the record, this emulation has nothing to do with ALSA's OSS emulation, Pulse will work even if these modules are loaded (but not used).

For fully transparent OSS emulation, we should check whether the FUSD userspace devices implementation is mature enough to replace the current LD_PRELOAD hacks.

GUI

PulseAudio offers three different graphical user interfaces for controlling audio (see a [http://0pointer.de/public/pulse-screenshot.png screenshot]:

  1. pulseaudio volume control: Controls the volume of sinks, sources, and streams, and allows the user to move streams between sinks.
  2. pulseaudio preferences: configure network related services including multicast
  3. panel applet: notify about changes of sound hardware, choose default device, call the other pulseaudio tools.

The volume control and preferences applications are sufficient for all the use cases mentioned above. The panel applet is just "nice to have" for control freaks, so we should ship it, but not activate by default.

We will ship the pulseaudio volume control by default, since it is a very convenient interface to control the volume per stream. However, we will keep the default Gnome mixer applet (which controls the hardware mixer levels) for now.

Implementation

Code

  • The esound package is not installed by default any more, and replaced by pulseaudio-esound-compat.

  • The esound client library will still stay around to not break Gnome sound events and custom packages still relying on it.

  • Upstream is working on a clean and generic solution for the sound card file handle releasing on inactivity, but this will not be finished before spring 2007. After discussion with upstream, we decided that we will create our own simple patch for this for feisty, which might not fit well into the clean pulse architecture, but allow us to deploy, test, and benefit from it now without losing legacy compatibility.
  • Change gstreamer to prefer the pulse sink, and fall back to ALSA.
  • Change other applications in main to default to pulse output, if there is an available output module (xine, mplayer, libao (for Gaim), xmms, etc.).

Data preservation and migration

Package upgrades will be handled through a transitional esound package added to Debian's pulseaudio source package (this will work because pulseaudio's version is higher than esound's). Data migration is not necessary.

Comments

  • Beware that flashplugin-nonfree creates a /tmp/.esd folder in an init script. This prevents pulseaudio from starting. ("strace pulseaudio | grep -i open" or something like that) Work-around is to uninstall flashplugin-nonfree, of course, or disable the init script, which I think means flash sound will stop working, but I haven't tried. JackWasey

    • This has been resolved in Feisty's ALSA-based flashplugin-nonfree (there will be backports to dapper and edgy) that uses the latest beta 2 of Flash 9. The upgrade removes the initscript, so upon reboot, there will not exist a /tmp/.esd if you don't enable EsounD. ["DanielTChen"]
  • I have made trials in Edgy with the packages provided by Lennart : no gnome sound (miss post configuration), but every applications like Listen, Totem or Banshee are detected and works ; Xmms cna play music with the dedicated plug-in provided ; the jack-module is not detected, so impossible. [wiki:Ttoine ttoine]


CategorySpec

DesktopTeam/Specs/CleanupAudioJumble (last edited 2008-08-06 16:22:41 by localhost)