MultilingualSpeechSynthesis

Differences between revisions 1 and 6 (spanning 5 versions)
Revision 1 as of 2006-10-08 14:06:48
Size: 1109
Editor: ti200720a080-5542
Comment: initial spec stub
Revision 6 as of 2006-12-03 14:41:40
Size: 2976
Editor: host-81-191-165-41
Comment: clarified design and implementation
Deletions are marked like this. Additions are marked like this.
Line 3: Line 3:
## Register at https://launchpad.net/distros/ubuntu/+specs
 * '''Laun
chpad entry''': none yet
 * '''Packages affected''':
 * '''Launchpad entry''': https://features.launchpad.net/distros/ubuntu/+spec/multilingual-speech
 * '''Packages affected''': {{{orca, espeak, casper, festival}}}
Line 9: Line 8:
Improve speech synthesis support in multiple languages by providing a greater variety of synthesis engines (eSpeak and mbrola) and access to more languages on the Live CD. Improve speech synthesis support in multiple languages with multiple languages on the Live CD and additional options after install. 
Line 17: Line 16:
 * Marie is a French visually impaired person relying on voice synthesizer. She uses either Speakup on the command line and Orca in Gnome and would like her documents and system messages to be read out in a French voice.

 * Lao is a Chinese blind person. There are currently no Free Chinese voice synthesizer in GNU/Linux, so he uses an English version.

 * Hans is a screen reader user living in Switzerland where there are three main languages. During installation he chose to use a German desktop, with a German synthesiser voice, but he also needs to be able to switch easily to French or Italian.
Line 18: Line 23:

 * Write an eSpeak driver for gnome-speech
 * Promote eSpeak to Main
Line 21: Line 29:
 * Make eSpeak the default syntesiser on the Live CD. It supports more languages and has a much smaller footprint than Festival (though the voices are less natural).
 * Provide an easy way to change voices and synthesisers after install (apt-get one and change settings in Orca/Speech``Dispatcher)
 * Make eSpeak the default synthesizer on the Live CD. It supports more languages and has a much smaller footprint than Festival (though the voices are less natural).
 * Write an eSpeak deriver for gnome-speech and make eSpeak the default driver.
 * Selecting drivers and voices: Orca already has a facility for selecting the synthesiser and voice to be used via gnome-speech.
 * Information on installing additional voices will be provided on {{{http://access.ubuntu.com/speech}}}

=== Optional ===

We may also provide support for various commercial synthesisers such as DECtalk, IBM-TTS and Mbrola. These would be available as gnome-speech drivers or via speech dispatcher (Mbrola). Implementation of this depends on licensing discussions.
Line 26: Line 40:
=== Code ===

=== Data preservation and migration ===
 * This spec depends on the [:Accessibility/Specs/OrcaEspeak:OrcaEspeak] spec
 * Ship eSpeak with up to 10-15 voices on the Live CD (follow the list of most used languages = most likely to have language pack on the CD)
 * The Casper accessibility script checks the language variable set during boot and sets the corresponding eSpeak voice
 * Add support for DECtalk, IBM-TTS, Loquendo, etc to gnome-speech so that these voices can be purchased and installed by users without needing to recompile gnome-speech (pending licensing discussions)
 * Add the Mbrola engine and voices to multiverse or commercial repository (pending licensing discussions)
Line 32: Line 48:
== BoF agenda and discussion ==  * How do we make language selection accessible at the gfxboot menu?
 * Contact makers of speech engines (commercial and academic) to sort out licensing questions

Please check the status of this specification in Launchpad before editing it. If it is Approved, contact the Assignee or another knowledgeable person before making changes.

Summary

Improve speech synthesis support in multiple languages with multiple languages on the Live CD and additional options after install.

Rationale

We aim to provide native language support to our users where we can.

Use cases

  • Marie is a French visually impaired person relying on voice synthesizer. She uses either Speakup on the command line and Orca in Gnome and would like her documents and system messages to be read out in a French voice.
  • Lao is a Chinese blind person. There are currently no Free Chinese voice synthesizer in GNU/Linux, so he uses an English version.
  • Hans is a screen reader user living in Switzerland where there are three main languages. During installation he chose to use a German desktop, with a German synthesiser voice, but he also needs to be able to switch easily to French or Italian.

Scope

  • Write an eSpeak driver for gnome-speech
  • Promote eSpeak to Main

Design

  • Make eSpeak the default synthesizer on the Live CD. It supports more languages and has a much smaller footprint than Festival (though the voices are less natural).
  • Write an eSpeak deriver for gnome-speech and make eSpeak the default driver.
  • Selecting drivers and voices: Orca already has a facility for selecting the synthesiser and voice to be used via gnome-speech.
  • Information on installing additional voices will be provided on http://access.ubuntu.com/speech

Optional

We may also provide support for various commercial synthesisers such as DECtalk, IBM-TTS and Mbrola. These would be available as gnome-speech drivers or via speech dispatcher (Mbrola). Implementation of this depends on licensing discussions.

Implementation

  • This spec depends on the [:Accessibility/Specs/OrcaEspeak:OrcaEspeak] spec

  • Ship eSpeak with up to 10-15 voices on the Live CD (follow the list of most used languages = most likely to have language pack on the CD)
  • The Casper accessibility script checks the language variable set during boot and sets the corresponding eSpeak voice
  • Add support for DECtalk, IBM-TTS, Loquendo, etc to gnome-speech so that these voices can be purchased and installed by users without needing to recompile gnome-speech (pending licensing discussions)
  • Add the Mbrola engine and voices to multiverse or commercial repository (pending licensing discussions)

Unresolved issues

  • How do we make language selection accessible at the gfxboot menu?
  • Contact makers of speech engines (commercial and academic) to sort out licensing questions


CategorySpec

Accessibility/Specs/MultilingualSpeechSynthesis (last edited 2008-08-06 16:39:31 by localhost)