Introduction

About

GamaTTS is an experimental articulatory speech synthesizer, started as a C++ port of Gnuspeech. The port is based on the original TTS_Server (developed for NeXTSTEP), which was written in C (70% in lines of code) and ObjC (30%).

GamaTTS:Editor was later developed to edit the articulatory database, using parts of the source code from Monet (written in ObjC), an editor in Gnuspeech. During the development, the available documentation for Monet was used as a reference.

Some of the changes in GamaTTS (compared with Gnuspeech) are:

Due to these and other changes, most of the file formats are not compatible with Gnuspeech anymore. But the vocal tract models 0, 1 and 2 produce speech almost equivalent to Gnuspeech's output.


Description

Diagram of GamaTTS

The main modules are:

The VTM parameters can be adjusted to produce a "schwa" sound, for example. If the parameter values remain constant, the output will be a continuous sound. In the Control Model, such a configuration of the VTM is called a Posture.

To produce speech, the VTM parameters must change along the time (this is called articulation). In the Control Model, the way the parameters change from Posture to Posture is defined by Transitions.

Transitions use Transition Points to define the piecewise linear function that will control the VTM parameter along the time. The time of each Point can be defined using constants, but this is not very flexible. For this reason the Control Model uses Equations to define the times. The Equations use formulas to calculate time, using as parameters the durations of the Postures involved in a Transition.

The Control Model must decide which Transition will be used for each Posture sequence and for each parameter. The Rules are used to do this selection, they contain boolean expressions to match a sequence of Postures. Boolean expressions can also match Categories, which are groups of Postures.