IEEE Home | Shop IEEE | Join IEEE | myIEEE | Contact IEEE | IEEEXplore
IEEE

IEEE Signal Processing Society
Speech & Language Technical Committee


TC-STAR: Progress in Speech-to-Speech Translation

By STEPHEN COX

The objective of the TC-STAR project was ambitious: to achieve a breakthrough in SST that significantly reduced the gap between human and machine translation (MT) performance.  Funded by the European Commission as part of its Sixth Framework Program, it recently finished after three years of research. 

TC-STAR was unusual in that it sought to develop and integrate three major areas of speech and language processing: speech recognition, machine translation and speech synthesis.  A consortium of 13 leading laboratories in these fields from Europe and the USA was assembled to address this challenge.  The architecture of the system used in TC-STAR is shown below:

The system was developed and tested on material from the European parliament in Spanish and English and from Voice of America in Chinese and English.

TC-STAR was also unusual in that it adopted a DARPA-style competitive strategy, with partners developing technology components and then comparing their performance in competitive evaluations, also open to external participants.  A workshop, organized after each evaluation, facilitated sharing to all participants of the techniques and ideas shown to be most successful.  This approach seems to have paid off: Figure 1 shows the word error rate for speech recognition in TC-STAR and Figure 2 the BLEU score for machine translation (BLEU is an automatic method for scoring the quality of translations that has been shown to correlate well with human judgements).  And end-to-end system was produced for English/Spanish and was evaluated by subjects in terms of its fluency, naturalness, listening effort and overall quality. 

Although there is still a long way to go in SST before we can produce systems that approach human quality, the project leader Gianni Lazzari of ITC is positive: "A key area that TC-STAR addressed was the interface between speech recognition and language translation: research in areas such as confusion network coupling, automatic segmentation and punctuation gave us the ability to build fully-fledged speech translation systems that performed reasonably well. TC-STAR has also revealed many research topics to explore in order to improve the translation quality: for instance, introducing linguistic knowledge in the statistical framework, new learning algorithms, new and more powerful automatic metrics."

 

Figure  1: Speech recognition word-error rates over duration of TC-STAR project

 

Figure  2: BLEU scores over duration of TC-STAR project

 


 
SLTC Home   |    IEEE Home   |   Privacy & Security   |    Terms & Conditions