Show simple item record

dc.contributor.advisor: Ramakrishnan, A G
dc.contributor.author: Konakanchi, Parthasarathy
dc.date.accessioned: 2011-08-09T07:38:18Z
dc.date.accessioned: 2018-07-31T04:58:19Z
dc.date.available: 2011-08-09T07:38:18Z
dc.date.available: 2018-07-31T04:58:19Z
dc.date.issued: 2011-08-09
dc.date.submitted: 2009
dc.identifier.uri: https://etd.iisc.ac.in/handle/2005/1348
dc.identifier.abstract: http://etd.iisc.ac.in/static/etd/abstracts/1742/G22898-Abs.pdf [en_US]
dc.description.abstract: After trying the Festival Speech Synthesis System, we decided to develop our own TTS framework, one conducive to performing the research experiments needed to develop good quality TTS for Indian languages. Most attempts at Indian language TTS have no prosody model, no provision for handling foreign language words, and no phrase break prediction that would make it possible to introduce appropriate pauses in the synthesized speech. Further, in the Indian context, there is a real need for a bilingual TTS that handles English along with the Indian language. In fact, it may be desirable to have a trilingual TTS that additionally handles the language of the neighboring state or Hindi. Thus, there is a need for a full-fledged TTS development framework that lends itself to experimentation on all the above issues and more. This thesis is therefore a serious attempt to develop such a modular, unit selection based TTS framework. The developed system has been tested for its effectiveness in creating intelligible speech in Tamil and Kannada. It has also been used to carry out two research experiments on TTS. The first part of the work is the design and development of a corpus-based concatenative Tamil speech synthesizer in Matlab and C. A synthesis database has been created from 1027 phonetically rich, pre-recorded sentences, segmented at the phone level. From the sentence to be synthesized, specifications of the required target units are predicted. During synthesis, the database units selected are those that best match the target specification according to a distance metric and a concatenation quality metric. To accelerate matching, the features of the end frames of the database units have been precomputed and stored. The selected units are concatenated to produce synthetic speech.
The high mean opinion scores obtained for the TTS output indicate that the synthesized speech is intelligible and acceptably natural and, with some additional features, could possibly be put to commercial use. Experiments carried out by others using our TTS framework have shown that, whenever the required phonetic context is not available in the synthesis database, similar phones that are perceptually indistinguishable may be substituted. The second part of the work deals with modifying the developed TTS framework so that it can be embedded in mobile phones. Commercial GSM FR, EFR and AMR speech codecs are used to compress our synthesis database. Perception experiments reveal that speech synthesized using a highly compressed database is reasonably natural. This holds promise for reading SMSs and emails aloud on mobile phones in Indian languages in the future. Finally, we observe that incorporating prosody and pause models for Indian language TTS would further enhance the quality of the synthetic speech. These are some of the promising, unexplored areas for future research in speech synthesis in Indian languages. [en_US]
dc.language.iso: en_US [en_US]
dc.relation.ispartofseries: G22898 [en_US]
dc.subject: Speech Synthesis [en_US]
dc.subject: Text-To-Speech Synthesis [en_US]
dc.subject: Natural Language Processing [en_US]
dc.subject: Digital Language Processing [en_US]
dc.subject: Concatenation Based Speech Synthesis [en_US]
dc.subject: TTS System [en_US]
dc.subject: Mobile Applications [en_US]
dc.subject.classification: Computer Science [en_US]
dc.title: A Research Bed For Unit Selection Based Text To Speech Synthesis System [en_US]
dc.type: Thesis [en_US]
dc.degree.name: MSc Engg [en_US]
dc.degree.level: Masters [en_US]
dc.degree.discipline: Faculty of Engineering [en_US]
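
The unit selection step described in the abstract (choosing database units by a distance to the target specification plus a concatenation quality measure, using precomputed end-frame features) might be sketched roughly as follows. This is only an illustrative sketch, not the thesis implementation: the `Unit` fields, the Euclidean distances, the `join_weight` parameter, and the dynamic-programming (Viterbi-style) search are all assumptions, since the abstract does not specify the metrics or the search strategy. It also assumes every target phone has at least one candidate in the database.

```python
import math
from dataclasses import dataclass
from typing import Dict, List, Sequence, Tuple

Vector = Tuple[float, ...]


@dataclass(frozen=True)
class Unit:
    """A database unit; all field names here are hypothetical."""
    phone: str
    features: Vector      # compared against the target specification
    first_frame: Vector   # precomputed features of the unit's first frame
    last_frame: Vector    # precomputed features of the unit's last frame


def euclidean(a: Vector, b: Vector) -> float:
    """Assumed stand-in for both the target and the concatenation metric."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def select_units(
    targets: Sequence[Tuple[str, Vector]],
    database: Dict[str, List[Unit]],
    join_weight: float = 1.0,
) -> List[Unit]:
    """Pick one unit per target, minimising target cost plus weighted join cost."""
    if not targets:
        return []
    candidates = [database[phone] for phone, _ in targets]

    # best[i][j] = (cheapest cumulative cost ending in candidate j, backpointer)
    best = [[(euclidean(u.features, targets[0][1]), -1) for u in candidates[0]]]
    for i in range(1, len(targets)):
        row = []
        for unit in candidates[i]:
            target_cost = euclidean(unit.features, targets[i][1])
            # Join cost uses only the stored boundary frames, so no audio
            # has to be analysed while searching.
            cost, back = min(
                (best[i - 1][k][0]
                 + join_weight * euclidean(prev.last_frame, unit.first_frame), k)
                for k, prev in enumerate(candidates[i - 1])
            )
            row.append((cost + target_cost, back))
        best.append(row)

    # Trace back from the cheapest final candidate.
    j = min(range(len(best[-1])), key=lambda k: best[-1][k][0])
    path: List[Unit] = []
    for i in range(len(targets) - 1, -1, -1):
        path.append(candidates[i][j])
        j = best[i][j][1]
    path.reverse()
    return path
```

Calling `select_units(targets, database)` with `targets` as a list of `(phone, feature vector)` pairs returns one `Unit` per target; the returned units would then be concatenated to produce the waveform. Raising `join_weight` favours smoother joins over closer matches to the target specification.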

