Speech Signal Classification Using Support Vector Machines

Sood, Gaurav

dc.contributor.advisor	Balakrishnan, N
dc.contributor.author	Sood, Gaurav
dc.date.accessioned	2011-03-16T04:41:08Z
dc.date.accessioned	2018-07-31T05:18:11Z
dc.date.available	2011-03-16T04:41:08Z
dc.date.available	2018-07-31T05:18:11Z
dc.date.issued	2011-03-16
dc.date.submitted	2009
dc.identifier.uri	https://etd.iisc.ac.in/handle/2005/1094
dc.description.abstract	Hidden Markov Models (HMMs) are, undoubtedly, the most employed core technique for Automatic Speech Recognition (ASR). Nevertheless, we are still far from achieving high‐performance ASR systems. Some alternative approaches, most of them based on Artificial Neural Networks (ANNs), were proposed during the late eighties and early nineties. Some of them tackled the ASR problem using predictive ANNs, while others proposed hybrid HMM/ANN systems. However, despite some achievements, nowadays, the dependency on Hidden Markov Models is a fact. During the last decade, however, a new tool appeared in the field of machine learning that has proved to be able to cope with hard classification problems in several fields of application: the Support Vector Machines (SVMs). The SVMs are effective discriminative classifiers with several outstanding characteristics, namely: their solution is that with maximum margin; they are capable to deal with samples of a very higher dimensionality; and their convergence to the minimum of the associated cost function is guaranteed. In this work a novel approach based upon probabilistic kernels in support vector machines have been attempted for speech data classification. The classification accuracy in case of support vector classification depends upon the kernel function used which in turn depends upon the data set in hand. But still as of now there is no way to know a priori which kernel will give us best results The kernel used in this work tries to normalize the time dimension by fitting a probability distribution over individual data points which normalizes the time dimension inherent to speech signals which facilitates the use of support vector machines since it acts on static data only. The divergence between these probability distributions fitted over individual speech utterances is used to form the kernel matrix. Vowel Classification, Isolated Word Recognition (Digit Recognition), have been attempted and results are compared with state of art systems.	en_US
dc.language.iso	en_US	en_US
dc.relation.ispartofseries	G23702	en_US
dc.subject	Speech Recognition	en_US
dc.subject	Speech Signal Processing	en_US
dc.subject	Automatic Speech Recognition	en_US
dc.subject	Artificial Neural Networks	en_US
dc.subject	Support Vector Machine	en_US
dc.subject	Time Normalization	en_US
dc.subject	Hidden Markov Models (HMMs)	en_US
dc.subject.classification	Computer Science	en_US
dc.title	Speech Signal Classification Using Support Vector Machines	en_US
dc.type	Thesis	en_US
dc.degree.name	MSc Engg	en_US
dc.degree.level	Masters	en_US
dc.degree.discipline	Faculty of Engineering	en_US

Files in this item

Name:: G23702.pdf
Size:: 1.237Mb
Format:: PDF

View/Open

This item appears in the following Collection(s)

Aerospace Engineering (AE) [469]

Show simple item record