Show simple item record

dc.contributor.advisorGhosh, Prasanta Kumar
dc.contributor.authorKarthik, Girija Ramesan
dc.date.accessioned2021-07-08T10:06:33Z
dc.date.available2021-07-08T10:06:33Z
dc.date.submitted2018
dc.identifier.urihttps://etd.iisc.ac.in/handle/2005/5189
dc.description.abstractMachine localization of sound sources is necessary for a wide range of appli- cations, including human-robot interaction, surveillance and hearing aids. Robot sound localization algorithms have been proposed using microphone arrays with varied number of microphones. Adding more microphones helps increase the localization performance as more spatial cues can be obtained based on the number and arrangement of the microphones. However, hu- mans have an incredible ability to accurately localize and attend to target sound sources even in adverse noise conditions. The perceptual organi- zation of sounds in complex auditory scenes is done using various cues that help us group/segregate sounds. Among these, two major spatial cues are the Interaural time difference (ITD) and Interaural level/intensity difference(ILD/IID). Popular algorithms, for binaural source localization, model the distributions of ITD and ILD in each frequency subband us- ing Gaussian Mixture Models (GMMs) and perform likelihood integration across the time-frequency plane to estimate the direction of arrival (DoA) of the sources. In this thesis, we use ITDs and show that the localization performance of a GMM based scheme varies across subbands. We pro- pose subband selection and subband weighting schemes in order to exploit the subband reliability for localization. Source localization results demon- strate that the proposed schemes perform better than uniformly weighing all subbands. In particular, the best set of weights closely correspond to the case of selecting only the most reliable subband. We also propose a new binaural localization technique in which templates, that capture the direction-speci c interaural time di erence patterns, are used to localize sources. These templates are obtained using histograms of ITDs in each subband. DoA is estimated using a template matching scheme, which is experimentally found to perform better than the GMM based scheme.en_US
dc.language.isoen_USen_US
dc.relation.ispartofseries;G29733
dc.rightsI grant Indian Institute of Science the right to archive and to make available my thesis or dissertation in whole or in part in all forms of media, now hereafter known. I retain all proprietary rights, such as patent rights. I also retain the right to use in future works (such as articles or books) all or part of this thesis or dissertationen_US
dc.subjectGMM based schemeen_US
dc.subjectsubbandsen_US
dc.subjectsound locationen_US
dc.subject.classificationResearch Subject Categories::TECHNOLOGY::Electrical engineering, electronics and photonics::Electrical engineeringen_US
dc.titleBinaural Source Localization using subband reliability and interaural time difference patternsen_US
dc.typeThesisen_US
dc.degree.nameMSen_US
dc.degree.levelMastersen_US
dc.degree.grantorIndian Institute of Scienceen_US
dc.degree.disciplineEngineeringen_US


Files in this item

This item appears in the following Collection(s)

Show simple item record