Show simple item record

dc.contributor.advisorHari, K V S
dc.contributor.authorJoseph, Joby
dc.date.accessioned2005-02-10T05:12:41Z
dc.date.accessioned2018-07-31T07:06:41Z
dc.date.available2005-02-10T05:12:41Z
dc.date.available2018-07-31T07:06:41Z
dc.date.issued2005-02-10T05:12:41Z
dc.date.submitted2004
dc.identifier.urihttps://etd.iisc.ac.in/handle/2005/3916
dc.identifier.srno114397480
dc.description.abstractIn this thesis we develop algorithms for estimating broadband source signals from a mixture using only two sensors. This is motivated by what is known in the literature as cocktail party effect, the ability of human beings to listen to the desired source from a mixture of sources with at most two ears. Such a study lets us, achieve a better understanding of the auditory pathway in the brain and confirmation of the results from physiology and psychoacoustics, have a clue to search for an equivalent structure in the brain which corresponds to the modification which improves the algorithm, come up with a benchmark system to automate the evaluation of the systems like 'surround sound', perform speech recognition in noisy environments. Moreover, it is possible that, what we learn about the replication of the functional units in the brain may help us in replacing those using signal processing units for patients suffering due to the defects in these units. There are two parts to the thesis. In the first part we assume the source signals to be broadband and having strong spectral overlap. Channel is assumed to have a few strong multipaths. We propose an algorithm to estimate all the strong multi-paths from each source to the sensors for more than two sources with measurement from two sensors. Because the channel matrix is not invertible when the number of sources is more than the number of sensors, we make use of the estimates of the multi-path delays for each source to improve the SIR of the sources. In the second part we look at a specific scenario of colored signals and channel being one with a prominent direct path. Speech signals as the sources in a weakly reverberant room and a pair of microphones as the sensors satisfy these conditions. We consider the case with and without a head like structure between the microphones. The head like structure we used was a cubical block of wood. We propose an algorithm for separating sources under such a scenario. We identify the features of speech and the channel which makes it possible for the human auditory system to solve the cocktail party problem. These properties are the same as that satisfied by our model. The algorithm works well in a partly acoustically treated room, (with three persons speaking and two microphones and data acquired using standard PC setup) and not so well in a heavily reverberant scenario. We see that there are similarities in the processing steps involved in the algorithm and what we know of the way our auditory system works, especially so in the regions before the auditory cortex in the auditory pathway. Based on the above experiments we give reasons to support the hypothesis about why all the known organisms need to have only two ears and not more but may have more than two eyes to their advantage. Our results also indicate that part of pitch estimation for individual sources might be occurring in the brain after separating the individual source components. This might solve the dilemma of having to do multi-pitch estimation. Recent works suggest that there are parallel pathways in the brain up to the primary auditory cortex which deal with temporal cue based processing and spatial cue based processing. Our model seem to mimic the pathway which makes use of the spatial cues.en
dc.format.extent1019470 bytes
dc.format.mimetypeapplication/pdf
dc.language.isoen
dc.publisherIndian Institute of Scienceen
dc.rightsI grant Indian Institute of Science the right to archive and to make available my thesis or dissertation in whole or in part in all forms of media, now hereafter known. I retain all proprietary rights, such as patent rights. I also retain the right to use in future works (such as articles or books) all or part of this thesis or dissertation.en
dc.subject.classificationElectrical Communicationsen
dc.subject.keywordHearingen
dc.subject.keywordTwo earsen
dc.subject.keywordTwo sensorsen
dc.subject.keywordMIMOen
dc.subject.keywordSISOen
dc.subject.keywordSource separationen
dc.subject.keywordICAen
dc.subject.keywordSignal separationen
dc.subject.keywordArray processingen
dc.subject.keywordAuditory pathwayen
dc.titleWhy only two ears? Some indicators from the study of source separation using two sensorsen
dc.typeElectronic Thesis and Dissertationen
dc.degree.namePhDen
dc.degree.levelDoctoralen
dc.degree.grantorIndian Institute of Scienceen
dc.degree.disciplineFaculty of Engineeringen


Files in this item

This item appears in the following Collection(s)

Show simple item record