Show simple item record

dc.contributor.advisorRamakrishnan, K R
dc.contributor.authorAnoop, K R
dc.date.accessioned2018-07-14T07:58:06Z
dc.date.accessioned2018-07-31T04:57:21Z
dc.date.available2018-07-14T07:58:06Z
dc.date.available2018-07-31T04:57:21Z
dc.date.issued2018-07-14
dc.date.submitted2015
dc.identifier.urihttps://etd.iisc.ac.in/handle/2005/3824
dc.identifier.abstracthttp://etd.iisc.ac.in/static/etd/abstracts/4695/G26976-Abs.pdfen_US
dc.description.abstractOver the years, extensive research has been devoted to the study of people's head pose due to its relevance in security, human-computer interaction, advertising as well as cognitive, neuro and behavioural psychology. One of the main goals of this thesis is to estimate people's 3D head orientation as they freely move around in naturalistic settings such as parties, supermarkets etc. Head pose classification from surveillance images acquired with distant, large field-of-view cameras is difficult as faces captured are at low-resolution with a blurred appearance. Also labelling sufficient training data for headpose estimation in such settings is difficult due to the motion of targets and the large possible range of head orientations. Domain adaptation approaches are useful for transferring knowledge from the training source to the test target data having different attributes, minimizing target data labelling efforts in the process. This thesis examines the use of transfer learning for efficient multi-view head pose classification. Relationship between head pose and facial appearance from many labelled examples corresponding to the source data is learned initially. Domain adaptation techniques are then employed to transfer this knowledge to the target data. The following three challenging situations is addressed (I) ranges of head poses in the source and target images is different, (II) where source images capture a stationary person while target images capture a moving person with varying facial appearance due to changing perspective, scale and (III) a combination of (I) and (II). All proposed transfer learning methods are sufficiently tested and benchmarked on a new compiled dataset DPOSE for headpose classification. This thesis also looks at a novel signature representation for describing object sets for covariance descriptors, Covariance Profiles (CPs). CP is well suited for representing a set of similarly related objects. CPs posit that the covariance matrices, pertaining to a specific entity, share the same eigen-structure. Such a representation is not only compact but also eliminates the need to store all the training data. Experiments on images as well as videos for applications such as object-track clustering and headpose estimation is shown using CP. In the second part, Human-gaze for interest point detection for video retargeting is explored. Regions in video streams attracting human interest contribute significantly to human understanding of the video. Being able to predict salient and informative Regions of Interest (ROIs) through a sequence of eye movements is a challenging problem. This thesis proposes an interactive human-in-loop framework to model eye-movements and predicts visual saliency in yet-unseen frames. Eye-tracking and video content is used to model visual attention in a manner that accounts for temporal discontinuities due to sudden eye movements, noise and behavioural artefacts. Gaze buffering, for eye-gaze analysis and its fusion with content based features is proposed. The method uses eye-gaze information along with bottom-up and top-down saliency to boost the importance of image pixels. Our robust visual saliency prediction is instantiated for content aware Video Retargeting.en_US
dc.language.isoen_USen_US
dc.relation.ispartofseriesG26976en_US
dc.subjectHeadpose Classificationen_US
dc.subjectVideo Retargetingen_US
dc.subjectHuman Gaze Informationen_US
dc.subjectMultiview Headpose Estimationen_US
dc.subjectDomain Transfer Learningen_US
dc.subjectComputer Visionen_US
dc.subjectARCO-Xboosten_US
dc.subjectCovariance Profilesen_US
dc.subjectHeadpose Estimationen_US
dc.subjectHead Pose Classificationen_US
dc.subjectHead-pose Classificationen_US
dc.subjectCanonical Correlation Analysisen_US
dc.subject.classificationElectrical Engineeringen_US
dc.titleDynamic Headpose Classification and Video Retargeting with Human Attentionen_US
dc.typeThesisen_US
dc.degree.namePhDen_US
dc.degree.levelDoctoralen_US
dc.degree.disciplineFaculty of Engineeringen_US


Files in this item

This item appears in the following Collection(s)

Show simple item record