Show simple item record

dc.contributor.advisorVenkatesh Babu, R
dc.contributor.advisorChakraborty, Anirban
dc.contributor.authorMajumder, Avishek
dc.date.accessioned2020-12-08T09:27:30Z
dc.date.available2020-12-08T09:27:30Z
dc.date.submitted2019
dc.identifier.urihttps://etd.iisc.ac.in/handle/2005/4725
dc.description.abstractSaliency plays a key role in various computer vision tasks. Extracting salient regions from images and videos has been a well-established problem in computer vision. Determining salient regions in an image or video has a lot of immediate applications, such as Anomaly detection in videos, efficient data compression and other derived applications, such as robot vision, salient object determination, context-aware image and video retouching, tracking, person reidentification. With the advent of deep convolutional neural networks, many of the existing problems have witnessed a significant boost in performance. Segmentation tasks in particular, which require a general understanding of the scene, achieve a very high-performance gain in terms of IoU (Intersection over Union). The generalizable capability of the filters learned by these neural networks is suitable for tasks for which the network was not even trained. In this thesis, we explore three applications, image segmentation, video segmentation and automatic anomaly detection from videos. For the salient object segmentation in images, we explore two novel recurrent attention based methods. The soft attention method uses a recurrent gating mechanism to extract the salient object segmentation from an image. This technique weighs certain parts of the image more than the others to refine the segmentation outputs gradually. The refinement procedure does not require increased parameters since the recurrent gates have shared weights. The hard attention method tackles the problem by iteratively attending to image patches in a recurrent fashion and subsequently enhancing the predicted segmentation mask. Saliency features are estimated independently for every image patch which is further combined using an aggregation strategy based on a Convolutional GRU based network. The proposed approach works in an ii iii end-to-end manner, removing background noise and false positives incrementally. Through extensive evaluation on various benchmark datasets, we show superior performance to the existing approaches without any post-processing. The task of video object segmentation suffers from a number of challenges such as 3D parallax, camera shake, motion blur, cluttered background etc. To handle the challenges associated with video segmentation we developed a novel unsupervised end-to-end trainable, fully convolutional deep neural network for object segmentation. Our model, though does not use temporal information, is robust and scalable across scenes, as it is tested in an unsupervised manner and can easily infer which objects constitute the foreground of the image. We perform better than all methods using handcrafted features, and close to deep methods using temporal information. The final task is the automatic extraction of anomalous events from a given video. For this, we design a novel method to extract outliers from motion alone. We employ a stacked LSTM encoder-decoder structure to model the regular motion patterns of the given video sequence. The discrepancy between the motion predicted using the model and the actual observed motion in the scene is measured to detect anomalous activities. We show on-par performance with the existing state-of-the-art methods on the benchmark datasets.en_US
dc.language.isoen_USen_US
dc.relation.ispartofseries;G29596
dc.rightsI grant Indian Institute of Science the right to archive and to make available my thesis or dissertation in whole or in part in all forms of media, now hereafter known. I retain all proprietary rights, such as patent rights. I also retain the right to use in future works (such as articles or books) all or part of this thesis or dissertationen_US
dc.subjectsalient object segmentationen_US
dc.subjectImagesen_US
dc.titleExploring the Inherent Saliency in Visual Data through Convolutional Neural Networksen_US
dc.typeThesisen_US
dc.degree.nameMSen_US
dc.degree.levelMastersen_US
dc.degree.grantorIndian Institute of Scienceen_US
dc.degree.disciplineEngineeringen_US


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record