Learning Hierarchical Image Representation with Sparsity, Saliency and Locality

View Researcher's Other Codes

Disclaimer: The provided code links for this paper are external links. Science Nest has no responsibility for the accuracy, legality or content of these links. Also, by downloading this code(s), you agree to comply with the terms of use as set out by the author(s) of the code(s).

Authors Jimei Yang, Ming-Hsuan Yang
Journal/Conference Name BMVC
Paper Category
Paper Abstract This paper presents a deep learning model of building up hierarchical image representation. Each layer of hierarchy consists of three components: sparse coding, saliency pooling and local grouping. With sparse coding we identify distinctive coefficients for representing raw features of each lower layer; saliency pooling helps suppress noise and enhance translation invariance of sparse representation; we group locally pooled sparse codes to form more complex representations. Instead of using hand-crafted descriptors, our model learns an effective image representation directly from images in a unsupervised data-driven manner. We evaluate our algorithm with several benchmark databases of object recognition and analyze the contributions of different components. Experimental results show that our algorithm performs favorably against the state-of-the-art methods.
Date of publication 2011
Code Programming Language MATLAB
Comment

Copyright Researcher 2021