Augmented Attribute Representations

Sharmanska, Viktoriia; Quadrianto, Novi; Lampert, Christoph H.

doi:10.1007/978-3-642-33715-4_18

Viktoriia Sharmanska²¹,
Novi Quadrianto²² &
Christoph H. Lampert²¹

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 7576))

Included in the following conference series:

European Conference on Computer Vision

9830 Accesses
44 Citations

Abstract

We propose a new learning method to infer a mid-level feature representation that combines the advantage of semantic attribute representations with the higher expressive power of non-semantic features. The idea lies in augmenting an existing attribute-based representation with additional dimensions for which an autoencoder model is coupled with a large-margin principle. This construction allows a smooth transition between the zero-shot regime with no training example, the unsupervised regime with training examples but without class labels, and the supervised regime with training examples and with class labels. The resulting optimization problem can be solved efficiently, because several of the necessity steps have closed-form solutions. Through extensive experiments we show that the augmented representation achieves better results in terms of object categorization accuracy than the semantic representation alone.

Download to read the full chapter text

Chapter PDF

Joint Learning of Semantic and Latent Attributes

Robust Attribute-Based Visual Recognition Using Discriminative Latent Representation

Divide, Share, and Conquer: Multi-task Attribute Learning with Selective Sharing

Keywords

References

Lampert, C.H., Nickisch, H., Harmeling, S.: Learning to detect unseen object classes by between-class attribute transfer. In: CVPR, pp. 951–958 (2009)
Google Scholar
Palatucci, M., Pomerleau, D., Hinton, G., Mitchell, T.: Zero-shot learning with semantic output codes. In: NIPS, pp. 1410–1418 (2009)
Google Scholar
Farhadi, A., Endres, I., Hoiem, D., Forsyth, D.A.: Describing objects by their attributes. In: CVPR, pp. 1778–1785 (2009)
Google Scholar
Kulkarni, G., Premraj, V., Dhar, S., Li, S., Choi, Y., Berg, A., Berg, T.: Baby talk: Understanding and generating simple image descriptions. In: CVPR, pp. 1601–1608 (2011)
Google Scholar
Hinton, G., Salakhutdinov, R.: Reducing the dimensionality of data with neural networks. Science 313, 504–507 (2006)
Article MathSciNet MATH Google Scholar
Weinberger, K.Q., Saul, L.K.: Distance metric learning for large margin nearest neighbor classification. JMLR 10, 207–244 (2009)
MATH Google Scholar
Vincent, P., Larochelle, H., Lajoie, I., Bengio, Y., Manzagol, P.A.: Stacked denoising autoencoders: Learning useful representations in a deep network with a local denoising criterion. JMLR 11, 3371–3408 (2010)
MathSciNet MATH Google Scholar
Quadrianto, N., Lampert, C.H.: Learning multi-view neighborhood preserving projections. In: ICML, pp. 425–432 (2011)
Google Scholar
Ferrari, V., Zisserman, A.: Learning visual attributes. In: NIPS, pp. 433–440 (2008)
Google Scholar
Wang, Y., Mori, G.: A Discriminative Latent Model of Object Classes and Attributes. In: Daniilidis, K., Maragos, P., Paragios, N. (eds.) ECCV 2010, Part V. LNCS, vol. 6315, pp. 155–168. Springer, Heidelberg (2010)
Chapter Google Scholar
Mahajan, D., Sellamanickam, S., Nair, V.: A joint learning framework for attribute models and object descriptions. In: ICCV, pp. 1227–1234 (2011)
Google Scholar
Fei-Fei, L., Fergus, R., Perona, P.: One-shot learning of object categories. PAMI 28, 594–611 (2006)
Article Google Scholar
Wolf, L., Hassner, T., Taigman, Y.: Descriptor based methods in the wild. In: ECCV Workshop on Faces in Real Life Images (2008)
Google Scholar
Tommasi, T., Orabona, F., Caputo, B.: Safety in numbers: Learning categories from few examples with multi model knowledge transfer. In: CVPR, pp. 3081–3088 (2010)
Google Scholar
Rohrbach, M., Stark, M., Szarvas, G., Gurevych, I., Schiele, B.: What helps where - and why? semantic relatedness for knowledge transfer. In: CVPR, pp. 910–917 (2010)
Google Scholar
Berg, T., Berg, A., Shih, J.: Automatic Attribute Discovery and Characterization from Noisy Web Data. In: Daniilidis, K., Maragos, P., Paragios, N. (eds.) ECCV 2010, Part I. LNCS, vol. 6311, pp. 663–676. Springer, Heidelberg (2010)
Chapter Google Scholar
Parikh, D., Grauman, K.: Interactively building a discriminative vocabulary of nameable attributes. In: CVPR, pp. 1681–1688 (2011)
Google Scholar
Welling, M., Rosen-Zvi, M., Hinton, G.: Exponential family harmoniums with an application to information retrieval. In: NIPS (2005)
Google Scholar
Ranzato, M., Huang, F., Boureau, Y., LeCun, Y.: Unsupervised learning of invariant feature hierarchies with applications to object recognition. In: CVPR, pp. 1–8 (2007)
Google Scholar
Gregor, K., LeCun, Y.: Emergence of complex-like cells in a temporal product network with local receptive fields. CoRR abs/1006.0448 (2010)
Google Scholar
Hinton, G., Krizhevsky, A., Wang, S.: Transforming auto-encoders. In: ICANN, pp. 44–51 (2011)
Google Scholar
Salakhutdinov, R., Hinton, G.: Learning a nonlinear embedding by preserving class neighbourhood structure. In: AISTATS (2007)
Google Scholar
Tang, K., Tappen, M., Sukthankar, R., Lampert, C.: Optimizing one-shot recognition with micro-set learning. In: CVPR, pp. 3027–3034 (2010)
Google Scholar
Osherson, D.N., Stern, J., Wilkie, O., Stob, M., Smith, E.E.: Default probability. Cognitive Science 15, 251–269 (1991)
Article Google Scholar
Bay, H., Ess, A., Tuytelaars, T., Van Gool, L.: Speeded-up robust features (surf). In: CVIU, pp. 346–359 (2008)
Google Scholar
Ebert, S., Larlus, D., Schiele, B.: Extracting Structures in Image Collections for Object Recognition. In: Daniilidis, K., Maragos, P., Paragios, N. (eds.) ECCV 2010, Part I. LNCS, vol. 6311, pp. 720–733. Springer, Heidelberg (2010)
Chapter Google Scholar

Download references

Author information

Authors and Affiliations

IST Austria (Institute of Science and Technology Austria), Klosterneuburg, Austria
Viktoriia Sharmanska & Christoph H. Lampert
University of Cambridge, Cambridge, UK
Novi Quadrianto

Authors

Viktoriia Sharmanska
View author publications
You can also search for this author in PubMed Google Scholar
Novi Quadrianto
View author publications
You can also search for this author in PubMed Google Scholar
Christoph H. Lampert
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Microsoft Research Ltd., CB3 0FB, Cambridge, UK
Andrew Fitzgibbon
Dept. of Computer Science, University of North Carolina, 27599, Chapel Hill, NC, USA
Svetlana Lazebnik
California Institute of Technology, 91125, Pasadena, CA, USA
Pietro Perona
Institute of Industrial Science, The University of Tokyo, 153-8505, Tokyo, Japan
Yoichi Sato
INRIA, 38330, Montbonnot, France
Cordelia Schmid

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Sharmanska, V., Quadrianto, N., Lampert, C.H. (2012). Augmented Attribute Representations. In: Fitzgibbon, A., Lazebnik, S., Perona, P., Sato, Y., Schmid, C. (eds) Computer Vision – ECCV 2012. ECCV 2012. Lecture Notes in Computer Science, vol 7576. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-33715-4_18

Download citation

DOI: https://doi.org/10.1007/978-3-642-33715-4_18
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-33714-7
Online ISBN: 978-3-642-33715-4
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Augmented Attribute Representations

Abstract

Chapter PDF

Similar content being viewed by others

Joint Learning of Semantic and Latent Attributes

Robust Attribute-Based Visual Recognition Using Discriminative Latent Representation

Divide, Share, and Conquer: Multi-task Attribute Learning with Selective Sharing

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Augmented Attribute Representations

Abstract

Chapter PDF

Similar content being viewed by others

Joint Learning of Semantic and Latent Attributes

Robust Attribute-Based Visual Recognition Using Discriminative Latent Representation

Divide, Share, and Conquer: Multi-task Attribute Learning with Selective Sharing

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation