SDM-Net: A Simple and Effective Model for Generalized Zero-Shot Learning

Daghaghi, Shabnam; Medini, Tharun; Shrivastava, Anshumali

Computer Science > Computer Vision and Pattern Recognition

arXiv:1909.04790 (cs)

[Submitted on 10 Sep 2019 (v1), last revised 31 Dec 2020 (this version, v2)]

Title:SDM-Net: A Simple and Effective Model for Generalized Zero-Shot Learning

Authors:Shabnam Daghaghi, Tharun Medini, Anshumali Shrivastava

View PDF

Abstract:Zero-Shot Learning (ZSL) is a classification task where we do not have even a single training labeled example from a set of unseen classes. Instead, we only have prior information (or description) about seen and unseen classes, often in the form of physically realizable or descriptive attributes. Lack of any single training example from a set of classes prohibits use of standard classification techniques and losses, including the popular crossentropy loss. Currently, state-of-the-art approaches encode the prior class information into dense vectors and optimize some distance between the learned projections of the input vector and the corresponding class vector (collectively known as embedding models). In this paper, we propose a novel architecture of casting zero-shot learning as a standard neural-network with crossentropy loss. During training our approach performs soft-labeling by combining the observed training data for the seen classes with the similarity information from the attributes for which we have no training data or unseen classes. To the best of our knowledge, such similarity based soft-labeling is not explored in the field of deep learning. We evaluate the proposed model on the four benchmark datasets for zero-shot learning, AwA, aPY, SUN and CUB datasets, and show that our model achieves significant improvement over the state-of-the-art methods in Generalized-ZSL and ZSL settings on all of these datasets consistently.

Subjects:	Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
Cite as:	arXiv:1909.04790 [cs.CV]
	(or arXiv:1909.04790v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1909.04790

Submission history

From: Shabnam Daghaghi [view email]
[v1] Tue, 10 Sep 2019 23:27:24 UTC (291 KB)
[v2] Thu, 31 Dec 2020 10:27:37 UTC (757 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:SDM-Net: A Simple and Effective Model for Generalized Zero-Shot Learning

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:SDM-Net: A Simple and Effective Model for Generalized Zero-Shot Learning

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators