Visual Recognition by Counting Instances: A Multi-Instance Cardinality Potential Kernel

Hajimirsadeghi, Hossein; Yan, Wang; Vahdat, Arash; Mori, Greg

arXiv:1502.02063 (cs)

[Submitted on 6 Feb 2015 (v1), last revised 9 Apr 2015 (this version, v2)]

Abstract: Many visual recognition problems can be approached by counting instances. To determine whether an event is present in a long internet video, one could count how many frames seem to contain the activity. Classifying the activity of a group of people can be done by counting the actions of individual people. Encoding these cardinality relationships can reduce sensitivity to clutter, in the form of irrelevant frames or individuals not involved in a group activity. Learned parameters can encode how many instances tend to occur in a class of interest. To this end, this paper develops a powerful and flexible framework to infer any cardinality relation between latent labels in a multi-instance model. Hard or soft cardinality relations can be encoded to tackle diverse levels of ambiguity. Experiments on tasks such as human activity recognition, video event detection, and video summarization demonstrate the effectiveness of using cardinality relations for improving recognition results.
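
As a rough illustration of the counting idea in the abstract, the sketch below labels a bag of instances (for example, the frames of a video) from the number of instances that look positive. It is a toy under stated assumptions, not the paper's kernel or inference procedure: the function names, the 0.5 score threshold, and the particular hard and soft rules are all hypothetical choices made for this example.

```python
from typing import Sequence


def count_positive(scores: Sequence[float], threshold: float = 0.5) -> int:
    """Count instances whose score exceeds the (assumed) threshold."""
    return sum(1 for s in scores if s > threshold)


def hard_rule(scores: Sequence[float], min_count: int, threshold: float = 0.5) -> bool:
    """Hard cardinality rule: the bag is positive only if at least
    `min_count` instances are positive."""
    return count_positive(scores, threshold) >= min_count


def soft_rule(scores: Sequence[float], target_ratio: float, threshold: float = 0.5) -> float:
    """Soft cardinality rule: score the bag by how close its fraction of
    positive instances is to `target_ratio` (1.0 means an exact match)."""
    if not scores:
        return 0.0
    ratio = count_positive(scores, threshold) / len(scores)
    return 1.0 - abs(ratio - target_ratio)


# Hypothetical per-frame scores for one video; three of six exceed 0.5.
frame_scores = [0.9, 0.1, 0.8, 0.2, 0.7, 0.05]
print(count_positive(frame_scores))
print(hard_rule(frame_scores, min_count=2))
print(soft_rule(frame_scores, target_ratio=0.5))
```

In this toy, `min_count` and `target_ratio` are fixed by hand; the abstract describes learned parameters that encode how many instances tend to occur in a class of interest.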

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1502.02063 [cs.CV]
	(or arXiv:1502.02063v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1502.02063

Submission history

From: Hossein Hajimirsadeghi
[v1] Fri, 6 Feb 2015 21:57:55 UTC (2,542 KB)
[v2] Thu, 9 Apr 2015 22:41:49 UTC (2,546 KB)
