research-article

A Compressed-domain Robust Descriptor for Near Duplicate Video Copy Detection

Authors:

Amir H. Rouhi,

James A. ThomAuthors Info & Claims

IVCNZ '14: Proceedings of the 29th International Conference on Image and Vision Computing New Zealand

Pages 130 - 135

https://doi.org/10.1145/2683405.2683417

Published: 19 November 2014 Publication History

Get Access

Abstract

This paper introduces a global descriptor from the compressed video domain (H.264) for near duplicate video copy detection tasks. The proposed descriptor uses a spatial-temporal feature structure in an ordinal pattern distribution format. The proposed descriptor is constructed from Intra-Prediction Modes (IPM) of key frames (IDR & I slices) and extracted from the compressed video files, using the MPEG4/AVC (H.264) codec. Intra-prediction is the compression technique used in the key frames of the H.264 codec. As the proposed feature describes pictures globally, this research compares the feature with the two other well-known global image descriptors, ordinal intensity/colour Histograms and ordinal Auto-correlograms, as baselines. Our experiments show how the proposed feature outperforms the baseline features in non-geometric transformations T3, T4 and T5 in effectiveness as well as efficiency. It is due to better representation of the image content and smaller feature vector size. The core competency of the proposed feature is in non-linear brightness and contrast changes (Gamma expansion and compression) in which the intensity/colour Histograms and Auto-correlograms are deficient.

References

[1]

D. Bhat and S. Nayar. Ordinal measures for image correspondence. Pattern Analysis and Machine Intelligence, IEEE Transactions on, 20(4):415--423, 1998.

Digital Library

Google Scholar

[2]

N. Dalal and B. Triggs. Histograms of oriented gradients for human detection. In Computer Vision and Pattern Recognition, 2005. CVPR 2005. IEEE Computer Society Conference on, volume 1, pages 886--893. IEEE, 2005.

Digital Library

Google Scholar

[3]

W. Freeman and M. Roth. Orientation histograms for hand gesture recognition. In International Workshop on Automatic Face and Gesture Recognition, volume 12, pages 296--301, 1995.

Google Scholar

[4]

V. Gupta, P. D. Z. Varcheie, L. Gagnon, and G. Boulianne. Crim at trecvid 2011: content-based copy detection using nearest-neighbor mapping. In TRECVID Workshop: NIST, 2011.

Google Scholar

[5]

M. Hill, G. Hua, A. Natsev, J. Smith, L. Xie, B. Huang, M. Merler, H. Ouyang, and M. Zhou. IBM research trecvid-2010 video copy detection and multimedia event detection system. In Proc. TRECVID 2010 Workshop, 2010.

Google Scholar

[6]

Y. Huang, B. Hsieh, T. Chen, and L. Chen. Analysis, fast algorithm, and VLSI architecture design for H. 264/AVC intra frame coder. Circuits and Systems for Video Technology, IEEE Transactions on, 15(3):378--401, 2005.

Digital Library

Google Scholar

[7]

A. Lakdashti and M. S. Moin. A new content-based image retrieval approach based on pattern orientation histogram. In Computer Vision/Computer Graphics Collaboration Techniques, pages 587--595. Springer, 2007.

Digital Library

Google Scholar

[8]

D. Lowe. Distinctive image features from scale-invariant keypoints. International journal of computer vision, 60(2):91--110, 2004.

Digital Library

Google Scholar

[9]

O. Orhan, J. Liu, J. Hochreiter, J. Poock, Q. Chen, A. Chabra, and M. Shah. University of central florida at trecvid 2008 content based copy detection and surveillance event detection. In TRECVID Workshop, Nov, pages 17--18, 2008.

Google Scholar

[10]

P. Over, G. Awad, J. Fiscus, B. Antonishek, M. Michel, A. Smeaton, W. Kraaij, and G. Quénot. An overview of the goals, tasks, data, evaluation mechanisms and metrics. In TRECVID 2011-TREC Video Retrieval Evaluation Online, 2011.

Google Scholar

[11]

A. F. Smeaton, P. Over, and W. Kraaij. Evaluation campaigns and trecvid. In MIR '06: Proceedings of the 8th ACM International Workshop on Multimedia Information Retrieval, pages 321--330, New York, NY, USA, 2006. ACM Press.

Digital Library

Google Scholar

[12]

G. Sullivan and T. Wiegand. Rate-distortion optimization for video compression. IEEE Signal Processing Magazine, 15(6):74--90, 1998.

Crossref

Google Scholar

[13]

J. Yuan, L. Duan, Q. Tian, and C. Xu. Fast and robust short video clip search using an index structure. In Proceedings of the 6th ACM SIGMM international workshop on Multimedia information retrieval, pages 61--68. ACM, 2004.

Digital Library

Google Scholar

[14]

F. Zargari, M. Mehrabi, and M. Ghanbari. Compressed domain texture based visual information retrieval method for I-frame coded pictures. IEEE Transactions on Consumer Electronics, 56(2):728--736, 2010.

Digital Library

Google Scholar

Cited By

View all

Spolaôr NLee HTakaki WEnsina LCoy CWu F(2020)A systematic review on content-based video retrievalEngineering Applications of Artificial Intelligence10.1016/j.engappai.2020.10355790:COnline publication date: 29-Jun-2020
https://dl.acm.org/doi/10.1016/j.engappai.2020.103557
Rouhi A(2015)Evaluating Spatio-Temporal Parameters in Video Similarity Detection by Global Descriptors2015 International Conference on Digital Image Computing: Techniques and Applications (DICTA)10.1109/DICTA.2015.7371255(1-8)Online publication date: Nov-2015
https://doi.org/10.1109/DICTA.2015.7371255
Rouhi A(2015)Enhanced-IPMH as a Robust Visual Descriptor from H.264/AVC and Evaluation of Parameters Effects2015 International Conference on Digital Image Computing: Techniques and Applications (DICTA)10.1109/DICTA.2015.7371254(1-8)Online publication date: Nov-2015
https://doi.org/10.1109/DICTA.2015.7371254

Index Terms

A Compressed-domain Robust Descriptor for Near Duplicate Video Copy Detection
1. Computing methodologies
  1. Artificial intelligence
    1. Computer vision
      1. Computer vision problems
  2. Computer graphics
    1. Image manipulation

Recommendations

Architecture of Algorithmically Optimized MPEG-4 AVC/H.264 Video Encoder
ICCVG 2012: Proceedings of the International Conference on Computer Vision and Graphics - Volume 7594

Architecture of algorithmically optimized MPEG-4 AVC/ H.264 video encoder is presented in the paper. The paper reveals details of implementation for the proposed MPEG-4 AVC video encoder. The presented MPEG-4 AVC encoder was tested with test video ...
Bit rate transcoding of H.264 encoded movies by dropping frames in the compressed domain

A new technique for controlling the bit-rate of H.264 encoded sequences is presented. The technique achieves bit-rate control by dropping frames directly in the compressed domain. The dropped frames are carefully selected so as to either eliminate or ...
Lossless fragile watermarking algorithm in compressed domain for multiview video coding

The hierarchical B picture (HBP) prediction structure is a typical coding scheme used for multiview video coding (MVC). This paper proposes a fragile watermarking algorithm for HBP-based multiview video coding. B_DIRECT_16 16 and B_SKIP are two types of ...

Comments

Information & Contributors

Information

Published In

IVCNZ '14: Proceedings of the 29th International Conference on Image and Vision Computing New Zealand

November 2014

298 pages

ISBN:9781450331845

DOI:10.1145/2683405

Co-chairs:
John Perrone,
Michael Mayo,
Anthony Blake,
General Chair:
Michael J. Cree,
Publications Chair:
Lee Streeter

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

In-Cooperation

The University of Waikato

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 19 November 2014

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article
Research
Refereed limited

Conference

IVCNZ '14

IVCNZ '14: The 29th International Conference on Image and Vision Computing New Zealand

November 19 - 21, 2014

Hamilton, New Zealand

Acceptance Rates

IVCNZ '14 Paper Acceptance Rate 55 of 74 submissions, 74%;

Overall Acceptance Rate 55 of 74 submissions, 74%

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

3
Total Citations
View Citations
77
Total Downloads

Downloads (Last 12 months)0
Downloads (Last 6 weeks)0

Reflects downloads up to 01 Jan 2025

Other Metrics

View Author Metrics

Citations

Cited By

View all

Spolaôr NLee HTakaki WEnsina LCoy CWu F(2020)A systematic review on content-based video retrievalEngineering Applications of Artificial Intelligence10.1016/j.engappai.2020.10355790:COnline publication date: 29-Jun-2020
https://dl.acm.org/doi/10.1016/j.engappai.2020.103557
Rouhi A(2015)Evaluating Spatio-Temporal Parameters in Video Similarity Detection by Global Descriptors2015 International Conference on Digital Image Computing: Techniques and Applications (DICTA)10.1109/DICTA.2015.7371255(1-8)Online publication date: Nov-2015
https://doi.org/10.1109/DICTA.2015.7371255
Rouhi A(2015)Enhanced-IPMH as a Robust Visual Descriptor from H.264/AVC and Evaluation of Parameters Effects2015 International Conference on Digital Image Computing: Techniques and Applications (DICTA)10.1109/DICTA.2015.7371254(1-8)Online publication date: Nov-2015
https://doi.org/10.1109/DICTA.2015.7371254

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Cited By

Index Terms

Recommendations

Architecture of Algorithmically Optimized MPEG-4 AVC/H.264 Video Encoder

Bit rate transcoding of H.264 encoded movies by dropping frames in the compressed domain

Lossless fragile watermarking algorithm in compressed domain for multiview video coding

Comments

Published In

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Acceptance Rates

Other Metrics

Article Metrics

Other Metrics

Cited By

Login options

Full Access

PDF

eReader

Abstract

References

Cited By

Index Terms

Recommendations

Architecture of Algorithmically Optimized MPEG-4 AVC/H.264 Video Encoder

Bit rate transcoding of H.264 encoded movies by dropping frames in the compressed domain

Lossless fragile watermarking algorithm in compressed domain for multiview video coding

Comments

Information

Published In

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Acceptance Rates

Contributors

Other Metrics

Bibliometrics

Article Metrics

Other Metrics

Citations

Cited By

Login options

Full Access

View options

PDF

eReader

Figures

Other

Share

Share this Publication link

Share on social media

Affiliations