Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

Incremental probabilistic Latent Semantic Analysis for video retrieval

Published: 01 June 2015 Publication History

Abstract

Recent research trends in Content-based Video Retrieval have shown topic models as an effective tool to deal with the semantic gap challenge. In this scenario, this paper has a dual target: (1) it is aimed at studying how the use of different topic models (pLSA, LDA and FSTM) affects video retrieval performance; (2) a novel incremental topic model (IpLSA) is presented in order to cope with incremental scenarios in an effective and efficient way. A comprehensive comparison among these four topic models using two different retrieval systems and two reference benchmarking video databases is provided. Experiments revealed that pLSA is the best model in sparse conditions, LDA tend to outperform the rest of the models in a dense space and IpLSA is able to work properly in both cases. Display Omitted A study of the use of topic models for video retrieval is presented.A new topic model to deal with incremental retrieval scenarios is proposed.Comparison of four topic models using two different retrieval functionsThe results highlight the performance differences among the topic models.

References

[1]
S. Antani, A survey on the use of pattern recognition methods for abstraction, indexing and retrieval of images and video, Pattern Recogn., 35 (2002) 945-965.
[2]
M. Lew, N. Sebe, C. Djeraba, R. Jain, Content-based multimedia information retrieval: state of the art and challenges, ACM Trans. Multimed. Comput. Commun. Appl., 2 (2006) 1-19.
[3]
L. Ying, Z. Dengsheng, L. Guojun, M. Wei-Ying, A survey of content-based image retrieval with high-level semantics, Pattern Recogn., 40 (2007) 262-282.
[4]
M. Cord, P.H. Gosselin, S. Philipp-Foliguet, Stochastic exploration and active learning for image retrieval, Image Vis. Comput., 25 (2007) 14-23.
[5]
G. Chechik, V. Sharma, U. Shalit, S. Bengio, Large scale online learning of image similarity through ranking, J. Mach. Learn. Res., 11 (2010) 1109-1135.
[6]
G.-H. Liu, Z.-Y. Li, L. Zhang, Y. Xu, Image retrieval based on micro-structure descriptor, Pattern Recogn., 44 (2011) 2123-2133.
[7]
M. Arevalillo-Herráez, F.J. Ferri, An Improved Distance-based Relevance Feedback Strategy for Image Retrieval, Image Vision and Computing, 2013.
[8]
W. Ren, S. Singh, M. Singh, Y.S. Zhu, State-of-the-art on spatio-temporal information-based video retrieval, Pattern Recogn., 42 (2009) 267-282.
[9]
S. Tong, E. Chang, Support vector machine active learning for image retrieval, in: Proceedings of the Ninth ACM International Conference on Multimedia, ACM, 2001, pp. 107-118.
[10]
K. Tieu, P. Viola, Boosting image retrieval, Int. J. Comput. Vis., 56 (2004) 17-36.
[11]
D. Zhou, J. Weston, A. Gretton, O. Bousquet, B. Schölkopf, Ranking on Data Manifolds, in: Advances in Neural Information Processing Systems 16, MIT Press, 2004.
[12]
Y. Yang, F. Nie, D. Xu, J. Luo, Y. Zhuang, Y. Pan, A multimedia retrieval framework based on semi-supervised ranking and relevance feedback, IEEE Trans. Pattern Anal. Mach. Intell., 34 (2012) 723-742.
[13]
C.G.M. Snoek, M. Worring, J.C. van Gemert, J.-M. Geusebroek, A.W.M. Smeulders, The challenge problem for automated detection of 101 semantic concepts in multimedia, in: Proceedings of the 14th Annual ACM International Conference on Multimedia, ACM, 2006, pp. 421-430.
[14]
A.W.M. Smeulders, M. Worring, S. Santini, A. Gupta, R. Jain, Content-based image retrieval at the end of the early years, IEEE TPAMI, 22 (2000) 1349-1380.
[15]
D.G. Lowe, Distinctive image features from scale-invariant keypoints, Int. J. Comput. Vis., 60 (2004) 91-110.
[16]
I. Laptev, On space-time interest points, Int. J. Comput. Vis., 64 (2005) 107-123.
[17]
C.V. Cotton, D.P.W. Ellis, Audio fingerprinting to identify multiple videos of an event, in: International Conference on Acoustics, Speech and Signal Processing, IEEE, 2010, pp. 2386-2389.
[18]
J. Sivic, A. Zisserman, Video Google: A text retrieval approach to object matching in videos, in: Proceedings of the International Conference on Computer Vision, volume 2, 2003, pp. 1470-1477.
[19]
H. Wang, C. Schmid, Action recognition with improved trajectories, in: ICCV 2013 - IEEE International Conference on Computer Vision, 2013, pp. 3551-3558.
[20]
R. Fernandez-Beltran, F. Pla, An interactive video retrieval approach based on latent topics, in: International Conference on Image Analysis and Processing, vol. 8156, Springer, Berlin Heidelberg, 2013, pp. 290-299.
[21]
D.M. Blei, Probabilistic topic models, Commun. ACM, 55 (2012) 77-84.
[22]
T. Hofmann, Unsupervised learning by probabilistic latent semantic analysis, Mach. Learn., 42 (2001) 177-196.
[23]
D. Blei, A. Ng, M. Jordan, Latent Dirichlet allocation, J. Mach. Learn. Res., 3 (2003) 993-1022.
[24]
C.-Y. Chiu, T.-H. Tsai, Y.-C. Liou, G.-W. Han, H.-S. Chang, Near-duplicate subsequence matching between the continuous stream and large video dataset, IEEE Trans. Multimedia, 16 (2014) 1952-1962.
[25]
D.M. Blei, J.D. Lafferty, Dynamic topic models, in: Proceedings of the 23rd International Conference on Machine Learning, ICML '06, ACM, 2006.
[26]
T.-C. Chou, M.C. Chen, Using incremental PLSI for threshold-resilient online event analysis, IEEE Trans. Knowl. Data Eng., 20 (2008) 289-299.
[27]
H. Wu, Y. Wang, X. Cheng, Incremental probabilistic latent semantic analysis for automatic question recommendation, in: RecSys '08: Proceedings of the 2008 ACM Conference on Recommender Systems, ACM, 2008, pp. 99-106.
[28]
K. Than, T.B. Ho, Fully sparse topic models, in: ECML/PKDD (1), Lecture Notes in Computer Science, Springer, 2012.
[29]
T. Kakkonen, N. Myller, E. Sutinen, J. Timonen, Comparison of dimension reduction methods for automated essay grading, Educ. Technol. Soc., 11 (2008) 275-288.
[30]
X. Yi, J. Allan, A comparative study of utilizing topic models for information retrieval, in: 31th European Conference on IR Research on Advances in Information Retrieval, ECIR '09, Springer-Verlag, 2009.
[31]
Y. Lu, Q. Mei, C. Zhai, Investigating task performance of probabilistic topic models: an empirical study of PLSA and LDA, Inf. Retr., 14 (2011) 178-203.
[32]
R. Zhang, Z. Zhang, Effective image retrieval based on hidden concept discovery in image database, IEEE Trans. Image Process., 16 (2007) 562-572.
[33]
Y.G. Jiang, G. Ye, S.F. Chang, D. Ellis, A.C. Loui, Consumer video understanding: a benchmark database and an evaluation of human and machine performance, 2011.
[34]
S. Ayache, G. Quénot, TRECVID 2007 collaborative annotation using active learning, in: In Proceedings of the TRECVID 2007 Workshop, 2007.
[35]
S. Deerwester, S.T. Dumais, G.W. Furnas, T.K. Landauer, R. Harshman, Indexing by latent semantic analysis, J. Mach. Learn. Res., 41 (1990) 391-407.
[36]
J. Chang, S. Gerrish, C. Wang, J.L. Boyd-graber, D.M. Blei, Reading tea leaves: how humans interpret topic models, in: Advances in Neural Information Processing Systems, 22, 2009, pp. 288-296.
[37]
Y.W. Teh, M.I. Jordan, M.J. Beal, D.M. Blei, Hierarchical Dirichlet processes, J. Am. Stat. Assoc., 101 (2004).
[38]
R. Arun, V. Suresh, C.E. Veni Madhavan, M.N. Narasimha Murthy, On finding the natural number of topics with latent Dirichlet allocation: some observations, in: Proceedings of the 14th Pacific-Asia Conference on Advances in Knowledge Discovery and Data Mining - Volume Part I, PAKDD'10, 2010.
[39]
R. Fernandez-Beltran, R. Montoliu, F. Pla, Vocabulary reduction in bow representing by topic modeling, in: IbPRIA'13, 2013, pp. 648-655.
[40]
T.L. Griffiths, M. Steyvers, Finding scientific topics, Proc. Natl. Acad. Sci., 101 (2004) 5228-5235.
[41]
K.E.A. van de Sande, T. Gevers, C.G.M. Snoek, Evaluating color descriptors for object and scene recognition, IEEE Trans. Pattern Anal. Mach. Intell., 32 (2010) 1582-1596.
[42]
J. Urbano, M. Marrero, D. Martn, A comparison of the optimality of statistical significance tests for information retrieval evaluation, in: Proceedings of the 36th International ACM SIGIR Conference on Research and Development in Information Retrieval, ACM, 2013, pp. 925-928.

Cited By

View all
  • (2023)VisDmk: visual analysis of massive emotional danmaku in online videosThe Visual Computer: International Journal of Computer Graphics10.1007/s00371-022-02748-z39:12(6553-6570)Online publication date: 1-Dec-2023
  • (2020)A novel time-shifting method to find popular blog post topicsSoft Computing - A Fusion of Foundations, Methodologies and Applications10.1007/s00500-019-04485-324:13(9705-9725)Online publication date: 1-Jul-2020
  • (2019)Based on The Document-Link and Time-Clue Relationships Between Blog Posts to Improve the Performance of Google Blog SearchInternational Journal on Semantic Web & Information Systems10.4018/IJSWIS.201901010315:1(52-75)Online publication date: 1-Jan-2019
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image Image and Vision Computing
Image and Vision Computing  Volume 38, Issue C
June 2015
74 pages

Publisher

Butterworth-Heinemann

United States

Publication History

Published: 01 June 2015

Author Tags

  1. Content-based Video Retrieval
  2. Information retrieval
  3. Latent topics
  4. Relevance Feedback
  5. probabilistic Latent Semantic Analysis (pLSA)

Qualifiers

  • Research-article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 03 Oct 2024

Other Metrics

Citations

Cited By

View all
  • (2023)VisDmk: visual analysis of massive emotional danmaku in online videosThe Visual Computer: International Journal of Computer Graphics10.1007/s00371-022-02748-z39:12(6553-6570)Online publication date: 1-Dec-2023
  • (2020)A novel time-shifting method to find popular blog post topicsSoft Computing - A Fusion of Foundations, Methodologies and Applications10.1007/s00500-019-04485-324:13(9705-9725)Online publication date: 1-Jul-2020
  • (2019)Based on The Document-Link and Time-Clue Relationships Between Blog Posts to Improve the Performance of Google Blog SearchInternational Journal on Semantic Web & Information Systems10.4018/IJSWIS.201901010315:1(52-75)Online publication date: 1-Jan-2019
  • (2018)Prior-based probabilistic latent semantic analysis for multimedia retrievalMultimedia Tools and Applications10.1007/s11042-017-5247-z77:13(16771-16793)Online publication date: 1-Jul-2018
  • (2017)Finding the Semantic Relationship Between Wikipedia Articles Based on a Useful Entry RelationshipInternational Journal of Data Warehousing and Mining10.4018/IJDWM.201710010313:4(33-52)Online publication date: 1-Oct-2017
  • (2016)An effective technique for the content based image retrieval to reduce the semantic gap based on an optimal classifier techniquePattern Recognition and Image Analysis10.1134/S105466181603015926:3(597-607)Online publication date: 1-Jul-2016

View Options

View options

Get Access

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media