Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3206025.3206039acmconferencesArticle/Chapter ViewAbstractPublication PagesicmrConference Proceedingsconference-collections
research-article

PatternNet: Visual Pattern Mining with Deep Neural Network

Published: 05 June 2018 Publication History

Abstract

Visual patterns represent the discernible regularity in the visual world. They capture the essential nature of visual objects or scenes. Understanding and modeling visual patterns is a fundamental problem in visual recognition that has wide ranging applications. In this paper, we study the problem of visual pattern mining and propose a novel deep neural network architecture called PatternNet for discovering these patterns that are both discriminative and representative. The proposed PatternNet leverages the filters in the last convolution layer of a convolutional neural network to find locally consistent visual patches, and by combining these filters we can effectively discover unique visual patterns. In addition, PatternNet can discover visual patterns efficiently without performing expensive image patch sampling, and this advantage provides an order of magnitude speedup compared to most other approaches. We evaluate the proposed PatternNet subjectively by showing randomly selected visual patterns which are discovered by our method and quantitatively by performing image classification with the identified visual patterns and comparing our performance with the current state-of-the-art. We also directly evaluate the quality of the discovered visual patterns by leveraging the identified patterns as proposed objects in an image and compare with other relevant methods. Our proposed network and procedure, PatterNet, is able to outperform competing methods for the tasks described.

References

[1]
R. Agrawal, T. Imieli'nski, and A. Swami. Mining association rules between sets of items in large databases. In ACM SIGMOD Record, volume 22, pages 207--216. ACM, 1993.
[2]
B. Alexe, T. Deselaers, and V. Ferrari. Measuring the objectness of image windows. Pattern Analysis and Machine Intelligence, IEEE Transactions on, 34(11):2189--2202, 2012.
[3]
T. Berg and P. N. Belhumeur. Poof: Part-based one-vs.-one features for fine-grained categorization, face verification, and attribute estimation. In Computer Vision and Pattern Recognition (CVPR), 2013 IEEE Conference on, pages 955--962. IEEE, 2013.
[4]
J. Carreira and C. Sminchisescu. Constrained parametric min-cuts for automatic object segmentation. In Computer Vision and Pattern Recognition (CVPR), 2010 IEEE Conference on, pages 3241--3248. IEEE, 2010.
[5]
Y. Chai, V. Lempitsky, and A. Zisserman. Symbiotic segmentation and part localization for fine-grained categorization. In Computer Vision (ICCV), 2013 IEEE International Conference on, pages 321--328. IEEE, 2013.
[6]
G. Chen, J. Yang, H. Jin, E. Shechtman, J. Brandt, and T. X. Han. Selective pooling vector for fine-grained recognition. In Applications of Computer Vision (WACV), 2015 IEEE Winter Conference on, pages 860--867. IEEE, 2015.
[7]
C. Doersch, A. Gupta, and A. A. Efros. Mid-level visual element discovery as discriminative mode seeking. In Advances in Neural Information Processing Systems, pages 494--502, 2013.
[8]
I. Endres and D. Hoiem. Category independent object proposals. In Computer Vision--ECCV 2010, pages 575--588. Springer, 2010.
[9]
E. Gavves, B. Fernando, C. G. Snoek, A. W. Smeulders, and T. Tuytelaars. Fine-grained categorization by alignments. In Proceedings of the IEEE International Conference on Computer Vision, pages 1713--1720, 2013.
[10]
E. Gavves, B. Fernando, C. G. Snoek, A. W. Smeulders, and T. Tuytelaars. Local alignments for fine-grained categorization. International Journal of Computer Vision, 111(2):191--212, 2014.
[11]
R. Girshick, J. Donahue, T. Darrell, and J. Malik. Rich feature hierarchies for accurate object detection and semantic segmentation. In Computer Vision and Pattern Recognition (CVPR), 2014 IEEE Conference on, pages 580--587. IEEE, 2014.
[12]
B. Hariharan, P. Arbeláez, R. Girshick, and J. Malik. Hypercolumns for object segmentation and fine-grained localization. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages 447--456, 2015.
[13]
H. Harzallah, F. Jurie, and C. Schmid. Combining efficient object localization and image classification. In Computer Vision, 2009 IEEE 12th International Conference on, pages 237--244. IEEE, 2009.
[14]
M. Juneja, A. Vedaldi, C. Jawahar, and A. Zisserman. Blocks that shout: Distinctive parts for scene classification. In Computer Vision and Pattern Recognition (CVPR), 2013 IEEE Conference on, pages 923--930. IEEE, 2013.
[15]
J. Krause, H. Jin, J. Yang, and L. Fei-Fei. Fine-grained recognition without part annotations. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages 5546--5555, 2015.
[16]
A. Krizhevsky, I. Sutskever, and G. E. Hinton. Imagenet classification with deep convolutional neural networks. In Advances in neural information processing systems, pages 1097--1105, 2012.
[17]
L.-J. Li, H. Su, L. Fei-Fei, and E. P. Xing. Object bank: A high-level image representation for scene classification & semantic feature sparsification. In Advances in neural information processing systems, pages 1378--1386, 2010.
[18]
Q. Li, J. Wu, and Z. Tu. Harvesting mid-level visual concepts from large-scale internet images. In Computer Vision and Pattern Recognition (CVPR), 2013 IEEE Conference on, pages 851--858. IEEE, 2013.
[19]
Y. Li, L. Liu, C. Shen, and A. van den Hengel. Mid-level deep pattern mining. In CVPR, 2015.
[20]
D. G. Lowe. Object recognition from local scale-invariant features. In Computer vision, 1999. The proceedings of the seventh IEEE international conference on, volume 2, pages 1150--1157. Ieee, 1999.
[21]
S. N. Parizi, A. Vedaldi, A. Zisserman, and P. Felzenszwalb. Automatic discovery and optimization of parts for image classification. arXiv preprint arXiv:1412.6598, 2014.
[22]
J. Pu, Y.-G. Jiang, J. Wang, and X. Xue. Which looks like which: Exploring inter-class relationships in fine-grained visual categorization. In Computer Vision--ECCV 2014, pages 425--440. Springer, 2014.
[23]
A. Quattoni and A. Torralba. Recognizing indoor scenes. In Computer Vision and Pattern Recognition, 2009. CVPR 2009. IEEE Conference on, pages 413--420. IEEE, 2009.
[24]
P. Sermanet, A. Frome, and E. Real. Attention for fine-grained categorization. arXiv preprint arXiv:1412.7054, 2014.
[25]
K. Simonyan and A. Zisserman. Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556, 2014.
[26]
S. Singh, A. Gupta, and A. Efros. Unsupervised discovery of mid-level discriminative patches. Computer Vision--ECCV 2012, pages 73--86, 2012.
[27]
J. Sun and J. Ponce. Learning discriminative part detectors for image classification and cosegmentation. In Computer Vision (ICCV), 2013 IEEE International Conference on, pages 3400--3407. IEEE, 2013.
[28]
C. Szegedy, W. Liu, Y. Jia, P. Sermanet, S. Reed, D. Anguelov, D. Erhan, V. Vanhoucke, and A. Rabinovich. Going deeper with convolutions. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages 1--9, 2015.
[29]
J. R. Uijlings, K. E. van de Sande, T. Gevers, and A. W. Smeulders. Selective search for object recognition. International journal of computer vision, 104(2):154--171, 2013.
[30]
A. Vedaldi, V. Gulshan, M. Varma, and A. Zisserman. Multiple kernels for object detection. In Computer Vision, 2009 IEEE 12th International Conference on, pages 606--613. IEEE, 2009.
[31]
M. D. Zeiler and R. Fergus. Visualizing and understanding convolutional networks. In Computer Vision--ECCV 2014, pages 818--833. Springer, 2014.
[32]
N. Zhang, J. Donahue, R. Girshick, and T. Darrell. Part-based r-cnns for fine-grained category detection. In Computer Vision--ECCV 2014, pages 834--849. Springer, 2014.
[33]
W. Zhang, H. Li, C.-W. Ngo, and S.-F. Chang. Scalable visual instance mining with threads of features. In Proceedings of the ACM International Conference on Multimedia, pages 297--306. ACM, 2014.

Cited By

View all
  • (2024)Multipattern Mining Using Pattern-Level Contrastive Learning and Multipattern Activation MapIEEE Transactions on Neural Networks and Learning Systems10.1109/TNNLS.2022.321807335:7(9080-9094)Online publication date: Jul-2024
  • (2024)Unsupervised discovery of Interpretable Visual ConceptsInformation Sciences10.1016/j.ins.2024.120159(120159)Online publication date: Jan-2024
  • (2023)C-SAW: Self-Supervised Prompt Learning for Image Generalization in Remote SensingProceedings of the Fourteenth Indian Conference on Computer Vision, Graphics and Image Processing10.1145/3627631.3627669(1-10)Online publication date: 15-Dec-2023
  • Show More Cited By

Index Terms

  1. PatternNet: Visual Pattern Mining with Deep Neural Network

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    ICMR '18: Proceedings of the 2018 ACM on International Conference on Multimedia Retrieval
    June 2018
    550 pages
    ISBN:9781450350464
    DOI:10.1145/3206025
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 05 June 2018

    Permissions

    Request permissions for this article.

    Check for updates

    Badges

    • Best Poster

    Author Tags

    1. convolutional neural network
    2. image classification
    3. object proposal
    4. visual pattern mining

    Qualifiers

    • Research-article

    Conference

    ICMR '18
    Sponsor:

    Acceptance Rates

    ICMR '18 Paper Acceptance Rate 44 of 136 submissions, 32%;
    Overall Acceptance Rate 254 of 830 submissions, 31%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)18
    • Downloads (Last 6 weeks)4
    Reflects downloads up to 15 Oct 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Multipattern Mining Using Pattern-Level Contrastive Learning and Multipattern Activation MapIEEE Transactions on Neural Networks and Learning Systems10.1109/TNNLS.2022.321807335:7(9080-9094)Online publication date: Jul-2024
    • (2024)Unsupervised discovery of Interpretable Visual ConceptsInformation Sciences10.1016/j.ins.2024.120159(120159)Online publication date: Jan-2024
    • (2023)C-SAW: Self-Supervised Prompt Learning for Image Generalization in Remote SensingProceedings of the Fourteenth Indian Conference on Computer Vision, Graphics and Image Processing10.1145/3627631.3627669(1-10)Online publication date: 15-Dec-2023
    • (2023)APPLeNet: Visual Attention Parameterized Prompt Learning for Few-Shot Remote Sensing Image Generalization using CLIP2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)10.1109/CVPRW59228.2023.00196(2024-2034)Online publication date: Jun-2023
    • (2023)Stable Visual Pattern Mining via Pattern Probability DistributionPattern Recognition and Computer Vision10.1007/978-981-99-8543-2_23(280-292)Online publication date: 29-Dec-2023
    • (2022)Application of Transfer Learning and Convolutional Neural Networks for Autonomous Oil Sheen MonitoringApplied Sciences10.3390/app1217886512:17(8865)Online publication date: 3-Sep-2022
    • (2022)Proposal of a Swimming Pool Drowning Detection System using Cameras and Raspberry Pi based on Machine Learning2022 2nd International Conference on Robotics, Automation and Artificial Intelligence (RAAI)10.1109/RAAI56146.2022.10092956(178-181)Online publication date: 9-Dec-2022
    • (2022)Visual Explanation Generation Based on Lambda Attention Branch NetworksComputer Vision – ACCV 202210.1007/978-3-031-26284-5_29(475-490)Online publication date: 4-Dec-2022
    • (2022)Passive User Authentication Utilizing Consecutive Touch Action Features for IIoT SystemsScience of Cyber Security10.1007/978-3-031-17551-0_18(276-284)Online publication date: 10-Aug-2022
    • (2021)Landscape Similarity Analysis Using Texture Encoded Deep-Learning Features on Unclassified Remote Sensing ImageryRemote Sensing10.3390/rs1303049213:3(492)Online publication date: 30-Jan-2021
    • Show More Cited By

    View Options

    Get Access

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media