Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3321408.3322624acmotherconferencesArticle/Chapter ViewAbstractPublication Pagesacm-turcConference Proceedingsconference-collections
research-article

Progressive deep feature learning for manga character recognition via unlabeled training data

Published: 17 May 2019 Publication History

Abstract

The recognition of manga (Japanese comics) characters is an essential step in industrial applications, such as manga character retrieval, content analysis and copyright protection. However, conventional methods for manga character recognition are mainly based on handcrafted features which are not robust enough for manga of various style. The emergence of deep learning based methods provides representational features, which has a huge demand for labeled data. In this paper, we propose a framework to exploit unlabeled manga data to facilitate the discriminative capability of deep feature representations for manga character recognition (i.e., unsupervised learning on manga images), which does not rely on any manual annotation. Specifically, we first train an initial feature model using an anime character dataset. Then, we adopt a Progressive Main Characters Mining (PMCM) strategy which iterates between two steps: 1) produce selected data with estimated labels from unlabeled data, 2) update the feature model by the selected data. These two steps are mutually promoted in essence. Experimental results on Manga109 dataset, to which we introduce new head annotations, demonstrate the effectiveness of the proposed framework and the usefulness in manga character verification and retrieval.

References

[1]
Hehe Fan, Liang Zheng, Chenggang Yan, and Yi Yang. 2018. Unsupervised person re-identification: Clustering and fine-tuning. ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM) 14, 4 (2018), 83.
[2]
Raia Hadsell, Sumit Chopra, and Yann LeCun. 2006. Dimensionality reduction by learning an invariant mapping. In null. IEEE, 1735--1742.
[3]
Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. 2016. Deep residual learning for image recognition. In Proceedings of the IEEE conference on computer vision and pattern recognition. 770--778.
[4]
Gary B Huang, Marwan Mattar, Tamara Berg, and Eric Learned-Miller. 2008. Labeled faces in the wild: A database forstudying face recognition in unconstrained environments. In Workshop on faces in'Real-Life'Images: detection, alignment, and recognition.
[5]
Motoi Iwata, Eiki Imazu, and Koichi Kise. 2015. Similarity learning based on pool-based active learning for manga character retrieval. In Pattern Recognition (ACPR), 2015 3rd IAPR Asian Conference on. IEEE, 437--442.
[6]
Motoi Iwata, Atsushi Ito, and Koichi Kise. 2014. A study to achieve manga character retrieval method for manga images. In Document Analysis Systems (DAS), 2014 11th IAPR International Workshop on. IEEE, 309--313.
[7]
Thanh-Nam Le, Muhammad Muzzamil Luqman, Jean-Christophe Burie, and Jean-Marc Ogier. 2015. Content-based comic retrieval using multilayer graph representation and frequent graph mining. In Document Analysis and Recognition (ICDAR), 2015 13th International Conference on. IEEE, 761--765.
[8]
Jianming Lv, Weihang Chen, Qing Li, and Can Yang. 2018. Unsupervised Cross-dataset Person Re-identification by Transfer Learning of Spatial-Temporal Patterns. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 7948--7956.
[9]
Yusuke Matsui, Kota Ito, Yuji Aramaki, Azuma Fujimoto, Toru Ogawa, Toshihiko Yamasaki, and Kiyoharu Aizawa. 2017. Sketch-based manga retrieval using manga109 dataset. Multimedia Tools and Applications 76, 20 (2017), 21811--21838.
[10]
Rei Narita, Koki Tsubota, Toshihiko Yamasaki, and Kiyoharu Aizawa. 2017. Sketch-Based Manga Retrieval Using Deep Features. In Document Analysis and Recognition (ICDAR), 2017 14th IAPR International Conference on, Vol. 3. IEEE, 49--53.
[11]
Adam Paszke, Sam Gross, Soumith Chintala, Gregory Chanan, Edward Yang, Zachary DeVito, Zeming Lin, Alban Desmaison, Luca Antiga, and Adam Lerer. 2017. Automatic differentiation in pytorch. (2017).
[12]
Xiaoran Qin, Yafeng Zhou, Zheqi He, Yongtao Wang, and Zhi Tang. 2017. A Faster R-CNN Based Method for Comic Characters Face Detection. In Document Analysis and Recognition (ICDAR), 2017 14th IAPR International Conference on, Vol. 1. IEEE, 1074--1080.
[13]
Christophe Rigaud, Dimosthenis Karatzas, Jean-Christophe Burie, and Jean-Marc Ogier. 2014. Color descriptor for content-based drawing retrieval. In Document Analysis Systems (DAS), 2014 11th IAPR International Workshop on. IEEE, 267--271.
[14]
Olga Russakovsky, Jia Deng, Hao Su, Jonathan Krause, Sanjeev Satheesh, Sean Ma, Zhiheng Huang, Andrej Karpathy, Aditya Khosla, Michael Bernstein, et al. 2015. Imagenet large scale visual recognition challenge. International Journal of Computer Vision 115, 3 (2015), 211--252.
[15]
Florian Schroff, Dmitry Kalenichenko, and James Philbin. 2015. Facenet: A unified embedding for face recognition and clustering. In Proceedings of the IEEE conference on computer vision and pattern recognition. 815--823.
[16]
Weihan Sun, Jean-Christophe Burie, Jean-Marc Ogier, and Koichi Kise. 2013. Specific comic character detection using local feature matching. In Document Analysis and Recognition (ICDAR), 2013 12th International Conference on. IEEE, 275--279.
[17]
Yandong Wen, Kaipeng Zhang, Zhifeng Li, and Yu Qiao. 2016. A discriminative feature learning approach for deep face recognition. In European Conference on Computer Vision. Springer, 499--515.
[18]
Yu Wu, Yutian Lin, Xuanyi Dong, Yan Yan, Wanli Ouyang, and Yi Yang. 2018. Exploit the unknown gradually: One-shot video-based person re-identification by stepwise learning. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 5177--5186.
[19]
Mang Ye, Xiangyuan Lan, and Pong C Yuen. 2018. Robust anchor embedding for unsupervised video person re-identification in the wild. In Proceedings of the European Conference on Computer Vision (ECCV). 170--186.
[20]
Mang Ye, Andy J Ma, Liang Zheng, Jiawei Li, and Pong C Yuen. 2017. Dynamic label graph matching for unsupervised video re-identification. In The IEEE international conference on computer vision (ICCV), Vol. 6.
[21]
Xiaohang Zhan, Ziwei Liu, Junjie Yan, Dahua Lin, and Chen Change Loy. 2018. Consensus-driven propagation in massive unlabeled data for face recognition. In Proceedings of the European Conference on Computer Vision (ECCV). 568--583.
[22]
Liang Zheng, Hengheng Zhang, Shaoyan Sun, Manmohan Chandraker, Yi Yang, and Qi Tian. 2017. Person re-identification in the wild. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 1367--1376.

Cited By

View all
  • (2024)The Manga Whisperer: Automatically Generating Transcriptions for Comics2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)10.1109/CVPR52733.2024.01232(12967-12976)Online publication date: 16-Jun-2024
  • (2023)Anime Character Identification and Tag Prediction by Multimodality Modeling: Dataset and Model2023 International Joint Conference on Neural Networks (IJCNN)10.1109/IJCNN54540.2023.10191980(1-8)Online publication date: 18-Jun-2023
  • (2022)Cocomix: Utilizing Comments to Improve Non-Visual Webtoon AccessibilityProceedings of the 2022 CHI Conference on Human Factors in Computing Systems10.1145/3491102.3502081(1-18)Online publication date: 29-Apr-2022
  • Show More Cited By

Index Terms

  1. Progressive deep feature learning for manga character recognition via unlabeled training data

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    ACM TURC '19: Proceedings of the ACM Turing Celebration Conference - China
    May 2019
    963 pages
    ISBN:9781450371582
    DOI:10.1145/3321408
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 17 May 2019

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. label estimation
    2. manga character recognition
    3. manga images
    4. unsupervised learning

    Qualifiers

    • Research-article

    Funding Sources

    • National Natural Science Foundation of China

    Conference

    ACM TURC 2019

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)12
    • Downloads (Last 6 weeks)1
    Reflects downloads up to 04 Oct 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)The Manga Whisperer: Automatically Generating Transcriptions for Comics2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)10.1109/CVPR52733.2024.01232(12967-12976)Online publication date: 16-Jun-2024
    • (2023)Anime Character Identification and Tag Prediction by Multimodality Modeling: Dataset and Model2023 International Joint Conference on Neural Networks (IJCNN)10.1109/IJCNN54540.2023.10191980(1-8)Online publication date: 18-Jun-2023
    • (2022)Cocomix: Utilizing Comments to Improve Non-Visual Webtoon AccessibilityProceedings of the 2022 CHI Conference on Human Factors in Computing Systems10.1145/3491102.3502081(1-18)Online publication date: 29-Apr-2022
    • (2022)A Challenging Benchmark of Anime Style Recognition2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)10.1109/CVPRW56347.2022.00518(4720-4729)Online publication date: Jun-2022
    • (2021)Dual Loss for Manga Character Recognition with Imbalanced Training Data2020 25th International Conference on Pattern Recognition (ICPR)10.1109/ICPR48806.2021.9412282(2166-2171)Online publication date: 10-Jan-2021
    • (2020)Automatic Intelligent Korean Character Semantic Recognition and Analysis Framework based on Machine Learning2020 4th International Conference on Electronics, Communication and Aerospace Technology (ICECA)10.1109/ICECA49313.2020.9297475(1683-1687)Online publication date: 5-Nov-2020

    View Options

    Get Access

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media