Research article · CBMI Conference Proceedings · DOI: 10.1145/3095713.3095746

Shape Representations for Maya Codical Glyphs: Knowledge-driven or Deep?

Published: 19 June 2017

Abstract

This paper investigates two types of shape representations for individual Maya codical glyphs: traditional bag-of-words representations built on the knowledge-driven local shape descriptor HOOSC, and representations learned from data with Convolutional Neural Networks (CNNs). For the CNN representations, we first evaluate the activations of typical CNNs pretrained on large-scale image datasets; second, we train a CNN from scratch on all the available individual glyph segments. One of the main challenges in training CNNs here is the limited amount of available data, together with class imbalance. We address the imbalance issue by introducing class weights into the loss computation during training; another option is to oversample minority-class samples during batch selection. We show that the deep representations outperform the knowledge-driven ones, but that CNN training requires special care for small-scale, imbalanced data, which is often the case in the cultural heritage domain.
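The two imbalance remedies mentioned above can be illustrated concretely. The sketch below is not from the paper; the function names are illustrative, and it assumes inverse-frequency class weights (each class weighted inversely to its sample count, normalized so the average sample weight is 1), which is one common choice for class-weighted losses and for weighting minority-class samples during batch selection.

```python
import random
from collections import Counter

def inverse_frequency_weights(labels):
    """Per-class weights inversely proportional to class frequency,
    normalized so the average weight over all samples is 1."""
    counts = Counter(labels)
    n, k = len(labels), len(counts)
    return {c: n / (k * cnt) for c, cnt in counts.items()}

def oversampled_batch(samples, labels, batch_size, rng=random):
    """Draw a batch where each sample's selection probability is its
    class weight, so minority-class samples are picked more often."""
    w = inverse_frequency_weights(labels)
    probs = [w[y] for y in labels]
    return rng.choices(samples, weights=probs, k=batch_size)

# With 8 samples of class "a" and 2 of class "b", the rare class "b"
# receives a larger weight, pushing the loss (or sampler) toward it.
labels = ["a"] * 8 + ["b"] * 2
weights = inverse_frequency_weights(labels)
```

In practice these per-class weights would either multiply the per-sample loss terms during training (the class-weighted loss the abstract describes) or drive the batch sampler as above; deep-learning frameworks typically accept such a weight map directly in their loss or data-loading APIs.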


Cited By

  • (2019) Improved Hieroglyph Representation for Image Retrieval. Journal on Computing and Cultural Heritage 12(2), 1-15. DOI: 10.1145/3284388. Online publication date: 30-Apr-2019.
  • (2018) How to Tell Ancient Signs Apart? Recognizing and Visualizing Maya Glyphs with CNNs. Journal on Computing and Cultural Heritage 11(4), 1-25. DOI: 10.1145/3230670. Online publication date: 5-Dec-2018.


Published In

CBMI '17: Proceedings of the 15th International Workshop on Content-Based Multimedia Indexing
June 2017
237 pages
ISBN:9781450353335
DOI:10.1145/3095713
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Publisher

Association for Computing Machinery, New York, NY, United States


Author Tags

  1. Maya glyphs
  2. convolutional neural networks
  3. shape recognition

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Conference

CBMI '17
