Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3534678.3539050acmconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
research-article

Data-Driven Oracle Bone Rejoining: A Dataset and Practical Self-Supervised Learning Scheme

Published: 14 August 2022 Publication History

Abstract

Oracle Bone Inscriptions (OBI) is one of the oldest scripts in the world. The rejoining of Oracle Bone (OB) fragments is of vital importance to the research of ancient scripts and history. Although significant progress has been achieved in the past decades, the rejoining work still heavily relies on domain knowledge and manual work, thus remains a low efficient and time-consuming process Therefore, an automatic and practical algorithm/system for OB rejoining is of great value to the OBI community. To this end, we collect a real-world dataset for rejoining Oracle Bone fragments, namely OB-Rejoin, which consists of 998 OB rubbing images that suffer from low quality image problems, due to intrinsic underground eroding over time and extrinsic imaging conditions in the past. Moreover, a practical Self-Supervised Splicing Network, S3-Net, is proposed to rejoin the OB fragments based on shape similarity of their borderlines. Specifically, we first transform the manually annotated borderline strokes of OB images into times series style shape representations, which are fed as input to a Generative Adversarial Network for augmenting positive pairs of rejoinable OBs for each OB fragment that does not have rejoinable counterparts. A Siamese network is trained on such augmented data in a contrastive learning manner to retrieve the matching OB fragments of an unseen query from an OB fragment gallery. Experiments on the OB-Rejoin benchmark show that our data-driven approach outperforms two recent methods for time-series analysis. In order to demonstrate its practical potential, we deploy the proposed S3-Net method in real tests and ultimately discover dozens of new rejoinings missed by domain experts for decades.

References

[1]
Anthony Bagnall, Jason Lines, Aaron Bostrom, James Large, and Eamonn Keogh. 2017. The great time series classification bake off: a review and experimental evaluation of recent algorithmic advances. Data Mining and Knowledge Discovery 31, 3 (2017), 606--660.
[2]
Donald J Berndt and James Clifford. 1994. Using dynamic time warping to find patterns in time series. In KDD workshop, Vol. 10. 359--370.
[3]
Xingyu Cai, Tingyang Xu, Jinfeng Yi, Junzhou Huang, and Sanguthevar Rajasekaran. 2019. DTWNet: a Dynamic Time Warping Network. In Advances in Neural Information Processing Systems 32, H. Wallach, H. Larochelle, A. Beygelzimer, F. d'Alché-Buc, E. Fox, and R. Garnett (Eds.). 11640--11650.
[4]
Hoang Anh Dau, Diego Furtado Silva, François Petitjean, Germain Forestier, Anthony J. Bagnall, Abdullah Mueen, and Eamonn J. Keogh. 2018. Optimizing dynamic time warping's window width for time series data mining applications. Data Min. Knowl. Discov. 32, 4 (2018), 1074--1120.
[5]
Hui Ding, Goce Trajcevski, Peter Scheuermann, Xiaoyue Wang, and Eamonn Keogh. 2008. Querying and mining of time series data: experimental comparison of representations and distance measures. Proceedings of the VLDB Endowment 1, 2 (2008), 1542--1552.
[6]
Mikita Dvornik, Isma Hadji, Konstantinos G Derpanis, Animesh Garg, and Allan Jepson. 2021. Drop-DTW: Aligning Common Signal Between Sequences While Dropping Outliers. Advances in Neural Information Processing Systems.
[7]
Yannis Assael et al. 2022. Restoring and attributing ancient texts using deep neural networks. Nature 603, 7900 (2022), 280--283.
[8]
Morris Franken and Jan C. van Gemert. 2013. Automatic Egyptian Hieroglyph Recognition by Retrieving Images As Texts. In ACM MM 2013. 765--768.
[9]
Ian J. Goodfellow, Jean Pouget-Abadie, Mehdi Mirza, Bing Xu, David Warde-Farley, Sherjil Ozair, Aaron C. Courville, and Yoshua Bengio. 2014. Generative Adversarial Nets. In Advances in Neural Information Processing Systems. 2672--2680.
[10]
Ishaan Gulrajani, Faruk Ahmed, Martín Arjovsky, Vincent Dumoulin, and Aaron C. Courville. 2017. Improved Training of Wasserstein GANs. In Advances in Neural Information Processing Systems. 5767--5777.
[11]
J. Guo, C. Wang, E. Roman-Rangel, H. Chao, and Y. Rui. 2016. Building Hierarchical Representations for Oracle Character and Sketch Recognition. IEEE Transactions on Image Processing 25, 1 (2016), 104--118. https://doi.org/10.1109/TIP.2015.2500019
[12]
Moruo Guo. 1982. Collections of oracle-bone inscriptions. The Zhong Hua Book Press.
[13]
R. Hu, G. Can, C. Pallan Gayol, G. Krempel, J. Spotak, G. Vail, S. Marchand-Maillet, J. Odobez, and D. Gatica-Perez. 2015. Multimedia Analysis and Access of Ancient Maya Epigraphy: Tools to support scholars on Maya hieroglyphics. IEEE Signal Processing Magazine 32, 4 (2015), 75--84.
[14]
Tianshu Huang. 2019. The Fifth Collection of Oracle Bone Rejoinings. The Xue Yuan Book Press.
[15]
Hongming Lin. 2008. Zui Gu: Research on Oracle Bone Rejoinings. The Taiwan Book Store Press.
[16]
Tsung-Yi Lin, Priya Goyal, Ross B. Girshick, Kaiming He, and Piotr Dollár. 2017. Focal Loss for Dense Object Detection. In IEEE International Conference on Computer Vision. 2999--3007.
[17]
Yi Men. 2008. Six Groups of Newly Joined Huang Type Oracle Bone Inscriptions. Cultural Relics of Central China 2 (2008), 88--91.
[18]
Lin Meng. 2017. Two-Stage Recognition for Oracle Bone Inscriptions. In Image Analysis and Processing - ICIAP 2017 - 19th International Conference, Catania, Italy, September 11--15, 2017, Proceedings, Part II. 672--682. https://doi.org/10.1007/978--3--319--68548--9_61
[19]
Bofeng Mo. 2012. Five Groups of Assemblages Oracle Bone Inscription Fragments in Oracle Bone Inscription Collection of Shanghai Museum. Chinese Palace Museum Journal 4 (2012), 120--124.
[20]
Diego F. Silva, Rafael Giusti, Eamonn J. Keogh, and Gustavo E. A. P. A. Batista. 2018. Speeding up similarity search under dynamic time warping by pruning unpromising alignments. Data Min. Knowl. Discov. 32, 4 (2018), 988--1016.
[21]
Chemao Tsai. 1999. Catalog of Oracle Bone Rejoinings. The Le Xue Book Press.
[22]
Changqing Zhang and Aimin Wang. 2012. Method for computer aided rejoining of bones/tortoise shells rubbing. Electronic Design Engineering 20, 17 (2012), 1--3.
[23]
Chongsheng Zhang, Ruixing Zong, Shuang Cao, Yi Men, and Bofeng Mo. 2020. AI-Powered Oracle Bone Inscriptions Recognition and Fragments Rejoining. In Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence (IJCAI). 5309--5311.
[24]
Weijie Zhang. 2019. The importance of oracle rejoining in the study of ancient characters and the history of the Shang Dynasty. Journal of Chinese Writing Systems 3, 1 (2019), 19--28. https://doi.org/10.1177/2513850218772679

Cited By

View all
  • (2024)Gca-pvt-net: group convolutional attention and PVT dual-branch network for oracle bone drill chisel segmentationHeritage Science10.1186/s40494-024-01378-z12:1Online publication date: 29-Jul-2024
  • (2024)R-GNN: recurrent graph neural networks for font classification of oracle bone inscriptionsHeritage Science10.1186/s40494-024-01133-412:1Online publication date: 29-Jan-2024
  • (2024)Linking unknown characters via oracle bone inscriptions retrievalMultimedia Systems10.1007/s00530-024-01327-730:3Online publication date: 1-Jun-2024
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
KDD '22: Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining
August 2022
5033 pages
ISBN:9781450393850
DOI:10.1145/3534678
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 14 August 2022

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. contrastive learning
  2. data augmentation
  3. oracle bone rejoining

Qualifiers

  • Research-article

Conference

KDD '22
Sponsor:

Acceptance Rates

Overall Acceptance Rate 1,133 of 8,635 submissions, 13%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)111
  • Downloads (Last 6 weeks)7
Reflects downloads up to 13 Sep 2024

Other Metrics

Citations

Cited By

View all
  • (2024)Gca-pvt-net: group convolutional attention and PVT dual-branch network for oracle bone drill chisel segmentationHeritage Science10.1186/s40494-024-01378-z12:1Online publication date: 29-Jul-2024
  • (2024)R-GNN: recurrent graph neural networks for font classification of oracle bone inscriptionsHeritage Science10.1186/s40494-024-01133-412:1Online publication date: 29-Jan-2024
  • (2024)Linking unknown characters via oracle bone inscriptions retrievalMultimedia Systems10.1007/s00530-024-01327-730:3Online publication date: 1-Jun-2024
  • (2024)AI for the Restoration of Ancient Inscriptions: A Computational Linguistics PerspectiveDecoding Cultural Heritage10.1007/978-3-031-57675-1_7(137-154)Online publication date: 25-Apr-2024
  • (2023)Machine Learning for Ancient Languages: A SurveyComputational Linguistics10.1162/coli_a_0048149:3(703-747)Online publication date: 1-Sep-2023
  • (2023)Reconnecting the Broken Civilization: Patchwork Integration of Fragments from Ancient ManuscriptsProceedings of the 31st ACM International Conference on Multimedia10.1145/3581783.3613804(1157-1166)Online publication date: 26-Oct-2023
  • (2023)SFF-Siam: A New Oracle Bone Rejoining Method Based on Siamese NetworkIEEE Computer Graphics and Applications10.1109/MCG.2023.328400043:6(22-32)Online publication date: 14-Jun-2023
  • (2023)Interactively Rejioning 2D Oracle Bone Fragments Based on Contour Matching2023 9th International Conference on Virtual Reality (ICVR)10.1109/ICVR57957.2023.10169751(163-170)Online publication date: 12-May-2023
  • (2023)Non-Redundant Image Clustering of Early Medieval Glass Beads2023 IEEE 10th International Conference on Data Science and Advanced Analytics (DSAA)10.1109/DSAA60987.2023.10302468(1-12)Online publication date: 9-Oct-2023

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media