Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3178876.3186136acmotherconferencesArticle/Chapter ViewAbstractPublication PagesthewebconfConference Proceedingsconference-collections
research-article
Free access

Computationally Inferred Genealogical Networks Uncover Long-Term Trends in Assortative Mating

Published: 23 April 2018 Publication History

Abstract

Genealogical networks, also known as family trees or population pedigrees, are commonly studied by genealogists wanting to know about their ancestry, but they also provide a valuable resource for disciplines such as digital demography, genetics, and computational social science. These networks are typically constructed by hand through a very time-consuming process, which requires comparing large numbers of historical records manually. We develop computational methods for automatically inferring large-scale genealogical networks. A comparison with human-constructed networks attests to the accuracy of the proposed methods. To demonstrate the applicability of the inferred large-scale genealogical networks, we present a longitudinal analysis on the mating patterns observed in a network. This analysis shows a consistent tendency of people choosing a spouse with a similar socioeconomic status, a phenomenon known as assortative mating. Interestingly, we do not observe this tendency to consistently decrease (nor increase) over our study period of 150 years.

References

[1]
Ricardo Baeza-Yates, Álvaro Pereira, and Nivio Ziviani. 2008. Genealogical trees on the Web: a search engine user perspective Proc. WWW.
[2]
Gerrit Bloothooft, Peter Christen, Kees Mandemakers, and Marijn Schraagen. 2015. Population Reconstruction. Springer.
[3]
Glenn W Brier. 1950. Verification of forecasts expressed in terms of probability. Monthey Weather Review Vol. 78, 1 (1950), 1--3.
[4]
Hans Henrik Bull. 2005. Deciding whom to marry in a rural two-class society: Social homogamy and constraints in the marriage market in Rendalen, Norway, 1750--1900. International review of social History Vol. 50, S13 (2005), 43--63.
[5]
Tianqi Chen and Carlos Guestrin. 2016. XGBoost: A scalable tree boosting system. In Proc. KDD.
[6]
P. Christen. 2016. Application of advanced record linkage techniques for complex population reconstruction. ArXiv e-prints (Dec. 2016). showeprint{arxiv}cs.DB/1612.04286
[7]
Peter Christen, Dinusha Vatsalan, and Zhichun Fu. 2015. Advanced record linkage methods and privacy aspects for population reconstruction--A survey and case studies. In Population Reconstruction. Springer, 87--110.
[8]
Kyle Cranmer, Juan Pavez, and Gilles Louppe. 2016. Approximating likelihood ratios with calibrated discriminative classifiers. ArXiv e-prints (March. 2016). showeprint{arxiv}stat.AP/1506.02169
[9]
Halbert L Dunn. 1946. Record linkage. American Journal of Public Health and the Nations Health Vol. 36, 12 (1946), 1412--1416.
[10]
Julia Efremova, Bijan Ranjbar-Sahraei, Hossein Rahmani, Frans A Oliehoek, Toon Calders, Karl Tuyls, and Gerhard Weiss. 2015. Multi-source entity resolution for genealogical data. In Population Reconstruction. Springer, 129--154.
[11]
Ivan P Fellegi and Alan B Sunter. 1969. A theory for record linkage. AmerStatAssoc Vol. 64, 328 (1969), 1183--1210.
[12]
Angelos P Giotis, Giorgos Sfikas, Basilis Gatos, and Christophoros Nikou. 2017. A survey of document image word spotting techniques. Pattern Recognition Vol. 68 (2017), 310--332.
[13]
Jeremy Greenwood, Nezih Guner, Georgi Kocharkov, and Cezar Santos. 2014. Marry your like: Assortative mating and income inequality. The American Economic Review Vol. 104, 5 (2014), 348--353.
[14]
Aija Greus and Harri Hirvel"a. 2016. Ammatit (Occupations). http://www.saunalahti.fi/hirvela/historismi_sivut/ammatit%20a-k.html. (2016). Accessed: 2017--10--25.
[15]
Kamal Jain and Vijay V Vazirani. 2001. Approximation algorithms for metric facility location and k-median problems using the primal-dual schema and Lagrangian relaxation. Journal of the ACM (JACM) Vol. 48, 2 (2001), 274--296.
[16]
Matthijs Kalmijn. 1998. Intermarriage and homogamy: Causes, patterns, trends. Annual review of sociology Vol. 24, 1 (1998), 395--421.
[17]
Pigi Kouki, Christopher Marcum, Laura Koehly, and Lise Getoor. 2016. Entity resolution in familial networks. In Proc. MLG.
[18]
Pigi Kouki, Jay Pujara, Christopher Marcum, Laura Koehly, and Lise Getoor. 2017. Collective Entity Resolution in Familial Networks. In Proc. ICDM.
[19]
Paul S Lambert, Richard L Zijdeman, Marco HD Van Leeuwen, Ineke Maas, and Kenneth Prandy. 2013. The construction of HISCAM: A stratification scale based on social interactions for historical comparative research. Historical Methods: A Journal of Quantitative and Interdisciplinary History Vol. 46, 2 (2013), 77--89.
[20]
David Lazer, Alex Sandy Pentland, Lada Adamic, Sinan Aral, Albert Laszlo Barabasi, Devon Brewer, Nicholas Christakis, Noshir Contractor, James Fowler, Myron Gutmann, et almbox. 2009. Life in the network: the coming age of computational social science. Science Vol. 323, 5915 (2009), 721--723.
[21]
Jianxun Lian and Xing Xie. 2016. Cross-device user matching based on massive browse logs: The runner-up solution for the 2016 CIKM Cup.
[22]
Eric Malmi, Sanjay Chawla, and Aristides Gionis. 2017 a. Lagrangian relaxations for multiple network alignment. Data Mining and Knowledge Discovery (2017), 1--28.
[23]
Eric Malmi, Marko Rasa, and Aristides Gionis. 2017 b. AncestryAI: A tool for exploring computationally inferred family trees Proc. WWW Companion.
[24]
Eric Malmi, Evimaria Terzi, and Aristides Gionis. 2017 c. Active Network Alignment: A Matching-Based Approach Proc. CIKM.
[25]
Alexandru Niculescu-Mizil and Rich Caruana. 2005. Predicting good probabilities with supervised learning Proc. ICML.
[26]
Bijan Ranjbar-Sahraei, Julia Efremova, Hossein Rahmani, Toon Calders, Karl Tuyls, and Gerhard Weiss. 2015. HiDER: Query-driven entity resolution for historical data Proc. ECML PKDD.
[27]
Yi Tay, Cong-Minh Phan, and Tuan-Anh Nguyen Pham. 2016. Cross device matching for online advertising with neural feature ensembles: First place solution at CIKM Cup 2016.
[28]
Marco HD Van Leeuwen, Ineke Maas, and Andrew Miles. 2002. HISCO: Historical international standard classification of occupations. Leuven University Press.
[29]
Ingmar Weber and Bogdan State. 2017. Digital Demography. In Proc. WWW Companion.

Cited By

View all
  • (2023)The persistent homology of genealogical networksApplied Network Science10.1007/s41109-023-00538-78:1Online publication date: 23-Feb-2023
  • (2022)Understanding the application of handwritten text recognition technology in heritage contexts: a systematic review of Transkribus in published researchArchival Science10.1007/s10502-022-09397-022:3(367-392)Online publication date: 17-Jun-2022
  • (2021)Reconciling and Using Historical Person Registers as Linked Open Data in the AcademySampo Portal and Data ServiceThe Semantic Web – ISWC 202110.1007/978-3-030-88361-4_42(714-730)Online publication date: 30-Sep-2021
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
WWW '18: Proceedings of the 2018 World Wide Web Conference
April 2018
2000 pages
ISBN:9781450356398
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

  • IW3C2: International World Wide Web Conference Committee

In-Cooperation

Publisher

International World Wide Web Conferences Steering Committee

Republic and Canton of Geneva, Switzerland

Publication History

Published: 23 April 2018

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. assortative mating
  2. family tree
  3. genealogy
  4. homogamy
  5. pedigree
  6. population reconstruction
  7. probabilistic record linkage
  8. social stratification

Qualifiers

  • Research-article

Funding Sources

  • European Commission

Conference

WWW '18
Sponsor:
  • IW3C2
WWW '18: The Web Conference 2018
April 23 - 27, 2018
Lyon, France

Acceptance Rates

WWW '18 Paper Acceptance Rate 170 of 1,155 submissions, 15%;
Overall Acceptance Rate 1,899 of 8,196 submissions, 23%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)74
  • Downloads (Last 6 weeks)18
Reflects downloads up to 10 Oct 2024

Other Metrics

Citations

Cited By

View all
  • (2023)The persistent homology of genealogical networksApplied Network Science10.1007/s41109-023-00538-78:1Online publication date: 23-Feb-2023
  • (2022)Understanding the application of handwritten text recognition technology in heritage contexts: a systematic review of Transkribus in published researchArchival Science10.1007/s10502-022-09397-022:3(367-392)Online publication date: 17-Jun-2022
  • (2021)Reconciling and Using Historical Person Registers as Linked Open Data in the AcademySampo Portal and Data ServiceThe Semantic Web – ISWC 202110.1007/978-3-030-88361-4_42(714-730)Online publication date: 30-Sep-2021
  • (2020)Database Concept for Transcription of Registry Records into Digital FormProceedings of the 3rd International Conference on Software Engineering and Information Management10.1145/3378936.3378974(21-25)Online publication date: 12-Jan-2020
  • (2019)Outlier Detection Based Accurate Geocoding of Historical AddressesData Mining10.1007/978-981-15-1699-3_4(41-53)Online publication date: 23-Nov-2019
  • (2019)Algorithmic Creation of Genealogical ModelsIntelligent Systems Design and Applications10.1007/978-3-030-16660-1_63(650-658)Online publication date: 14-Apr-2019

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format.

HTML Format

Get Access

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media