Export Citations
Save this search
Please login to be able to save your searches and receive alerts for new content matching your search criteria.
- research-articleMay 2020
SLIM: Scalable Linkage of Mobility Data
SIGMOD '20: Proceedings of the 2020 ACM SIGMOD International Conference on Management of DataPages 1181–1196https://doi.org/10.1145/3318464.3389761We present a scalable solution to link entities across mobility datasets using their spatio-temporal information. This is a fundamental problem in many applications such as linking user identities for security, understanding privacy limitations of ...
- research-articleMay 2020
ZeroER: Entity Resolution using Zero Labeled Examples
SIGMOD '20: Proceedings of the 2020 ACM SIGMOD International Conference on Management of DataPages 1149–1164https://doi.org/10.1145/3318464.3389743Entity resolution (ER) refers to the problem of matching records in one or more relations that refer to the same real-world entity. While supervised machine learning (ML) approaches achieve the state-of-the-art results, they require a large amount of ...
- research-articleMay 2020
Entity Matching in the Wild: A Consistent and Versatile Framework to Unify Data in Industrial Applications
SIGMOD '20: Proceedings of the 2020 ACM SIGMOD International Conference on Management of DataPages 2287–2301https://doi.org/10.1145/3318464.3386143Entity matching -- the task of clustering duplicated database records to underlying entities -- has become an increasingly critical component in modern data integration management. Amperity provides a platform for businesses to manage customer data that ...
- research-articleMay 2020
A Comprehensive Benchmark Framework for Active Learning Methods in Entity Matching
SIGMOD '20: Proceedings of the 2020 ACM SIGMOD International Conference on Management of DataPages 1133–1147https://doi.org/10.1145/3318464.3380597Entity Matching (EM) is a core data cleaning task, aiming to identify different mentions of the same real-world entity. Active learning is one way to address the challenge of scarce labeled data in practice, by dynamically collecting the necessary ...
Towards Interpretable and Learnable Risk Analysis for Entity Resolution
SIGMOD '20: Proceedings of the 2020 ACM SIGMOD International Conference on Management of DataPages 1165–1180https://doi.org/10.1145/3318464.3380572Machine-learning-based entity resolution has been widely studied. However, some entity pairs may be mislabeled by machine learning models and existing studies do not study the risk analysis problem -- predicting and interpreting which entity pairs are ...
- research-articleMay 2020
Monotonic Cardinality Estimation of Similarity Selection: A Deep Learning Approach
SIGMOD '20: Proceedings of the 2020 ACM SIGMOD International Conference on Management of DataPages 1197–1212https://doi.org/10.1145/3318464.3380570In this paper, we investigate the possibilities of utilizing deep learning for cardinality estimation of similarity selection. Answering this problem accurately and efficiently is essential to many data management applications, especially for query ...