Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

A multiclass classification approach for incremental entity resolution on short textual data

Published: 01 January 2021 Publication History

Abstract

Several web applications maintain data repositories containing references to thousands of real-world entities originating from multiple sources, and they continually receive new data. Identifying the distinct entities and associating the correct references to each one is a problem known as entity resolution. The challenge is to solve the problem incrementally, as the data arrive, especially when those data are described by a single textual attribute. In this paper, we propose a new approach for incremental entity resolution. The method we have implemented, called AssocIER, uses an ensemble of multiclass classifiers with self-training and detection of novel classes. We have evaluated our method in various real-world datasets and scenarios, comparing it with a traditional entity resolution approach. The results show that AssocIER is effective and efficient to solve unstructured data in collections with a large number of entities and features, and is able to detect hundreds of novel classes.

Index Terms

  1. A multiclass classification approach for incremental entity resolution on short textual data
          Index terms have been assigned to the content through auto-classification.

          Recommendations

          Comments

          Information & Contributors

          Information

          Published In

          cover image International Journal of Business Intelligence and Data Mining
          International Journal of Business Intelligence and Data Mining  Volume 18, Issue 2
          2021
          134 pages
          ISSN:1743-8195
          EISSN:1743-8187
          DOI:10.1504/ijbidm.2021.18.issue-2
          Issue’s Table of Contents

          Publisher

          Inderscience Publishers

          Geneva 15, Switzerland

          Publication History

          Published: 01 January 2021

          Author Tags

          1. entity resolution
          2. associative classification
          3. incremental learning
          4. novel class detection
          5. self-training

          Qualifiers

          • Research-article

          Contributors

          Other Metrics

          Bibliometrics & Citations

          Bibliometrics

          Article Metrics

          • 0
            Total Citations
          • 0
            Total Downloads
          • Downloads (Last 12 months)0
          • Downloads (Last 6 weeks)0
          Reflects downloads up to 25 Feb 2025

          Other Metrics

          Citations

          View Options

          View options

          Figures

          Tables

          Media

          Share

          Share

          Share this Publication link

          Share on social media