Lightweight Lexical and Semantic Evidence for Detecting Classes Among Wikipedia Articles

Published: 30 January 2019


A supervised method relies on simple, lightweight features to distinguish Wikipedia articles that are classes (Shield volcano) from other articles (Kilauea). The features are lexical or semantic in nature. Experimental results in multiple languages and over multiple evaluation sets show that the proposed method outperforms previous work.


Author Tags

  1. classes
  2. knowledge acquisition
  3. open-domain information extraction
  4. semantics
  5. topic classification


