Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.5555/1053072.1053091guideproceedingsArticle/Chapter ViewAbstractPublication PagesConference Proceedingsacm-pubtype
Article

Document classification through interactive supervision of document and term labels

Published: 20 September 2004 Publication History

Abstract

Effective incorporation of human expertise, while exerting a low cognitive load, is a critical aspect of real-life text classification applications that is not adequately addressed by batch-supervised high-accuracy learners. Standard text classifiers are supervised in only one way: assigning labels to whole documents. They are thus deprived of the enormous wisdom that humans carry about the significance of words and phrases in context. We present HIClass, an interactive and exploratory labeling package that actively collects user opinion on feature representations and choices, as well as whole-document labels, while minimizing redundancy in the input sought. Preliminary experience suggests that, starting with essentially an unlabeled corpus, very little cognitive labor suffices to set up a labeled collection on which standard classifiers perform well.

Cited By

View all
  • (2015)Utility-Theoretic Ranking for Semiautomated Text ClassificationACM Transactions on Knowledge Discovery from Data10.1145/274254810:1(1-32)Online publication date: 22-Jul-2015
  • (2013)Live and learn from mistakesInformation Processing and Management: an International Journal10.1016/j.ipm.2012.02.00149:1(83-98)Online publication date: 1-Jan-2013
  • (2012)A utility-theoretic ranking method for semi-automated text classificationProceedings of the 35th international ACM SIGIR conference on Research and development in information retrieval10.1145/2348283.2348411(961-970)Online publication date: 12-Aug-2012
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image Guide Proceedings
PKDD '04: Proceedings of the 8th European Conference on Principles and Practice of Knowledge Discovery in Databases
September 2004
558 pages

Publisher

Springer-Verlag

Berlin, Heidelberg

Publication History

Published: 20 September 2004

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 10 Feb 2025

Other Metrics

Citations

Cited By

View all
  • (2015)Utility-Theoretic Ranking for Semiautomated Text ClassificationACM Transactions on Knowledge Discovery from Data10.1145/274254810:1(1-32)Online publication date: 22-Jul-2015
  • (2013)Live and learn from mistakesInformation Processing and Management: an International Journal10.1016/j.ipm.2012.02.00149:1(83-98)Online publication date: 1-Jan-2013
  • (2012)A utility-theoretic ranking method for semi-automated text classificationProceedings of the 35th international ACM SIGIR conference on Research and development in information retrieval10.1145/2348283.2348411(961-970)Online publication date: 12-Aug-2012
  • (2011)Closing the loopProceedings of the Conference on Empirical Methods in Natural Language Processing10.5555/2145432.2145588(1467-1478)Online publication date: 27-Jul-2011
  • (2011)A non-negative matrix factorization based approach for active dual supervision from document and word labelsProceedings of the Conference on Empirical Methods in Natural Language Processing10.5555/2145432.2145536(949-958)Online publication date: 27-Jul-2011
  • (2011)Large-scale hierarchical text classification without labelled dataProceedings of the fourth ACM international conference on Web search and data mining10.1145/1935826.1935919(685-694)Online publication date: 9-Feb-2011
  • (2010)A unified approach to active dual supervision for labeling features and examplesProceedings of the 2010 European conference on Machine learning and knowledge discovery in databases: Part I10.5555/1888258.1888269(40-55)Online publication date: 20-Sep-2010
  • (2010)CiteDataProceedings of the 19th ACM international conference on Information and knowledge management10.1145/1871437.1871509(549-558)Online publication date: 26-Oct-2010
  • (2010)A unified approach to active dual supervision for labeling features and examplesProceedings of the 2010th European Conference on Machine Learning and Knowledge Discovery in Databases - Volume Part I10.1007/978-3-642-15880-3_9(40-55)Online publication date: 20-Sep-2010
  • (2009)Active dual supervisionProceedings of the NAACL HLT 2009 Workshop on Active Learning for Natural Language Processing10.5555/1564131.1564142(49-57)Online publication date: 5-Jun-2009
  • Show More Cited By

View Options

View options

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media