Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.5555/2073876.2073934guideproceedingsArticle/Chapter ViewAbstractPublication PagesConference Proceedingsacm-pubtype
Article
Free access

Discriminative probabilistic models for relational data

Published: 01 August 2002 Publication History

Abstract

In many supervised learning tasks, the entities to be labeled are related to each other in complex ways and their labels are not independent. For example, in hypertext classification, the labels of linked pages are highly correlated. A standard approach is to classify each entity independently, ignoring the correlations between them. Recently, Probabilistic Relational Models, a relational version of Bayesian networks, were used to define a joint probabilistic model for a collection of related entities. In this paper, we present an alternative framework that builds on (conditional) Markov networks and addresses two limitations of the previous approach. First, undirected models do not impose the acyclicity constraint that hinders representation of many important relational dependencies in directed models. Second, undirected models are well suited for discriminative training, where we optimize the conditional likelihood of the labels given the features, which generally improves classification accuracy. We show how to train these models effectively, and how to use approximate probabilistic inference over the learned model for collective classification of multiple related entities. We provide experimental results on a webpage classification task, showing that accuracy can be significantly improved by modeling relational dependencies.

References

[1]
S. Chakrabarti, B. Dom, and P. Indyk. Enhanced hypertext categorization using hyperlinks. In Proc. of ACM SIGMOD98, pages 307-318, 1998.
[2]
M. Craven, D. DiPasquo, D. Freitag, A. McCallum, T. Mitchell, K. Nigam, and S. Slattery. Learning to extract symbolic knowledge from the world wide web. In Proc AAAI 98, pages 509-516, 1998.
[3]
S. Della Pietra, V. Della Pietra, and J. Lafferty. Inducing features of random fields. IEEE Trans. on Pattern Analysis and Machine Intelligence, 19(4):380-393, 1997.
[4]
L. Egghe and R. Rousseau. Introduction to Informetrics. Elsevier, 1990.
[5]
N. Friedman, L. Getoor, D. Koller, and A. Pfeffer. Learning probabilistic relational models. In Proc. IJCAI99, pages 1300- 1309, Stockholm, Sweden, 1999.
[6]
L. Getoor, E. Segal, B. Taskar, and D. Koller. Probabilistic models of text and link structure for hypertext classification. In Proc. IJCAI01 Workshop on Text Learning: Beyond Supervision, Seattle, Wash., 2001.
[7]
T. Joachims. Transductive inference for text classification using support vector machines. In Proc. ICML99, pages 200-209. Morgan Kaufmann Publishers, San Francisco, US, 1999.
[8]
J. M. Kleinberg. Authoritative sources in a hyperlinked environment. Journal of the ACM, 46(5):604-632, 1999.
[9]
D. Koller and A. Pfeffer. Probabilistic frame-based systems. In Proc. AAAI98, pages 580-587, Madison, Wisc., 1998.
[10]
J. Lafferty, A. McCallum, and F. Pereira. Conditional random fields: Probabilistic models for segmenting and labeling sequence data. In Proc. ICML01, 2001.
[11]
T. Minka. Algorithms for maximum-likelihood logistic regression. http://lib.stat.cmu.edu/~minka/papers/logreg.html, 2000.
[12]
K. P. Murphy, Y. Weiss, and M. I. Jordan. Loopy belief propagation for approximate inference: an empirical study. In Proc. UAI99, pages 467-475, 1999.
[13]
J. Neville and D. Jensen. Iterative classification in relational data. In Proc. AAAI-2000 Workshop on Learning Statistical Models from Relational Data, pages 13-20, 2000.
[14]
J. Pearl. Probabilistic Reasoning in Intelligent Systems. Morgan Kaufmann, San Francisco, 1988.
[15]
S. Slattery and T. Mitchell. Discovering test set regularities in relational domains. In Proc. ICML00, pages 895-902, 2000.
[16]
B. Taskar, E. Segal, and D. Koller. Probabilistic classification and clustering in relational data. In Proc. IJCAI01, pages 870- 876, Seattle, Wash., 2001.
[17]
V.N. Vapnik. The Nature of Statistical Learning Theory. Springer-Verlag, New York, New York, 1995.
[18]
Y. Yang, S. Slattery, and R. Ghani. A study of approaches to hypertext categorization. Journal of Intelligent Information Systems , 18(2), 2002.
[19]
J. Yedidia, W. Freeman, and Y. Weiss. Generalized belief propagation. In NIPS, pages 689-695, 2000.

Cited By

View all
  • (2023)Selecting Walk Schemes for Database EmbeddingProceedings of the 32nd ACM International Conference on Information and Knowledge Management10.1145/3583780.3615052(1677-1686)Online publication date: 21-Oct-2023
  • (2019)Probabilistic logic neural networks for reasoningProceedings of the 33rd International Conference on Neural Information Processing Systems10.5555/3454287.3454980(7712-7722)Online publication date: 8-Dec-2019
  • (2019)Improving Classification Quality in Uncertain GraphsJournal of Data and Information Quality10.1145/324209511:1(1-20)Online publication date: 4-Jan-2019
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image Guide Proceedings
UAI'02: Proceedings of the Eighteenth conference on Uncertainty in artificial intelligence
August 2002
584 pages
ISBN:1558608974

Sponsors

  • Information Extraction and Transportation
  • AAAI: American Association for Artificial Intelligence
  • Fair, Isaac and Company, Inc.: Fair, Isaac and Company, Inc.
  • Boeing Research: Boeing Research
  • University of Alberta: University of Alberta

Publisher

Morgan Kaufmann Publishers Inc.

San Francisco, CA, United States

Publication History

Published: 01 August 2002

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)85
  • Downloads (Last 6 weeks)9
Reflects downloads up to 12 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2023)Selecting Walk Schemes for Database EmbeddingProceedings of the 32nd ACM International Conference on Information and Knowledge Management10.1145/3583780.3615052(1677-1686)Online publication date: 21-Oct-2023
  • (2019)Probabilistic logic neural networks for reasoningProceedings of the 33rd International Conference on Neural Information Processing Systems10.5555/3454287.3454980(7712-7722)Online publication date: 8-Dec-2019
  • (2019)Improving Classification Quality in Uncertain GraphsJournal of Data and Information Quality10.1145/324209511:1(1-20)Online publication date: 4-Jan-2019
  • (2019)On Mapping the Interconnections in Today’s InternetIEEE/ACM Transactions on Networking10.1109/TNET.2019.294036927:5(2056-2070)Online publication date: 1-Oct-2019
  • (2019)Enhancing subspace clustering based on dynamic predictionFrontiers of Computer Science: Selected Publications from Chinese Universities10.1007/s11704-018-7128-713:4(802-812)Online publication date: 1-Aug-2019
  • (2019)Recurrent collective classificationKnowledge and Information Systems10.1007/s10115-018-1260-460:2(741-755)Online publication date: 1-Aug-2019
  • (2018)Joint Label Inference in NetworksCompanion Proceedings of the The Web Conference 201810.1145/3184558.3186238(483-487)Online publication date: 23-Apr-2018
  • (2018)Coenrollment networks and their relationship to grades in undergraduate educationProceedings of the 8th International Conference on Learning Analytics and Knowledge10.1145/3170358.3170373(295-304)Online publication date: 7-Mar-2018
  • (2017)Joint label inference in networksThe Journal of Machine Learning Research10.5555/3122009.315301518:1(1941-1979)Online publication date: 1-Jan-2017
  • (2017)Social Media for Opioid Addiction EpidemiologyProceedings of the 2017 ACM on Conference on Information and Knowledge Management10.1145/3132847.3132857(1259-1267)Online publication date: 6-Nov-2017
  • Show More Cited By

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media