Article

Free access

Discriminative probabilistic models for relational data

Authors:

Daphne KollerAuthors Info & Claims

UAI'02: Proceedings of the Eighteenth conference on Uncertainty in artificial intelligence

Pages 485 - 492

Published: 01 August 2002 Publication History

PDF eReader Publisher Site

Abstract

In many supervised learning tasks, the entities to be labeled are related to each other in complex ways and their labels are not independent. For example, in hypertext classification, the labels of linked pages are highly correlated. A standard approach is to classify each entity independently, ignoring the correlations between them. Recently, Probabilistic Relational Models, a relational version of Bayesian networks, were used to define a joint probabilistic model for a collection of related entities. In this paper, we present an alternative framework that builds on (conditional) Markov networks and addresses two limitations of the previous approach. First, undirected models do not impose the acyclicity constraint that hinders representation of many important relational dependencies in directed models. Second, undirected models are well suited for discriminative training, where we optimize the conditional likelihood of the labels given the features, which generally improves classification accuracy. We show how to train these models effectively, and how to use approximate probabilistic inference over the learned model for collective classification of multiple related entities. We provide experimental results on a webpage classification task, showing that accuracy can be significantly improved by modeling relational dependencies.

References

[1]

S. Chakrabarti, B. Dom, and P. Indyk. Enhanced hypertext categorization using hyperlinks. In Proc. of ACM SIGMOD98, pages 307-318, 1998.

Digital Library

[2]

M. Craven, D. DiPasquo, D. Freitag, A. McCallum, T. Mitchell, K. Nigam, and S. Slattery. Learning to extract symbolic knowledge from the world wide web. In Proc AAAI 98, pages 509-516, 1998.

Digital Library

[3]

S. Della Pietra, V. Della Pietra, and J. Lafferty. Inducing features of random fields. IEEE Trans. on Pattern Analysis and Machine Intelligence, 19(4):380-393, 1997.

Digital Library

[4]

L. Egghe and R. Rousseau. Introduction to Informetrics. Elsevier, 1990.

[5]

N. Friedman, L. Getoor, D. Koller, and A. Pfeffer. Learning probabilistic relational models. In Proc. IJCAI99, pages 1300- 1309, Stockholm, Sweden, 1999.

Digital Library

[6]

L. Getoor, E. Segal, B. Taskar, and D. Koller. Probabilistic models of text and link structure for hypertext classification. In Proc. IJCAI01 Workshop on Text Learning: Beyond Supervision, Seattle, Wash., 2001.

[7]

T. Joachims. Transductive inference for text classification using support vector machines. In Proc. ICML99, pages 200-209. Morgan Kaufmann Publishers, San Francisco, US, 1999.

Digital Library

[8]

J. M. Kleinberg. Authoritative sources in a hyperlinked environment. Journal of the ACM, 46(5):604-632, 1999.

Digital Library

[9]

D. Koller and A. Pfeffer. Probabilistic frame-based systems. In Proc. AAAI98, pages 580-587, Madison, Wisc., 1998.

Digital Library

[10]

J. Lafferty, A. McCallum, and F. Pereira. Conditional random fields: Probabilistic models for segmenting and labeling sequence data. In Proc. ICML01, 2001.

Digital Library

[11]

T. Minka. Algorithms for maximum-likelihood logistic regression. http://lib.stat.cmu.edu/~minka/papers/logreg.html, 2000.

[12]

K. P. Murphy, Y. Weiss, and M. I. Jordan. Loopy belief propagation for approximate inference: an empirical study. In Proc. UAI99, pages 467-475, 1999.

Digital Library

[13]

J. Neville and D. Jensen. Iterative classification in relational data. In Proc. AAAI-2000 Workshop on Learning Statistical Models from Relational Data, pages 13-20, 2000.

[14]

J. Pearl. Probabilistic Reasoning in Intelligent Systems. Morgan Kaufmann, San Francisco, 1988.

Digital Library

[15]

S. Slattery and T. Mitchell. Discovering test set regularities in relational domains. In Proc. ICML00, pages 895-902, 2000.

Digital Library

[16]

B. Taskar, E. Segal, and D. Koller. Probabilistic classification and clustering in relational data. In Proc. IJCAI01, pages 870- 876, Seattle, Wash., 2001.

Digital Library

[17]

V.N. Vapnik. The Nature of Statistical Learning Theory. Springer-Verlag, New York, New York, 1995.

Digital Library

[18]

Y. Yang, S. Slattery, and R. Ghani. A study of approaches to hypertext categorization. Journal of Intelligent Information Systems , 18(2), 2002.

Digital Library

[19]

J. Yedidia, W. Freeman, and Y. Weiss. Generalized belief propagation. In NIPS, pages 689-695, 2000.

Digital Library

Cited By

Lubarsky YTönshof JGrohe MKimelfeld BFrommholz IHopfgartner FLee MOakes MLalmas MZhang MSantos R(2023)Selecting Walk Schemes for Database EmbeddingProceedings of the 32nd ACM International Conference on Information and Knowledge Management10.1145/3583780.3615052(1677-1686)Online publication date: 21-Oct-2023
https://dl.acm.org/doi/10.1145/3583780.3615052
Qu MTang JWallach HLarochelle HBeygelzimer Ad'Alché-Buc FFox E(2019)Probabilistic logic neural networks for reasoningProceedings of the 33rd International Conference on Neural Information Processing Systems10.5555/3454287.3454980(7712-7722)Online publication date: 8-Dec-2019
https://dl.acm.org/doi/10.5555/3454287.3454980
Dallachiesa MAggarwal CPalpanas T(2019)Improving Classification Quality in Uncertain GraphsJournal of Data and Information Quality10.1145/324209511:1(1-20)Online publication date: 4-Jan-2019
https://dl.acm.org/doi/10.1145/3242095
Show More Cited By

Index Terms

Discriminative probabilistic models for relational data
1. Computing methodologies
  1. Artificial intelligence
    1. Knowledge representation and reasoning
      1. Probabilistic reasoning
      2. Vagueness and fuzzy logic

Recommendations

A relational approach to probabilistic classification in a transductive setting

Transduction is an inference mechanism adopted from several classification algorithms capable of exploiting both labeled and unlabeled data and making the prediction for the given set of unlabeled data only. Several transductive learning methods have ...
Learning Discriminative Sequence Models from Partially Labelled Data for Activity Recognition
PRICAI '08: Proceedings of the 10th Pacific Rim International Conference on Artificial Intelligence: Trends in Artificial Intelligence

Recognising daily activity patterns of people from low-level sensory data is an important problem. Traditional approaches typically rely on generative models such as the hidden Markov models and training on fully labelled data. While activity data can ...
Discriminative Random Fields

In this research we address the problem of classification and labeling of regions given a single static natural image. Natural images exhibit strong spatial dependencies, and modeling these dependencies in a principled manner is crucial to achieve good ...

Comments

Information & Contributors

Information

Published In

cover image Guide Proceedings

UAI'02: Proceedings of the Eighteenth conference on Uncertainty in artificial intelligence

August 2002

584 pages

ISBN:1558608974

Editors:
Adnan Darwiche
Univeristy of California, Los Angeles
,
Nir Friedman
Hebrew University, Jerusalem

Sponsors

Information Extraction and Transportation
AAAI: American Association for Artificial Intelligence
Fair, Isaac and Company, Inc.: Fair, Isaac and Company, Inc.
Boeing Research: Boeing Research
University of Alberta: University of Alberta

Publisher

Morgan Kaufmann Publishers Inc.

San Francisco, CA, United States

Publication History

Published: 01 August 2002

Qualifiers

Article

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

173
Total Citations
View Citations
667
Total Downloads

Downloads (Last 12 months)85
Downloads (Last 6 weeks)9

Reflects downloads up to 12 Jan 2025

Other Metrics

View Author Metrics

Citations

Cited By

Lubarsky YTönshof JGrohe MKimelfeld BFrommholz IHopfgartner FLee MOakes MLalmas MZhang MSantos R(2023)Selecting Walk Schemes for Database EmbeddingProceedings of the 32nd ACM International Conference on Information and Knowledge Management10.1145/3583780.3615052(1677-1686)Online publication date: 21-Oct-2023
https://dl.acm.org/doi/10.1145/3583780.3615052
Qu MTang JWallach HLarochelle HBeygelzimer Ad'Alché-Buc FFox E(2019)Probabilistic logic neural networks for reasoningProceedings of the 33rd International Conference on Neural Information Processing Systems10.5555/3454287.3454980(7712-7722)Online publication date: 8-Dec-2019
https://dl.acm.org/doi/10.5555/3454287.3454980
Dallachiesa MAggarwal CPalpanas T(2019)Improving Classification Quality in Uncertain GraphsJournal of Data and Information Quality10.1145/324209511:1(1-20)Online publication date: 4-Jan-2019
https://dl.acm.org/doi/10.1145/3242095
Motamedi RYeganeh BChandrasekaran BRejaie RMaggs BWillinger W(2019)On Mapping the Interconnections in Today’s InternetIEEE/ACM Transactions on Networking10.1109/TNET.2019.294036927:5(2056-2070)Online publication date: 1-Oct-2019
https://dl.acm.org/doi/10.1109/TNET.2019.2940369
Pech RHao DCheng HZhou T(2019)Enhancing subspace clustering based on dynamic predictionFrontiers of Computer Science: Selected Publications from Chinese Universities10.1007/s11704-018-7128-713:4(802-812)Online publication date: 1-Aug-2019
https://dl.acm.org/doi/10.1007/s11704-018-7128-7
Fan SHuang B(2019)Recurrent collective classificationKnowledge and Information Systems10.1007/s10115-018-1260-460:2(741-755)Online publication date: 1-Aug-2019
https://dl.acm.org/doi/10.1007/s10115-018-1260-4
Chakrabarti DFuniak SChang JMacskassy SChampin PGandon FMédini LLalmas MIpeirotis P(2018)Joint Label Inference in NetworksCompanion Proceedings of the The Web Conference 201810.1145/3184558.3186238(483-487)Online publication date: 23-Apr-2018
https://dl.acm.org/doi/10.1145/3184558.3186238
Gardner JBrooks CPardo ABartimote-Aufflick KLynch GShum SFerguson RMerceron AOchoa X(2018)Coenrollment networks and their relationship to grades in undergraduate educationProceedings of the 8th International Conference on Learning Analytics and Knowledge10.1145/3170358.3170373(295-304)Online publication date: 7-Mar-2018
https://dl.acm.org/doi/10.1145/3170358.3170373
Chakrabarti DFuniak SChang JMacskassy S(2017)Joint label inference in networksThe Journal of Machine Learning Research10.5555/3122009.315301518:1(1941-1979)Online publication date: 1-Jan-2017
https://dl.acm.org/doi/10.5555/3122009.3153015
Fan YZhang YYe Yli XZheng WLim EWinslett MSanderson MFu ASun JCulpepper SLo EHo JDonato DAgrawal RZheng YCastillo CSun ATseng VLi C(2017)Social Media for Opioid Addiction EpidemiologyProceedings of the 2017 ACM on Conference on Information and Knowledge Management10.1145/3132847.3132857(1259-1267)Online publication date: 6-Nov-2017
https://dl.acm.org/doi/10.1145/3132847.3132857
Show More Cited By

View Options

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Media

Figures

Other

Tables

View Table of Contents