Detecting Complex Dependencies in Categorical Data

Oates, Tim; Schmill, Matthew D.; Gregory, Dawn E.; Cohen, Paul R.

doi:10.1007/978-1-4612-2404-4_18

Part of the book series: Lecture Notes in Statistics (LNS, volume 112)

Abstract

Locating and evaluating relationships among values in multiple streams of data is a difficult and important task. Consider the data flowing from monitors in an intensive care unit: readings from various subsets of the monitors are indicative and predictive of certain aspects of the patient’s state. We present an algorithm, Multi-stream Dependency Detection (MSDD), that facilitates the discovery of such predictive relationships and the assessment of their strength. MSDD uses heuristic search to guide exploration of the space of potentially interesting dependencies and uncover those that are significant. We begin by reviewing the dependency detection technique described in [3] and extend it to the multiple-stream case, describing in detail our heuristic search over the space of possible dependencies. A series of experiments with artificially generated data provides quantitative evidence for the utility of our approach. In addition, we present results from applying the algorithm to two real problem domains: feature-based classification and prediction of pathologies in a simulated shipping network.

References

[1] Bennett, K. P. and Mangasarian, O. L. Robust linear programming discrimination of two linearly inseparable sets. In Optimization Methods and Software 1, 1992, 23–34 (Gordon and Breach Science Publishers).
[2] Holte, Robert C. Very simple classification rules perform well on most commonly used datasets. In Machine Learning, (11), pp. 63–91, 1993.
[3] Howe, Adele E. and Cohen, Paul R. Understanding Planner Behavior. To appear in AI Journal, Winter 1995.
[4] Murphy, P. M., and Aha, D. W. UCI Repository of machine learning databases [Machine-readable data repository]. Irvine, CA: University of California, Department of Information and Computer Science, 1994.
[5] Oates, Tim. MSDD as a Tool for Classification. Memo 94-29, Experimental Knowledge Systems Laboratory, Department of Computer Science, University of Massachusetts, Amherst, 1994.
[6] Oates, Tim and Cohen, Paul R. Toward a plan steering agent: Experiments with schedule maintenance. In Proceedings of the Second International Conference on Artificial Intelligence Planning Systems, pp. 134–139, 1994.
[7] Thrun, S. B. The MONK’s problems: A performance comparison of different learning algorithms. Carnegie Mellon University, CMU-CS-91-197, 1991.
[8] Wirth, J. and Catlett, J. Experiments on the costs and benefits of windowing in ID3. In Proceedings of the Fifth International Conference on Machine Learning, pp. 87–99, 1988.
[9] Zheng, Zijian. A benchmark for classifier learning. Basser Department of Computer Science, University of Sydney, NSW.

Author information

Authors and Affiliations

Computer Science Department, LGRC, University of Massachusetts, Box 34610, Amherst, MA, 01003-4610, USA
Tim Oates, Matthew D. Schmill, Dawn E. Gregory & Paul R. Cohen

Editor information

Editors and Affiliations

Department of Computer Science, Vanderbilt University, Box 1679, Station B, Nashville, Tennessee, 37235, USA
Doug Fisher
Department of Economics, Institute of Statistics and Econometrics, Free University of Berlin, 14185, Berlin, Garystraße 21, Germany
Hans-J. Lenz

About this chapter

Cite this chapter

Oates, T., Schmill, M.D., Gregory, D.E., Cohen, P.R. (1996). Detecting Complex Dependencies in Categorical Data. In: Fisher, D., Lenz, HJ. (eds) Learning from Data. Lecture Notes in Statistics, vol 112. Springer, New York, NY. https://doi.org/10.1007/978-1-4612-2404-4_18

DOI: https://doi.org/10.1007/978-1-4612-2404-4_18
Publisher Name: Springer, New York, NY
Print ISBN: 978-0-387-94736-5
Online ISBN: 978-1-4612-2404-4
eBook Packages: Springer Book Archive
