Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Data Integration in Multi-dimensional Data Sets: Informational Asymmetry in the Valid Correlation of Subdivided Samples

  • Conference paper
Biological and Medical Data Analysis (ISBMDA 2006)

Abstract

Background: Flow cytometry is the only currently available high throughput technology that can measure multiple physical and molecular characteristics of individual cells. It is common in flow cytometry to measure a relatively large number of characteristics or features by performing separate experiments on subdivided samples. Correlating data from multiple experiments using certain shared features (e.g. cell size) could provide useful information on the combination pattern of the not shared features. Such correlation, however, are not always reliable. Methods: We developed a method to assess the correlation reliability by estimating the percentage of cells that can be unambiguously correlated between two samples. This method was evaluated using 81 pairs of subdivided samples of microspheres (artificial cells) with known molecular characteristics. Results: Strong correlation (R=0.85) was found between the estimated and actual percentage of unambiguous correlation. Conclusion: The correlation reliability we developed can be used to support data integration of experiments on subdivided samples.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Similar content being viewed by others

References

  1. Shapiro, H.M.: Practical Flow Cytometry, 4th edn. Wiley-Liss Inc., New York (2002)

    Google Scholar 

  2. Saeed, M., Mark, R.G.: Efficient hemodynamic event detection utilizing relational databases and wavelet analysis. Comput. Cardiol 28, 153–156 (2001)

    Google Scholar 

  3. Chu, S.C., Thom, J.B.: Database issues in object-oriented clinical information systems design. Stud. Health Technol. Inform. 46, 376–382 (1997)

    Google Scholar 

  4. Montgomery Jr., E.B., Huang, H., Assadi, A.: Unsupervised clustering algorithm for N-dimensional data. J. Neurosci. Methods 144(1), 19–24 (2005)

    Article  Google Scholar 

  5. Ben-Hur, A., Elisseeff, A., Guyon, I.: A stability based method for discovering structure in clustered data. In: Pac. Symp. Biocomput., pp. 6–17 (2002)

    Google Scholar 

  6. Dudoit, S., Fridlyand, J.: A prediction-based resampling method for estimating the number of clusters in a dataset. Genome. Biol. 3(7) (2002); RESEARCH0036

    Google Scholar 

  7. Lange, T., Roth, V., Braun, M.L., Buhmann, J.: Stability-based validation of clustering solutions. Neural Comput. 16(6), 1299–1323 (2004)

    Article  MATH  Google Scholar 

  8. Lee, S., Crawford, M.M.: Unsupervised multistage image classification using hierarchical clustering with a Bayesian similarity measure. IEEE Trans Image Process 14(3), 312–320 (2005)

    Article  MathSciNet  Google Scholar 

  9. Chaudhuri, P., Marron, J.S.: SiZer for exploration of structures in curves. Journal of the American Statistical Association 94, 807–823 (1999)

    Article  MATH  MathSciNet  Google Scholar 

  10. Duda, R.O., Hart, P.E., Stork, D.G.: Pattern Classification, 2nd edn. John Wiley & Sons, Inc, Chichester (2001)

    MATH  Google Scholar 

  11. Brecher, G., Ms, M., Williams, G.Z.: Evaluation of electronic red blood cell counter. Am.J.Clin.Pathol. 26, 1439–1449 (1956)

    Google Scholar 

  12. Young, I.T.: Proof without prejudice: use of the Kolmogorov-Smirnov test for the analysis of histograms from flow systems and other sources. J. Histochem. Cytochem. 25(7), 935–941 (1977)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2006 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Zeng, Q.T. et al. (2006). Data Integration in Multi-dimensional Data Sets: Informational Asymmetry in the Valid Correlation of Subdivided Samples. In: Maglaveras, N., Chouvarda, I., Koutkias, V., Brause, R. (eds) Biological and Medical Data Analysis. ISBMDA 2006. Lecture Notes in Computer Science(), vol 4345. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11946465_38

Download citation

  • DOI: https://doi.org/10.1007/11946465_38

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-68063-5

  • Online ISBN: 978-3-540-68065-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics