Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3341105.3374134acmconferencesArticle/Chapter ViewAbstractPublication PagessacConference Proceedingsconference-collections
poster

Characterization of long-lived and non-long lived profiles through biclustering

Published: 30 March 2020 Publication History

Abstract

The understanding of the variables that influence the human ageing process can help in the development of public policies that improve the quality of life of the elderly population. In previous experiments on a human ageing dataset, we applied biclustering techniques on the binary data in order to find long-lived and non-long lived profiles, but we only found long-lived profiles. Then in this work, we propose to use Factor Analysis to represent this data in reduced dimensionality, generating three datasets where the variables with high correlation with each other belong to the same factor. We observed that some variables have a high correlation with each other in the three datasets, allowing them to be grouped into blocks of correlated variables. Posteriorly we applied biclustering on these datasets and validate the results using p-Value measure, Jaccard similarity, and the priori knowledge of the classes. In this way, we found factors belonging only to non-long lived biclusters and biclusters representatives of both profiles.

References

[1]
Sven Bergmann, Jan Ihmels, and Naama Barkai. 2003. Iterative signature algorithm for the analysis of large-scale gene expression data. Physical review E 67, 3 (2003), 031902.
[2]
Raymond B. Cattell. 1966. The Scree Test For The Number Of Factors. Multivariate Behavioral Research 1, 2 (1966), 245--276.
[3]
Yizong Cheng and George M. Church. 2000. Biclustering of Expression Data. In Proceedings of the Eighth International Conference on Intelligent Systems for Molecular Biology. AAAI Press, 93--103.
[4]
Gabor Csardi, Zoltan Kutalik, and Sven Bergmann. 2010. Modular analysis of gene expression data with R. Bioinformatics 26 (2010), 1376--7.
[5]
Charles D Dziuban and Edwin C Shirkey. 1974. When is a correlation matrix appropriate for factor analysis? Some decision rules. Psychological bulletin 81, 6 (1974), 358.
[6]
Kemal Eren, Mehmet Deveci, Onur Küçüktunç, and Ümit V Çatalyürek. 2012. A comparative analysis of biclustering algorithms for gene expression data. Briefings in bioinformatics 14, 3 (2012), 279--292.
[7]
J.I. Galbraith, I. Moustaki, D.J. Bartholomew, and F. Steele. 2002. The Analysis and Interpretation of Multivariate Data for Social Scientists. Taylor & Francis.
[8]
Alexander Hinneburg, Charu C. Aggarwal, and Daniel A. Keim. 2000. What Is the Nearest Neighbor in High Dimensional Spaces?. In Proceedings of the 26th International Conference on Very Large Data Bases (VLDB '00). Morgan Kaufmann Publishers Inc., 506--515.
[9]
John L. Horn. 1965. A rationale and test for the number of factors in factor analysis. Psychometrika 30, 2 (01 Jun 1965), 179--185.
[10]
Sebastian Kaiser, Rodrigo Santamaria, Tatsiana Khamiakova, Martin Sill, Roberto Theron, Luis Quintales, Friedrich Leisch, and Ewoud De Troyer. 2018. biclust: BiCluster Algorithms. https://CRAN.R-project.org/package=biclust R package version 2.0.1.
[11]
Sara C. Madeira and Arlindo L. Oliveira. 2004. Biclustering Algorithms for Biological Data Analysis: A Survey. IEEE/ACM Trans. Comput. Biol. Bioinformatics 1, 1 (Jan. 2004), 24--45.
[12]
Sueli Aparecida Mingoti. 2005. Análise de dados através de métodos de estatística multivariada uma abordagem aplicada. UFMG, Belo Horizonte.
[13]
Amela Prelić, Stefan Bleuler, Philip Zimmermann, Anja Wille, Peter Bühlmann, Wilhelm Gruissem, Lars Hennig, Lothar Thiele, and Eckart Zitzler. 2006. A systematic comparison and evaluation of biclustering methods for gene expression data. Bioinformatics 22, 9 (2006), 1122.
[14]
Gilles Raiche. 2010. an R package for parallel analysis and non graphical solutions to the Cattell scree test. http://CRAN.R-project.org/package=nFactors R package version 2.3.3.
[15]
William Revelle. 2018. psych: Procedures for Psychological, Psychometric, and Personality Research. Northwestern University, Evanston, Illinois. https://CRAN.R-project.org/package=psych R package version 1.8.12.
[16]
Caio Eduardo Ribeiro, Luis Henrique S. Brito, Cristiane Neri Nobre, Alex A. Freitas, and Luis Enrique Zárate. 2017. A revision and analysis of the comprehensiveness of the main longitudinal studies of human aging for data mining research. Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery 7, 3 (2017), e1202.
[17]
Caio Eduardo Ribeiro and Luis Zárate. 2018. Classifying longevity profiles through longitudinal data mining. Expert Systems with Applications 117 (09 2018).
[18]
Haifa Ben Saber and Mourad Elloumi. 2014. A comparative study of clustering and biclustering of microarray data. International Journal of Computer Science & Information Technology 6, 6 (2014), 93.

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
SAC '20: Proceedings of the 35th Annual ACM Symposium on Applied Computing
March 2020
2348 pages
ISBN:9781450368667
DOI:10.1145/3341105
Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 30 March 2020

Check for updates

Author Tags

  1. biclustering
  2. factor analysis
  3. features reduction
  4. human ageing
  5. longevity

Qualifiers

  • Poster

Funding Sources

  • Conselho Nacional de Desenvolvimento Científico e Tecnológico
  • Fundação de Amparo à Pesquisa do Estado de Minas Gerais
  • Coordenação de Aperfeiçoamento de Pessoal de Nível Superior

Conference

SAC '20
Sponsor:
SAC '20: The 35th ACM/SIGAPP Symposium on Applied Computing
March 30 - April 3, 2020
Brno, Czech Republic

Acceptance Rates

Overall Acceptance Rate 1,650 of 6,669 submissions, 25%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 88
    Total Downloads
  • Downloads (Last 12 months)2
  • Downloads (Last 6 weeks)1
Reflects downloads up to 03 Oct 2024

Other Metrics

Citations

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media