Abstract
In pattern classification, when the feature space is of high dimensionality or patterns are “similar” on a subset of features only, the traditional clustering methods do not show good performance. Biclustering is a class of methods that simultaneously carry out grouping on two dimensions and has many applications to different fields, especially gene expression data analysis. Because of simultaneous classification on both rows and columns of a data matrix, the biclustering problem is inherently intractable and computationally complex. One of the most complex models in biclustering problem is linear coherent model. Several biclustering algorithms based on this model have been proposed in recent years. However, none of them is able to perfectly recognize all linear patterns in a bicluster. In this work, we propose a novel algorithm based on Hough transform that can find all linear coherent patterns. In the sequel we apply it to gene expression data.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Madeira, S.C., Oliveira, A.L.: Biclustering algorithms for biological data analysis: a survey. IEEE Transactions on Computational Biology and Bioinformatics 1, 24–45 (2004)
Zhao, H., Liew, A.W.C., Wang, D.Z., Yan, H.: Biclustering Analysis for Pattern Discovery: Current Techniques, Comparative Studies and Applications. Current Bioinformatics 7(1), 43–55 (2012)
Li, G., Ma, Q., Tang, H., Paterson, A.H., Xu, Y.: QUBIC: a qualitative algorithm for analyses of gene expression data. Nucleic Acids Research 37, e101 (2009)
Serin, A., Vingron, M.: DeBi: discovering differentially expressed biclusters using a frequent itemset approach. Algorithms for Molecular Biology 6, 18 (2011)
Huang, Q., Tao, D., Li, X., Liew, A.W.C.: Parallelized evolutionary learning for detection of biclusters in gene expression data. IEEE/ACM Transactions on Computational Biology and Bioinformatics 9, 560–570 (2012)
Huang, Q., Lu, M., Yan, H.: An evolutionary algorithm for discovering biclusters in gene expression data of breast cancer. In: IEEE Congress on Evolutionary Computation, pp. 829–834 (2008)
Divina, F., Aguilar-Ruiz, J.S.: Biclustering of expression data with evolutionary computation. IEEE Transactions on Knowledge and Data Engineering 18, 590–602 (2006)
Chakraborty, A., Maka, H.: Biclustering of gene expression data using genetic algorithm. In: Proceeding of the IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology, pp. 1–8 (2005)
Hochreiter, S., Bodenhofer, U., Heusel, M., Mayr, A., Mitterecker, A., Kasim, A., Khamiakova, T., Van Sanden, S., Lin, D., Talloen, W., Bijnens, L., Göhlmann, H.W., Shkedy, Z., Clevert, D.A.: FABIA: factor analysis for bicluster acquisition. Bioinformatics 26, 1520–1527 (2010)
Zhao, H., Liew, A.W.C., Xie, X., Yan, H.: A new geometric biclustering algorithm based on the Hough transform for analysis of large-scale microarray data. Journal of Theoretical Biology 251, 264–274 (2007)
Gan, X., Liew, A.W.C., Yan, H.: Discovering biclusters in gene expression data based on high-dimensional linear geometries. BMC Bioinformatics 9, 209 (2008)
Wang, D.Z., Yan, H.: A graph spectrum based geometric biclustering algorithm. Journal of Theoretical Biology 317, 200–211 (2013)
Wang, Z., Yu, C.W., Cheung, R.C.C., Yan, H.: Hypergraph based geometric biclustering algorithm. Pattern Recognition Letters 33, 1656–1665 (2012)
Goldenshluger, A., Zeevi, A.: The Hough transform estimator. Ann. Stat. 32, 1908–1932 (2004)
Illingworth, J., Kittler, J.: A survey of the Hough transform. Comput. Vision Graphics Image Process 44, 87–116 (1988)
Tavazoie, S., Hughes, J.D., Campbell, M.J., Cho, R.J., Church, G.M.: Systematic determination of genetic network architecture. Nat. Genet. 22, 281–285 (1999)
Rosenwald, A., et al.: The use of molecular profiling to predict survival after chemotherapy for diffuse large-B-cell lymphoma. New Engl. J. Med. 346, 1937–1947 (2002)
Ihmels, J., Bergmann, S., Barkai, N.: Defining transcription modules using large-scale gene expression data. Bioinformatics 20, 1993–2003 (2004)
Murali, T.M., Kasif, S.: Extracting conserved gene expression motifs from gene expression data. Pacific Symposium on Biocomputing 8, 77–88 (2003)
Cheng, Y., Church, G.M.: Biclustering of expression data. In: Proceedings of the Eighth International Conference on Intelligent Systems for Molecular Biology, pp. 93–103 (2000)
Kluger, Y., Basri, R., Chang, J.T., Gerstein, M.: Spectral biclustering of microarray data: coclustering genes and conditions. Genome Research 13, 703–716 (2003)
Turner, H., Bailey, T., Krzanowski, W.: Improved biclustering of microarray data demonstrated through systematic performance tests. Computational Statistics and Data Analysis 48, 235–254 (2005)
Ashburner, M., Ball, C.A., et al.: Gene Ontology: tool for the unification of biology. Nature Genetics 25, 25–29 (2000)
Kanehisa, M., Goto, S.: KEGG: Kyoto Encyclopedia of Genes and Genomes. Nucleic Acids Res. 28, 27–30 (2000)
Boyle, E.I., Weng, S., Gollub, J., Jin, H., Botstein, D., Cherry, J.M., Sherlock, G.: GO:TermFider–open source software for accessing Gene Ontology information and finding significantly enriched Gene Ontology terms associated with a list of genes. Bioinformatics 20, 3710–3715 (2004)
Bindea, G., Mlecnik, B., et al.: ClueGO: a Cytoscape plug-in to decipher functionally grouped gene ontology and pathway annotation networks. Bioinformatics 25, 1091–1093 (2009)
Gan, X., Liew, A.W.C., Yan, H.: Biclustering gene expression data based on a high dimensional geometric method. In: Proceedings of the Fourth International Conference on Machine Learning and Cybernetics (ICMLC 2005), vol. 6, pp. 3388–3393 (August 18 – 21, 2005)
Duda, R.O., Hart, P.E.: Use of the Hough transformation to detect lines and curves in pictures. Comm. ACM 15, 11–15 (1972)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
To, C., Nguyen, T.T., Liew, A.WC. (2014). A Hough Transform-Based Biclustering Algorithm for Gene Expression Data. In: Wang, X., Pedrycz, W., Chan, P., He, Q. (eds) Machine Learning and Cybernetics. ICMLC 2014. Communications in Computer and Information Science, vol 481. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-45652-1_11
Download citation
DOI: https://doi.org/10.1007/978-3-662-45652-1_11
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-662-45651-4
Online ISBN: 978-3-662-45652-1
eBook Packages: Computer ScienceComputer Science (R0)