Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/2382936.2382955acmconferencesArticle/Chapter ViewAbstractPublication PagesbcbConference Proceedingsconference-collections
research-article

Discovering geometric patterns in genomic data

Published: 07 October 2012 Publication History

Abstract

ChIP-chip and ChIP-seq are techniques for the isolation and identification of the binding sites of DNA-associated proteins along the genome. Both techniques produce genome-wide location data. The geometric arrangements of these binding sites can provide valuable information about biological function, such as the activation or repression of genes.
In this paper, we formalize this problem and propose a novel graph based algorithm called Patterns of Marks (PoM) to discover efficiently these types of geometric patterns in genomic data. We also describe how we validate the algorithm using experimental data.

References

[1]
http://genepool.bio.ed.ac.uk/illumina/index.html.
[2]
http://www.bionimbus.org:8080/bionimbus/.
[3]
http://www.bioviz.org/igb/.
[4]
http://www.flybase.org/.
[5]
R. Agrawal, T. Imieliński, and A. Swami. Mining association rules between sets of items in large databases. SIGMOD Rec., 22:207--216, June 1993.
[6]
R. Agrawal and R. Srikant. Fast algorithms for mining association rules in large databases. VLDB '94.
[7]
S. Bilodeau, M. H. Kagey, G. M. Frampton, P. B. Rahl, and R. A. Young. Setdb1 contributes to repression of genes encoding developmental regulators and maintenance of es cell state. Genes and Dev., 23(21):2484--2489, 2009.
[8]
D. Chen, A. S. Belmont, and S. Huang. Upstream binding factor association induces large-scale chromatin decondensation. Proc Natl Acad Sci USA, 101:15106--15111, 2004.
[9]
C. Cortes and V. Vapnik. Support-vector networks. Support-Vector Networks, 20, 1995.
[10]
E. B. et al. Regulation by transcription factors in bacteria: beyond description. FEMS Microbiology reviews, 33:133--151, 2009.
[11]
F. T. et al. Flynet: a genomic resource for drosophila melanogaster transcriptional regulatory networks. Bioinformatics, 25:3001--3004, 2009.
[12]
N. N. et al. A cis-regulatory map of the drosophila genome. Nature, 471:527--531, 2011.
[13]
J. Huan, W. Wang, and J. Prins. Efficient mining of frequent subgraphs in the presence of isomorphism. ICDM, 2003.
[14]
A. Inokuchi, T. Washio, and H. Motoda. An apriori-based algorithm for mining frequent substructures from graph data. ECML PKDD, 2000.
[15]
H. H. Kavi and J. A. Birchler. Drosophila kdm2 is a h3k4me3 demethylase regulating nucleolar organization. BMC Research Notes, 2009.
[16]
M. Kuramochi and G. Karypis. Frequent subgraph discovery. ICDM, 2001.
[17]
L. C. Lindeman, C. L. Winata, H. Aanes, S. Mathavan, P. Aleström, and P. Collas. Chromatin states of developmentally-regulated genes revealed by dna and histone methylation patterns in zebrafish embryos. Int. J. Dev. Biol., 54:803--813, 2010.
[18]
S. Nijssen and J. N. Kok. A quickstart in frequent structure mining can make a difference. KDD, 2004.
[19]
H. Willenbrock and D. W. Ussery. Chromatin architecture and gene expression in escherichia coli. Genome Biology, 5:252, 2004.
[20]
X. Yan. Mining, indexing and similarity search in large graph data sets. PhD thesis, Champaign, IL, USA, 2006. AAI3243031.
[21]
X. Yan and J. Han. Gspan: Graph-based substructure pattern mining. ICDM, 2002.

Cited By

View all
  • (2018)Searching for patterns of association between functional genomic elementsProceedings of the International Conference "Mathematical Biology and Bioinformatics"10.17537/icmbb18.82Online publication date: 7-Nov-2018

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
BCB '12: Proceedings of the ACM Conference on Bioinformatics, Computational Biology and Biomedicine
October 2012
725 pages
ISBN:9781450316705
DOI:10.1145/2382936
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 07 October 2012

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. DNA binding sites
  2. geometric pattern
  3. graph mining

Qualifiers

  • Research-article

Conference

BCB' 12
Sponsor:

Acceptance Rates

BCB '12 Paper Acceptance Rate 33 of 159 submissions, 21%;
Overall Acceptance Rate 254 of 885 submissions, 29%

Upcoming Conference

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)1
  • Downloads (Last 6 weeks)0
Reflects downloads up to 10 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2018)Searching for patterns of association between functional genomic elementsProceedings of the International Conference "Mathematical Biology and Bioinformatics"10.17537/icmbb18.82Online publication date: 7-Nov-2018

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media