Abstract
Recent large scale genome-wide association studies have been considered to hold promise for unraveling the genetic etiology of complex diseases. It becomes possible now to use these data to assess the influence of interactions from multiple SNPs on a disease. In this paper we formulate the multiple SNP selection problem for determining genetic risk profiles of certain diseases by formulating novel 0/1 IP formulations for this problem, and solving them using a new near-optimal and efficient discrete optimization technique called discretized network flow that has recently been developed by us. One of the highlights of our approach to solving the multiple SNP selection problem is recognizing that there could be different genetic profiles of a disease among the patient population, and it is thus desirable to classify/cluster patients with similar genetic profiles of the disease while simultaneously selecting the right genetic marker sets of the disease for each cluster. This approach coupled with the DNF technique has yielded results for several diseases with some of the highest sensitivities seen so far and specificities that are higher or comparable to state-of-the art techniques, at a fraction of the runtime of these techniques.
This work was supported in part by NSF grants CCR-0204097 and CCF-0811855.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Ahuja, R.K.K., Magnanti, T.L., Orlin, J.B.: Network Flows: Theory, Algorithms, and Applications. Pearson Education, London (1993)
Atamurk, A., Savelsbergh, M.: Integer Programming Software Systems. Annals of Operations Research 140(1), 67–124 (2005)
Brinza, D., Zelikovsky, A.: Combinatorial Analysis of Disease Association and Susceptibility for Rheumatoid Arthritis SNP Data. In: Proc. of 15th Genetic Analysis Workshop (GAW15), pp. 6–11 (2006)
Brinza, D., Zelikovsky, A.: Combinatorial methods for disease association search and susceptibility prediction. In: Bücher, P., Moret, B.M.E. (eds.) WABI 2006. LNCS (LNBI), vol. 4175, pp. 286–297. Springer, Heidelberg (2006)
Brinza, D., Zelikovsky, A.: Design and Validation of Methods Searching for Risk Factors in Genotype Case-Control Studies. Journal of Computational Biology 15, 81–90 (2008)
Daly, M.J., Rioux, J.D., Schaffner, S.F., Hudson, T.J., Lander, E.S.: High-resolution haplotype structure in the human genome. Nat. Genet. 29, 229–232 (2001)
Dutt, S., Ren, H.: Discretized Network Flow Techniques for Timing and Wire-Length Driven Incremental Placement with White-Space Satisfaction. IEEE Trans. of VLSI (accepted for publication, 2008)
Dutt, S., Ren, H., Suthar, V.: A Network-Flow Approach to Timing-Driven Incremental Placement for ASICs. In: Proc. IEEE Int’l Conf. CAD (ICCAD), pp. 375–382 (2006)
Hirschhorn, J.N., Daly, M.J.: Genome-wide association studies for common diseases and complex traits. Nature Rev. Genet. 6, 95 (2005)
Nahapetyan, A., Pardalos, P.: Adaptive Dynamic Cost Updating Procedure for Solving Fixed Charge Network Flow Problems. Computational Optimization and Appl. Jour. 39(1), 37–50 (2008)
Li, J.: A novel strategy for detecting multiple loci in Genome-Wide Association Studies of complex diseases. International Journal of Bioinformatics Research and Applications 4, 150–163 (2008)
Li, J.: Prioritize and Select SNPs for Association Studies with Multi-Stage Designs. Journal of Computational Biology 15, 241–257 (2008)
McCarthy, M.I., et al.: Genome-wide association studies for complex traits: consensus, uncertainty and challenges. Nat. Rev. Genet. 9, 356–369 (2008)
Musani, S.K., et al.: Detection of Gene Gene Interactions in Genome-Wide Association Studies of Human Population Data. Hum Hered 63, 67–84 (2007)
Nelson, M., Kardia, S., Ferrell, R.: A combinatorial partitioning method to identify multi-locus genotypic partitions that predict the quantitative trait variation. Genome Res. 11, 2115 (2001)
Ren, H., Dutt, S.: Algorithms for Simultaneous Consideration of Multiple Physical Synthesis Transforms for Timing Closure. In: Proc. IEEE Int’l Conf. CAD (ICCAD) (accepted for publication) (2008)
Ritchie, M.D., Hahn, L.W., Moore, J.H.: Software for MDR and MDR Permutation Testing module Power of multifactor dimensionality reduction for detecting gene-gene interactions in the presence of genotyping error, missing data, phenocopy, and genetic heterogeneity. Genet Epidemiol. 24, 150–157 (2003)
Spinola, M., et al.: Association of the PDCD5 Locus With Lung Cancer Risk and Prognosis in Smokers. J. Clin. Oncol. 24, 1672–1678 (2006)
Ueda, H., et al.: Association of the T-cell regulatory gene CTLA4 with susceptibility to autoimmune disease. Nature 423, 506–511 (2003)
Zhang, Y., Liu, J.S.: Bayesian inference of epistatic interactions in case-control studies. Nat. Genet. 39, 1167 (2007)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2009 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Dutt, S., Dai, Y., Ren, H., Fontanarosa, J. (2009). Selection of Multiple SNPs in Case-Control Association Study Using a Discretized Network Flow Approach . In: Rajasekaran, S. (eds) Bioinformatics and Computational Biology. BICoB 2009. Lecture Notes in Computer Science(), vol 5462. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-00727-9_21
Download citation
DOI: https://doi.org/10.1007/978-3-642-00727-9_21
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-00726-2
Online ISBN: 978-3-642-00727-9
eBook Packages: Computer ScienceComputer Science (R0)