Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/1247069.1247125acmconferencesArticle/Chapter ViewAbstractPublication PagessocgConference Proceedingsconference-collections
Article

A geometric framework for solving subsequence problems in computational biology efficiently

Published: 06 June 2007 Publication History

Abstract

In this paper, we introduce the notion of a constrained Minkowski sumwhich for two (finite) point-sets P,Q⊆ R2 and a set of k inequalities Ax≥ b is defined as the point-set (P ⊕ Q)Ax≥ b= x = p+q | ∈ P, q ∈ Q, Ax ≥ b. We show that typical subsequenceproblems from computational biology can be solved by computing a setcontaining the vertices of the convex hull of an appropriatelyconstrained Minkowski sum. We provide an algorithm for computing such a setwith running time O(N log N), where N=|P|+|Q| if k is fixed. For the special case (P⊕ Q)x1≥ β, where P and Q consistof points with integer x1-coordinates whose absolute values arebounded by O(N), we even achieve a linear running time O(N). Wethereby obtain a linear running time for many subsequence problemsfrom the literature and improve upon the best known running times forsome of them.The main advantage of the presented approach is that it provides a generalframework within which a broad variety of subsequence problems canbe modeled and solved.This includes objective functions and constraintswhich are even more complexthan the ones considered before.

References

[1]
P.K. Agarwal, N. Amenta, and M. Sharir. Largest placement of one convex polygon inside another. Discrete & Computational Geometry, 19(1):95--104, 1998.
[2]
L. Allison. Longest biased interval and longest non-negative sum interval. Bioinformatics Application Note, 19(10):1294--1295, 2003.
[3]
M. Ben-Or. Lower bounds for algebraic computation trees. In Proc. of the 15th Annual ACM Symposium on Theory of Computing (STOC '83), pages 80--86, 1983.
[4]
T. Bernholt and T. Hofmeister. An algorithm for a generalized maximum subsequence problem. In Proc. of the 7th Latin American Symposium on Theoretical Informatics(LATIN 2006), pages 178--189, 2006.
[5]
L.P. Chew and K. Kedem. A convex polygon among polygonal obstacles: placement and high--clearance motion. Computational Geometry, 3(2):59--89, 1993.
[6]
T.H. Cormen, C.E. Leiserson, R.L. Rivest, and C. Stein. Introduction to Algorithms (Second Edition). MIT Press & McGraw--Hill, 2001.
[7]
P.L. Davies and A. Kovac. Local extremes, runs, strings and multiresolution (with discussion). Annals of Statistics, 29(1):1--65, 2001.
[8]
M. de Berg, M. van Kreveld, M. Overmars, and O. Schwarzkopf. Computational Geometry Algorithms and Applications. Springer, 1997.
[9]
T.-H. Fan, S. Lee, H.-I. Lu, T.-S. Tsou, T.C. Wang, and A. Yao. An optimal algorithm for maximum-sum segment and its application in bioinformatics. In Proc. of the 8th International Conference on Implementation and Application of Automata (CIAA 2003), pages 251--257, 2003.
[10]
M.H. Goldwasser, M.-Y. Kao, and H.-I. Lu. Linear-time algorithms for computing maximum-density sequence segments with bioinformatics applications. Journal of Computer and System Sciences, 70(2):128--144, 2005.
[11]
L. Guibas, L. Ramshaw, and J. Stolfi. A kinetic framework for computational geometry. In Proc. of the 24th IEEE Symposium on the Foundations of Computer Science (FOCS '83), pages 100--111, 1983.
[12]
L.J. Guibas, M. Sharir, and S. Sifrony. On the general motion--planning problem with two degrees of freedom. Discrete & Computational Geometry, 4(5):491--521, 1989.
[13]
J.C. Latombe. Robot Motion Planning. Kluwer, 1991.
[14]
Y.-L. Lin, T. Jiang, and K.-M. Chao. Efficient algorithms for locating the length-constrained heaviest segments with applications to biomolecular sequence analysis. Journal of Computer and System Sciences, 65(3):570--586, 2002.
[15]
D. Lipson, Y. Aumann, A. Ben-Dor, N. Linial, and Z. Yakhini. Efficient calculation of interval scores for DNA copy numbers. In Proc. of the 9th International Conference on Research in Computational Molecular Biology (RECOMB 2005), pages 83--100, 2005.
[16]
T. Lozano-Perez and M.A. Wesley. An algorithm for planning collision-free paths among polyhedral obstacles. Communications of the ACM, 22:560--570, 1979.
[17]
J. Pach and P.K. Agarwal. Combinatorial geometry. Wiley--Interscience Publication, 1995.
[18]
E. Ramos, 2006. Personal communication.
[19]
G.T. Toussaint. Solving geometric problems with the rotating calipers. In Proc. of the 2nd IEEE Mediterranean Electrotechnical Conference (MELECON '83), pages A10.02/1--4, 1983.

Cited By

View all
  • (2018)Finding long and similar parts of trajectoriesComputational Geometry: Theory and Applications10.1016/j.comgeo.2011.05.00444:9(465-476)Online publication date: 29-Dec-2018
  • (2018)On Locating Disjoint Segments with Maximum Sum of DensitiesAlgorithmica10.1007/s00453-007-9122-654:1(107-117)Online publication date: 31-Dec-2018
  • (2009)Finding long and similar parts of trajectoriesProceedings of the 17th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems10.1145/1653771.1653813(296-305)Online publication date: 4-Nov-2009
  • Show More Cited By

Index Terms

  1. A geometric framework for solving subsequence problems in computational biology efficiently

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      SCG '07: Proceedings of the twenty-third annual symposium on Computational geometry
      June 2007
      404 pages
      ISBN:9781595937056
      DOI:10.1145/1247069
      • Program Chair:
      • Jeff Erickson
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Sponsors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 06 June 2007

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. Minkowski sums
      2. computational biology
      3. quasiconvex functions
      4. subsequence problems

      Qualifiers

      • Article

      Acceptance Rates

      Overall Acceptance Rate 625 of 1,685 submissions, 37%

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)2
      • Downloads (Last 6 weeks)0
      Reflects downloads up to 22 Sep 2024

      Other Metrics

      Citations

      Cited By

      View all
      • (2018)Finding long and similar parts of trajectoriesComputational Geometry: Theory and Applications10.1016/j.comgeo.2011.05.00444:9(465-476)Online publication date: 29-Dec-2018
      • (2018)On Locating Disjoint Segments with Maximum Sum of DensitiesAlgorithmica10.1007/s00453-007-9122-654:1(107-117)Online publication date: 31-Dec-2018
      • (2009)Finding long and similar parts of trajectoriesProceedings of the 17th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems10.1145/1653771.1653813(296-305)Online publication date: 4-Nov-2009
      • (2009)Optimal algorithms for the average-constrained maximum-sum segment problemInformation Processing Letters10.1016/j.ipl.2008.09.024109:3(171-174)Online publication date: 1-Jan-2009
      • (2008)CNVDetector: locating copy number variations using array CGH dataBioinformatics10.1093/bioinformatics/btn51724:23(2773-2775)Online publication date: 7-Nov-2008
      • (2008)A geometric interpretation of the multiresolution criterion in nonparametric regressionJournal of Nonparametric Statistics10.1080/1048525080236099420:7(599-609)Online publication date: Oct-2008
      • (2008)Minkowski Sum Selection and FindingProceedings of the 19th International Symposium on Algorithms and Computation10.1007/978-3-540-92182-0_42(460-471)Online publication date: 15-Dec-2008
      • (2007)Algorithms for computing the length-constrained max-score segments with applications to DNA copy number data analysisProceedings of the 18th international conference on Algorithms and computation10.5555/1781574.1781667(834-845)Online publication date: 17-Dec-2007
      • (2007)Algorithms for Computing the Length-Constrained Max-Score Segments with Applications to DNA Copy Number Data AnalysisAlgorithms and Computation10.1007/978-3-540-77120-3_72(834-845)Online publication date: 2007

      View Options

      Get Access

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media