International audienceFinding a sequence of transpositions that transforms a given permutation into the identity permutation and is of the shortest possible length is an important problem in bioinformatics. Here, a transposition consists in exchanging two contiguous intervals of the permutation. Bafna and Pevzner introduced the cycle graph as a tool for working on this problem. In particular, they took advantage of the decomposition of the cycle graph into so-called alternating cycles. Later, Hultman raised the question of determining the number of permutations with a cycle graph containing a given quantity of alternating cycles. The resulting number is therefore similar to the Stirling number of the first kind. We provide an explicit formula for computing what we call the Hultman numbers, and give a few numerical values. We also derive formulae for related cases, as well as for a much more general problem. Finally, we indicate a counting result related to another operation on permu...

Publisher: 'University of Waterloo'

Publication Date: Jun 1, 2007

Research Interests:
Bioinformatics, Mathematics, Applied Mathematics, Combinatorics, Pure Mathematics, and 2 moreNumerical Analysis and Computational Mathematics and Integer sequences

Download (.pdf)

Publication Date: 2013

Download (.pdf)

The prefix exchange distance of a permutation is the length of its shortest factorisation into transpositions that all contain 1. Using a probabilistic approach, we obtain expressions for the mean and the variance, and prove the asymptotic normality of the distribution of this distance for a random permutation verifying the Ewens sampling formula. Analogous results in the uniform setting follow as simple corollaries.

Publisher: Discret. Math.

Publication Date: 2021

Publication Name: Discret. Math.

Research Interests:
Mathematics, Computer Science, Combinatorics, Discrete Mathematics, Random permutation generation, and 2 morePermutation and Central Limit Theorem

Download (.pdf)

Publisher: Springer Science and Business Media LLC

Publication Date: 2019

Publication Name: Theory of Computing Systems

Research Interests:
Applied Mathematics, Computer Science, Distributed Computing, Data Structures and Algorithms, Internet shopping, and 2 moreMathematical Proof and arXiv

Download (.pdf)

Publisher: Elsevier BV

Publication Date: 2017

Publication Name: Discrete Applied Mathematics

Research Interests:
Mathematics, Applied Mathematics, Computer Science, Phylogenetic Networks, Phylogenetic Trees, and 3 moreTime Complexity, Phylogenetic Tree, and Discrete Applied Mathematics

Download (.pdf)

Publisher: Springer International Publishing

Publication Date: 2016

Publication Name: Lecture Notes in Computer Science

Research Interests:
Mathematics, Computer Science, Computational Complexity, Combinatorics, NP complete problems, and 3 moreCombinatorial mathematics, Graph Decomposition, and Springer Ebooks

Download (.pdf)

Publisher: Elsevier BV

Publication Date: 2016

Publication Name: Advances in Applied Mathematics

Research Interests:
Mathematics, Applied Mathematics, Computer Science, Combinatorics, DISTRIBUTION, and 10 moreDistance, Prefix, asymptotic Analysis, Random permutation generation, Distribution, Permutation Edit Distance, Permutation, Mathematical Proof, Asymptotic distribution, and Asymptotic normality

Download (.pdf)

Publisher: Springer International Publishing

Publication Date: 2016

Publication Name: Lecture Notes in Computer Science

Research Interests:
Mathematics, Computer Science, Algorithms, and Phylogenetics

Download (.pdf)

Publication Date: 2015

Publication Name: Lecture Notes in Computer Science

Research Interests:
Mathematics, Computer Science, Biology, Phylogenetic Networks, Computational linguistic phylogenetics, and 3 morePhylogenetic Tree, Computational Molecular Biology, and arXiv

Download (.pdf)

Publisher: Society for Industrial & Applied Mathematics (SIAM)

Publication Date: 2013

Publication Name: SIAM Journal on Discrete Mathematics

Research Interests:
Mathematics, Computer Science, Combinatorics, Pure Mathematics, Discrete Mathematics, and 9 moreSorting Algorithms, Prefix, Transpositions, Interconnection Networks Design, Symmetric group, Permutations, Genomic Rearrangements, Lower bounds, and Edit Distance

Download (.pdf)

Publisher: Institute of Electrical and Electronics Engineers (IEEE)

Publication Date: 2014

Publication Name: IEEE/ACM Transactions on Computational Biology and Bioinformatics

Research Interests:
Computer Science, Algorithms, SAT Solver Design, Phylogenetics, Computational Biology, and 8 moreMolecular Evolution, Medicine, Biological Sciences, Phylogeny, Mathematical Sciences, Heuristic, NP-Hardness, and Epigraph

Download (.pdf)

Publisher: Elsevier BV

Publication Date: 2013

Publication Name: Discrete Applied Mathematics

Research Interests:
Bioinformatics, Mathematics, Applied Mathematics, Computer Science, Statistics, and 10 moreCombinatorics, Discrete Mathematics, Permutations, Genome Rearrangement, Genome Rearrangements, Permutation, Discrete Applied Mathematics, Mathematical Proof, Expected Value, and Breakpoint

Download (.pdf)

Publication Date: 2016

Publication Name: Lecture Notes in Computer Science

Research Interests:
Bioinformatics, Mathematics, Applied Mathematics, Computer Science, Algorithms, and 9 moreCombinatorics, Transposition, Permutations, Sorting, Sorting Algorithm, Genome Rearrangement, Genome Rearrangements, SORT, and Permutation

Download (.pdf)

In this paper, we study the problem of sorting unichromosomal linear genomes by prefix double-cut-and-joins (or DCJs) in both the signed and the unsigned settings. Prefix DCJs cut the leftmost segment of a genome and any other segment, and recombine the severed endpoints in one of two possible ways: one of these options corresponds to a prefix reversal, which reverses the order of elements between the two cuts (as well as their signs in the signed case). Depending on whether we consider both options or reversals only, our main results are: (1) new structural lower bounds based on the breakpoint graph for sorting by unsigned prefix reversals, unsigned prefix DCJs, or signed prefix DCJs; (2) a polynomial-time algorithm for sorting by signed prefix DCJs, thus answering an open question in [8]; (3) a 3/2-approximation for sorting by unsigned prefix DCJs, which is, to the best of our knowledge, the first sorting by prefix rearrangements problem that admits an approximation ratio strictly smaller than 2 (with the obvious exception of the polynomial-time solvable problems); and finally, (4) an FPT algorithm for sorting by unsigned prefix DCJs parameterised by the number of breakpoints in the genome.

Publication Date: 2022

Publication Name: SPIRE

Research Interests:
Genome Rearrangements

Download (.pdf)

Publisher: Wiley

Publication Name: Journal of Graph Theory

Research Interests:
Graph Theory and Pure Mathematics

We study two problems motivated by computational biology: genome rearrangements, which under some assumptions can be recast as the problem of sorting a permutation (therefore viewed as a linear ordering) using as few allowed moves as possible, and the construction of haplotype networks, which generalise haplotype trees in that they allow multiple paths between species. Our main contributions are:• new upper bounds and formulae for computing the exact transposition distance of many permutations (a problem of unknown ...

Publisher: hal.archives-ouvertes.fr

Publication Date: Sep 12, 2008

Download (.pdf)

This paper reports on the use of the FO (·) language and the IDP framework for modeling and solving some machine learning and data mining tasks. The core component of a model in the IDP framework is an FO (·) theory consisting of formulas in first order logic and definitions; the latter are basically logic programs where clause bodies can have arbitrary first order formulas. Hence, it is a small step for a well-versed computer scientist to start modeling. We describe some models resulting from the collaboration between IDP ...

Publication Date: 2012

Research Interests:
Machine Learning

Download (.pdf)

We initiate the study of sorting permutations using prefix block-interchanges, which exchange any prefix of a permutation with another non-intersecting interval. The goal is to transform a given permutation into the identity permutation using as few such operations as possible. We give a 2-approximation algorithm for this problem, show how to obtain improved lower and upper bounds on the corresponding distance, and determine the largest possible value for that distance. 2012 ACM Subject Classification Theory of computation → Design and analysis of algorithms The problem of transforming two sequences into one another using a specified set of operations has received a lot of attention in the last decades, with applications in computational biology as (genome) rearrangement problems [13] as well as interconnection network design [21]. In the context of permutations, it can be equivalently formulated as follows: given a permutation π of [n] = {1, 2,. .. , n} and a generating set S (also consisting of permutations of [n]), find a minimum-length sequence of elements from S that sorts π. The problem is known to be NP-hard in general [15] and W[1]-hard when parameterised by the length of a solution [6], but some families of operations that are important in applications lead to problems that can be solved in polynomial time (e.g. exchanges [17], block-interchanges [10] and signed reversals [14]), while other families yield hard problems that admit good approximations (e.g. 11/8 for reversals [3] and for block-transpositions [12]). Several restrictions of these families have also been studied, one of which stands out in the field of interconnection network design: the so-called prefix constraint, which forces operations to act on a prefix of the permutation rather than on an arbitrary interval. Those restrictions were introduced as a way of reducing the size of the generated network while maintaining a low value for its diameter, thereby guaranteeing a low maximum communication delay [21]. The most famous example is perhaps the restriction of reversals (which reverse the order of elements along an interval) to prefix reversals, and the corresponding problem known as pancake flipping, introduced in [16] and whose complexity was only settled thirty years later [5]. As Table 1 shows (see [13] for undefined terms), although sorting problems using interval transformations are now fairly well understood, progress on the corresponding prefix sorting problems has been lacking, with only two families whose status has been settled and no approximation ratio smaller than 2 for those problems not known to be in P. As a result, while the topology of the Cayley graph generated by those operations might present attractive properties, efficient routing algorithms (which achieve exactly the same task as the sorting algorithms in genome rearrangements) are still needed for the network to be of practical interest.

Publication Date: 2020

Publication Name: ISAAC

Research Interests:
Genome Rearrangements

Download (.pdf)

And 25 more

Talks

Research Interests:
Sorting Algorithm, Genome Rearrangements, and Permutation

Download (.pdf)

We study the problem of computing the minimal number of adjacent, non-intersecting block interchanges required to transform a permutation into the identity permutation. In particular, we use the graph of a permutation to compute that number for a particular class of permutations in linear time and space, and derive a new tight upper bound on the so-called transposition distance.

Event Date: Oct 2005

Research Interests:
Algorithms, Combinatorics, and Genome Rearrangement

Download (.pdf)

Sorting by transpositions consists in finding a shortest sequence of interval displacements that sorts a given permutation; the length of such a sequence is referred to as the permutation's distance (to the identity permutation). The computational complexity of the sorting problem, as well as that of computing the distance, is unknown, as is the largest value that this distance can reach. We present a novel approach that allows us to give a simple characterisation of a class of permutations whose distance can be computed in linear time and space, and give a new general upper bound on the distance of any permutation. Modifying the structure of the aforementioned permutations allows us to derive other tractable classes, which we also describe.

More Info: Talk originally in French

Event Date: Jan 2006

Research Interests:
Algorithms, Combinatorics, and Genome Rearrangement

Download (.pdf)

Genome rearrangement is the field in bioinformatics that studies and models how a collection of genomes evolve and is generally expressed in the following way: given two (or more) genomes, find a shortest sequence of mutations that transform one into the other. Several models have been proposed, that differ either by the kind(s) of mutations taken into account or by the way genomes are represented, depending on the different biological assumptions that can be made. We review some of these models and known results, explain how computers have already helped in this area and suggest some further possible uses for them.

Event Date: Feb 2006

Research Interests:
Algorithms, Combinatorics, and Genome Rearrangement

Genome rearrangement problems are concerned, from a combinatorial point
of view, with sorting ordered sets of elements in as few moves as possible, using
a prescribed set of operations. The nature of those ordered sets may vary, but a large part of the literature is concerned with permutations or signed
permutations, possibly of multisets.
A traditional tool that has been the basis of most algorithmic and other theoretical
results in the field is known as the ``breakpoint graph'', a bicoloured
graph that models both our goal and our present situation and whose decomposition
into alternating cycles yields extremely good bounds for those kinds of
problems. In this talk, we present results that were obtained using tools that
are best known to mathematicians, namely the disjoint cycle decomposition
of permutations. More precisely, we will show that the decomposition can be
used both as a means to obtain bounds on one of the aforementioned sorting
problems, whose complexity is still unknown, and as a way to solve a counting
problem related to the structure of the breakpoint graph.

Event Date: Jun 2007

Research Interests:
Algorithms, Combinatorics, and Genome Rearrangement

In 2005 Cassens, Mardulyn and Milinkovitch proposed a new method for constructing phylogenetic networks in the context of intraspecific genealogies, also known as ``haplotype networks''. The proposed method, called ``Union of Maximum Parsimonious Trees'', is based on the global maximum parsimony approach, which aims at combining all most parsimonious trees into a single graph (in that context, trees are unrooted and undirected). However, their algorithm makes a number of arbitrary choices, produces solutions whose quality depends on the order in which the merging process is performed, and is a heuristic with an unclear objective function. We propose a combinatorial optimisation problem that can be used as a formal model for building such a graph, which consists in finding the minimum common supergraph of a given set of partially labelled trees. We propose a polynomial-time algorithm for solving the problem on two trees of a certain class, and a branch-and-bound algorithm in the case of two arbitrary trees. We will also discuss possible approaches when dealing with more than two trees.

Event Date: Dec 2007

Research Interests:
Algorithms, Phylogenetics, and Phylogenetic Networks

Computing distances between permutations
constitutes a topic with a number of applications, including interconnection
network design and the study of genome rearrangements. Those distances are
defined as the minimum number of moves needed to transform one permutation
into the other (allowed transformations being constrained beforehand). In
this talk, I will present a new formulation of a structure ubiquitous in the
study of genome rearrangements, known as the ``cycle graph'' or ``breakpoint
graph'' of a permutation, which recasts that structure as an even permutation.
This new point of view allows to restate every edit distance computation
problem, as long as it deals with permutations and ``revertible''
rearrangements, in terms of particular factorisations of an(other) even
permutation. I will show how this method allows, on the one hand, to recover
known results about genome rearrangements and to derive new ones, and on the
other hand, to solve counting problems related to the breakpoint graph.

More Info: Talk originally in French

Event Date: Mar 2008

Research Interests:
Algorithms, Combinatorics, and Genome Rearrangement

A number of fields, including genome rearrangements and interconnection network design, are concerned with sorting permutations in ``as few moves as possible'', using a given set of allowed operations. These often act on just one or two segments of the permutation, e.g. by reversing one segment or exchanging two segments. The \emph{cycle graph} of the permutation to sort is a fundamental tool in the theory of genome rearrangements. In this paper, we present an algebraic reinterpretation of the cycle graph as an even permutation, and show how to reformulate our sorting problems in terms of particular factorisations of the latter permutation. Using our framework, we recover known results in a simple and unified way, and obtain a new lower bound on the \emph{prefix transposition distance} (where a \emph{prefix transposition} displaces the initial segment of a permutation), which is shown to outperform previous results. Moreover, we use our approach to improve the best known lower bound on the \emph{prefix transposition diameter} from $2n/3$ to $\left\lfloor\frac{3n+1}{4}\right \rfloor$.

Event Date: Sep 2008

Research Interests:
Algorithms, Combinatorics, and Genome Rearrangement

Related Authors

Anthony Labarre

Publisher: The MIT Press

Publication Date: Jun 5, 2009

Publication Name: The MIT Press eBooks

Research Interests: Mathematics, Linear Algebra, Complementation, and Graph<div>()</div>

Publisher: Springer Science+Business Media

Publication Date: 2018

Publication Name: Lecture Notes in Computer Science

Research Interests: Computer Science, Internet shopping, Mathematical Proof, and arXiv<div>()</div>

Publisher: University of Waterloo

Publication Date: Jun 1, 2007

Publication Name: Journal of Integer Sequences

Research Interests: Mathematics, Applied Mathematics, Combinatorics, Pure Mathematics, Numerical Analysis and Computational Mathematics, and Integer sequences<div>()</div>

Publisher: Elsevier BV

Publication Date: May 1, 2020

Publication Name: Discrete Applied Mathematics

Publisher: Elsevier BV

Publication Date: Feb 1, 2021

Publication Name: Discrete Mathematics

Publisher: The MIT Press

Publication Date: Jun 5, 2009

Publication Name: The MIT Press eBooks

Research Interests: Combinatorics, Biology, and String theory (Physics)<div>()</div>

Publisher: The MIT Press

Publication Date: Jun 5, 2009

Publication Name: The MIT Press eBooks

Publisher: The MIT Press

Publication Date: Jun 5, 2009

Publication Name: The MIT Press eBooks

Publisher: The MIT Press

Publication Date: Jun 5, 2009

Publication Name: The MIT Press eBooks

Research Interests: Mathematics<div>()</div>

Publisher: The MIT Press

Publication Date: Jun 5, 2009

Publication Name: The MIT Press eBooks

Research Interests: Medicine<div>()</div>

Publisher: Cornell University

Publication Date: Aug 30, 2022

Publication Name: arXiv (Cornell University)

Research Interests: Combinatorics, Prefix, Time Complexity, Sorting, and Genome Rearrangements<div>()</div>

Publisher: The MIT Press

Publication Date: Jun 5, 2009

Publication Name: The MIT Press eBooks

Publisher: Wiley

Publication Date: Jul 20, 2021

Publication Name: Journal of Graph Theory

Research Interests: Mathematics, Computer Science, Graph Theory, Combinatorics, and Pure Mathematics<div>()</div>

Publisher: The MIT Press

Publication Date: Jun 5, 2009

Publication Name: The MIT Press eBooks

Research Interests: Genome<div>()</div>

Publisher: The MIT Press

Publication Date: Jun 5, 2009

Publication Name: The MIT Press eBooks

Research Interests: Mathematics<div>()</div>

Publisher: The MIT Press

Publication Date: Jun 5, 2009

Publication Name: The MIT Press eBooks

Research Interests: Mathematics, Genome, and Synteny<div>()</div>

Publisher: Cornell University

Publication Date: Apr 16, 2016

Publication Name: arXiv (Cornell University)

Publisher: Springer Nature

Publication Date: 2016

Publication Name: Springer eBooks

Publisher: Elsevier BV

Publication Date: Jul 1, 2013

Publication Name: Discrete Applied Mathematics

Publisher: The MIT Press

Publication Date: Jun 5, 2009

Publication Name: The MIT Press eBooks

Publication Date: Sep 12, 2008

Publisher: Springer Nature

Publication Date: 2005

Publication Name: Springer eBooks

Publisher: Elsevier BV

Publication Date: Mar 1, 2011

Publication Name: Theoretical Computer Science

Publisher: Cornell University

Research Interests:
Mathematics, Linear Algebra, Complementation, and Graph

Research Interests:
Computer Science, Internet shopping, Mathematical Proof, and arXiv

Research Interests:
Mathematics, Applied Mathematics, Combinatorics, Pure Mathematics, Numerical Analysis and Computational Mathematics, and Integer sequences

Research Interests:
Combinatorics, Biology, and String theory (Physics)

Research Interests:
Mathematics

Research Interests:
Medicine

Research Interests:
Combinatorics, Prefix, Time Complexity, Sorting, and Genome Rearrangements

Research Interests:
Mathematics, Computer Science, Graph Theory, Combinatorics, and Pure Mathematics

Research Interests:
Genome

Research Interests:
Mathematics

Research Interests:
Mathematics, Genome, and Synteny

Research Interests:
Mathematics and Combinatorics

Research Interests:
Computer Science, PERL, Citation, and Matching statistics

Research Interests:
Computer Science, Combinatorics, Prefix, Time Complexity, Sorting, and Genome Rearrangements

Research Interests:
Mathematics, Computer Science, Algorithms, and Phylogenetics