Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/800116.803771acmconferencesArticle/Chapter ViewAbstractPublication PagesstocConference Proceedingsconference-collections
Article
Free access

On the complexity of the Extended String-to-String Correction Problem

Published: 05 May 1975 Publication History

Abstract

The Extended String-to-String Correction Problem [ESSCP] is defined as the problem of determining, for given strings A and B over alphabet V, a minimum-cost sequence S of edit operations such that S(A) = B. The sequence S may make use of the operations: Change, Insert, Delete and Swaps, each of constant cost WC, WI, WD, and WS respectively. Swap permits any pair of adjacent characters to be interchanged.
The principal results of this paper are:
(1) a brief presentation of an algorithm (the CELLAR algorithm) which solves ESSCP in time Ø(¦A¦* ¦B¦* ¦V¦s*s), where s = min(4WC, WI+WD)/WS + 1;
(2) presentation of polynomial time algorithms for the cases (a) WS = 0, (b) WS > 0, WC= WI= WD= @@@@;
(3) proof that ESSCP, with WI < WC = WD = @@@@, 0 < WS < @@@@, suitably encoded, is NP-complete. (The remaining case, WS= @@@@, reduces ESSCP to the string-to-string correction problem of [1], where an Ø( ¦A¦* ¦B¦) algorithm is given.) Thus, “almost all” ESSCP's can be solved in deterministic polynomial time, but the general problem is NP-complete.

References

[1]
Wagner, R. A. &amp; Fischer, M. J., "The string-to-string correction problem", Journal of the ACM, Vol. 21, no. l(January 1974), pp.168-173.
[2]
Lowrance, R. &amp; Wagner, R. A., "An extension of the string-to-string correction problem" to be published in the Journal of the ACM.
[3]
Wagner, R. A. &amp; Brown, T. P., "Swap-extended minimum edit distance correction for regular languages", Technical Report, Systems and Information Science Program, Vanderbilt University (January 1975).
[4]
Karp, R. M., "Reducibility among combinatorial problems" in Complexity of Computer Computations, Miller, R. E. &amp; Thatcher, J. W. (ed.), Plenum Press (1972), pp. 85-104.
[5]
Wagner, R. A., "Generalized correction of context-free languages", Technical Report, Systems and Information Science Program, Vanderbilt Univeristy (1973).
[6]
Gaver, G. P. &amp; Thompson, G. L., Mathematical Models: Programming and Probability (1966).

Cited By

View all
  • (2023)PhyloSophos: a high-throughput scientific name mapping algorithm augmented with explicit consideration of taxonomic science, and its application on natural product (NP) occurrence database processingBMC Bioinformatics10.1186/s12859-023-05588-324:1Online publication date: 14-Dec-2023
  • (2022)Set-Theoretical Data Models in the Problem of Calculating Secondary Structures of RNASystem Analysis & Mathematical Modeling10.17150/2713-1734.2022.4(4).343-3574:4(343-357)Online publication date: 29-Dec-2022
  • (2021)Reference Mapping Considering Swaps of Adjacent BasesApplied Sciences10.3390/app1111503811:11(5038)Online publication date: 29-May-2021
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
STOC '75: Proceedings of the seventh annual ACM symposium on Theory of computing
May 1975
265 pages
ISBN:9781450374194
DOI:10.1145/800116
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 05 May 1975

Permissions

Request permissions for this article.

Check for updates

Qualifiers

  • Article

Acceptance Rates

STOC '75 Paper Acceptance Rate 31 of 87 submissions, 36%;
Overall Acceptance Rate 1,469 of 4,586 submissions, 32%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)62
  • Downloads (Last 6 weeks)10
Reflects downloads up to 01 Sep 2024

Other Metrics

Citations

Cited By

View all
  • (2023)PhyloSophos: a high-throughput scientific name mapping algorithm augmented with explicit consideration of taxonomic science, and its application on natural product (NP) occurrence database processingBMC Bioinformatics10.1186/s12859-023-05588-324:1Online publication date: 14-Dec-2023
  • (2022)Set-Theoretical Data Models in the Problem of Calculating Secondary Structures of RNASystem Analysis & Mathematical Modeling10.17150/2713-1734.2022.4(4).343-3574:4(343-357)Online publication date: 29-Dec-2022
  • (2021)Reference Mapping Considering Swaps of Adjacent BasesApplied Sciences10.3390/app1111503811:11(5038)Online publication date: 29-May-2021
  • (2020)Adaptive Exact Learning in a Mixed-Up World: Dealing with Periodicity, Errors and Jumbled-Index Queries in String ReconstructionString Processing and Information Retrieval10.1007/978-3-030-59212-7_12(155-174)Online publication date: 17-Sep-2020
  • (2019)Computing Nonoverlapping Inversion Distance Between Two Strings in Linear Average TimeJournal of Computational Biology10.1089/cmb.2018.013626:3(193-201)Online publication date: Mar-2019
  • (2019)Modern Aspects of Complexity Within Formal LanguagesLanguage and Automata Theory and Applications10.1007/978-3-030-13435-8_1(3-30)Online publication date: 14-Feb-2019
  • (2018)Adaptive Computation of the Swap-Insert Correction DistanceACM Transactions on Algorithms10.1145/323205714:4(1-16)Online publication date: 9-Aug-2018
  • (2018)Indexed Dynamic Programming to Boost Edit Distance and LCSS ComputationString Processing and Information Retrieval10.1007/978-3-030-00479-8_6(61-73)Online publication date: 14-Sep-2018
  • (2017)A generalization of the most common subgraph distance and its application to graph editingPattern Recognition Letters10.1016/j.patrec.2016.09.00887:C(71-78)Online publication date: 1-Feb-2017
  • (2015)S3JProceedings of the ACM Eighteenth International Workshop on Data Warehousing and OLAP10.1145/2811222.2811226(49-57)Online publication date: 22-Oct-2015
  • Show More Cited By

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Get Access

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media