Fact-checked by Grok 2 weeks ago
References
-
[1]
[PDF] A Guided Tour to Approximate String Matching - DCC UChileAbstract. We survey the current techniques to cope with the problem of string matching allowing errors. This is becoming a more and more relevant issue for ...
-
[2]
Levenshtein, V.I. (1966) Binary Codes Capable of Correcting ...Levenshtein, VI (1966) Binary Codes Capable of Correcting Deletions, Insertions and Reversals. Soviet Physics Doklady, 10, 707-710.
-
[3]
Algorithms for approximate string matching - ScienceDirect.com1. Levenshtein V.I.. Binary codes capable of correcting deletions, insertions, and reversals · 2. Lowrance R., Wagner R.A. · 3. Nakatsu N., Kambayashi Y., Yajima ...
-
[4]
[PDF] A Guided Tour to Approximate String MatchingWe survey the current techniques to cope with the problem of string matching that allows errors. This is becoming a more and more relevant issue for many ...
-
[5]
[PDF] Binary codes capable of correcting deletions, insertions, and reversals"It can be shown that if a code K in Bn-7 can correct one deletion, insertion, or reversal (e.g., K = Kn-7.2(n-7)), the code J=K11.01 is admissible. Page 4. 710.
-
[6]
V. I. Levenshtein, “Binary codes capable of correcting deletions ...Binary codes capable of correcting deletions, insertions, and reversals. VI Levenshtein. Full-text PDF (652 kB).
-
[7]
The String-to-String Correction Problem | Journal of the ACMThe string-to-string correction problem is to determine the distance between two strings as measured by the minimum cost sequence of “edit operations”
-
[8]
[PDF] March 11, 1995 - Dartmouth Computer ScienceMar 11, 1995 · Some years ago, S. C. Johnson introduced the UNIX spelling checker spell. His idea was simply to look up every word of a document in a standard ...
-
[9]
Approximate string-matching over suffix trees - SpringerLinkJun 17, 2005 · Tarhio, J. & Ukkonen, E. (1990): Boyer-Moore approach to approximate string matching. 2nd Scand. Workshop on Algorithm Theory, Lect. Notes in ...
-
[10]
Levenshtein Distance - an overview | ScienceDirect TopicsLevenshtein distance is defined as the minimum number of insertions, deletions, or substitutions required to transform one string into another. It serves as a ...Missing: URL | Show results with:URL
-
[11]
[PDF] The Bell System Technical Journal - Zoo | Yale UniversityCopyright, 1950,American Telephone and Telegraph Company. Error Detecting and Error Correcting Codes. By R. W. HAMMING ... errors. Since the machines were run ...
-
[12]
Hamming Distance - an overview | ScienceDirect TopicsHamming Distance refers to the number of positions at which two strings of the same length differ. It is a metric used in computer science to measure ...Missing: citation | Show results with:citation
-
[13]
A technique for computer detection and correction of spelling errorsA technique for computer detection and correction of spelling errors. Author: Fred J. Damerau. Fred J. Damerau. IBM Corporation, Yorktown Heights, NY. View ...
-
[14]
[PDF] Edit-Distance of Weighted Automata: General Definitions and ...The paper is organized as follows. In Section 2, we introduce the definition of the edit-distances of two languages, two distributions, or two automata. Sec ...
-
[15]
[PDF] The String Edit Distance Matching Problem with MovesThe goal is that string edit operations only have a localized effect on the ET. ... Approximate nearest neighbors and sequence comparison with block operations.
-
[16]
A general method applicable to the search for similarities in the ...A computer adaptable method for finding similarities in the amino acid sequences of two proteins has been developed.Missing: paper | Show results with:paper
-
[17]
An Extension of the String-to-String Correction ProblemThe Extended String-to-String Correction Problem [ESSCP] is defined as the problem of determining, for given strings A and B over alphabet V, a minimum-cost ...<|control11|><|separator|>
-
[18]
A guided tour to approximate string matching - ACM Digital LibraryWe focus on online searching and mostly on edit distance, explaining the problem and its relevance, its statistical behavior, its history and current ...
-
[19]
[PDF] Fast approximate string matching with suffix arrays and A* parsingOur method uses a suffix array to find n- gram matches (Section 4), principles of A* search to filter the matches, and an A* parsing method to iden- tify the ...
-
[20]
[PDF] Indexing Text with Approximate q-grams - DCC UChileThe distance h between samples is computed so that there are at least k + s q-samples inside any occurrence.Missing: pairs | Show results with:pairs
-
[21]
Fast text searching: allowing errors: Communications of the ACMFast text searching: allowing errors. Authors: Sun Wu. Sun Wu. Bell Labs, Murray ... Wu, S., Manber, U. and Myers,. E,.W',. A Sub-Quadratic Algorithm for ...
-
[22]
[PDF] FAST PATTERN MATCHING IN STRINGS*MORRIS, JR. AND VAUGHAN R. PRATT, Fast pattern matching in strings, Tech. Rep. CS440, Computer Science Department, Stanford Univ.,. Stanford, Calif., 1974 ...
-
[23]
How to Write a Spelling Corrector - Peter NorvigThe list of known words at edit distance two away, if there are any; otherwise. The original word, even though it is not known.
-
[24]
Introduction to Levenshtein distance - GeeksforGeeksJan 31, 2024 · Let's see an example that there is String A: "kitten" which need to be converted in String B: "sitting" so we need to determine the minimum ...
-
[25]
[PDF] agrep — a fast approximate pattern-matching tool - USENIXIn this paper we describe a new tool, called agrep, for approximate pattern matching. Agrep is based on a new efficient and flexible algorithm for ...Missing: survey | Show results with:survey
-
[26]
[PDF] A. Survey of Entity Resolution and Record Linkage MethodologiesMuch of the literature's focus on matching criteria for entity resolution and record linkage employ text-based matches. This is not surprising given that a ...
-
[27]
Comparison of text preprocessing methods | Natural Language ...Jun 13, 2022 · Approximate string matching in tokenization is also helpful in NER of biomedical and chemical terms (Akkasi et al. Reference Akkasi, Varoğlu ...
-
[28]
Plagiarism detection using stopword n-grams - Semantic ScholarIt is shown that stopword n-grams reveal important information for plagiarism detection since they are able to capture syntactic similarities between ...
-
[29]
Hamming Distance Explained: The Theory and ApplicationsApr 16, 2025 · Hamming distance measures the number of positions at which two strings of equal length have different symbols.
-
[30]
Hybridizing Fuzzy String Matching and Machine Learning for ... - MDPIThis paper proposed a novel method that hybridizes fuzzy string-matching algorithms and the Deep Bidirectional Transformer (BERT) deep learning model with ...Missing: approximate tokenization
-
[31]
(PDF) On Nonlinear Learned String Indexing - ResearchGateWe investigate the potential of several artificial neural network architectures to be used as an index on a sorted set of strings, namely, as a mapping from a ...
-
[32]
[PDF] Entity Matching with Transformer Architectures - A Step Forward in ...Entity matching (EM) is finding data instances referring to the same real-world entity, crucial for data integration and cleaning.
-
[33]
A Grover-based Quantum Algorithm for Approximate String Matching ...Jun 11, 2025 · This paper presents the design and validation of a novel quantum algorithm for approximate string matching, which quadratically accelerates the ...
-
[34]
Quantum Approximate k-Minimum Finding - DROPSOct 1, 2025 · Quantum speed-ups for string synchronizing sets, longest common substring, and k-mismatch matching. ... String matching in Õ(√n+√m) quantum time.
-
[35]
[PDF] GenASM: A High-Performance, Low-Power Approximate String ...We present GenASM, a novel approximate string match- ing acceleration framework for genome sequence analysis. GenASM is a power- and area-efficient hardware ...
-
[36]
[PDF] Low-power, high-performance and scalable genome sequence ...Aug 31, 2025 · A fast bit-vector algorithm for approximate string matching ... FPGA-based hardware acceleration for local complexity analysis of massive genomic ...
-
[37]
Secure Approximate String Matching using Homomorphic ...Oct 5, 2022 · Several techniques have been proposed on privacy-preserving approximate string matching such as Secure Hash Encoding etc. Relative to other ...
-
[38]
Context-aware Transliteration of Romanized South Asian LanguagesJun 1, 2024 · In this article, we present a demonstration of how important contextual information is for this task, as well as exploring several methods for jointly ...
-
[39]
Parity-Aware Byte-Pair Encoding: Improving Cross-lingual Fairness ...Aug 6, 2025 · Evaluations on 13 multilingual benchmarks show that models trained with a Parity-aware tokenizer match or exceed downstream performance compared ...