Searching and Indexing Circular Patterns

Circular DNA sequences can be found in viruses, as plasmids in archaea and bacteria, and in the mitochondria and plastids of eukaryotic cells. Hence, circular sequence comparison finds applications in several biological contexts (Barton et al., Experimental algorithms. Lecture notes in computer science, vol 9125, pp 247–258, 2015; Barton et al., Algorithms Mol Biol 9(9):2014; Uliel et al., Protein Eng 14(8):533–542, 2001). This motivates the design of efficient algorithms (Barton et al., Language and automata theory and applications. Lecture notes in computer science, vol 8977, pp 85–96. Springer, Berlin, 2015) and data structures (Hon et al., Combinatorial pattern matching. Lecture notes in computer science, vol 7922, pp 142–152. Springer, Berlin/Heidelberg, 2013) that are devoted to the specific comparison of circular sequences, as they can be relevant in the analysis of organisms with such structure (Grossi et al., Proceedings of algorithms in bioinformatics - 15th international workshop, WABI 2015, Atlanta, GA, Sept 10–12, 2015. Lecture notes in computer science, vol 9289, pp 203–216. Springer, Berlin, 2015; Gusfield, Algorithms on strings, trees, and sequences - computer science and computational biology. Cambridge University Press, Cambridge, 1997).

[1]  Roberto Grossi,et al.  Circular Sequence Comparison with q-grams , 2015, WABI.

[2]  Wing-Kai Hon,et al.  Compressed indexes for dynamic text collections , 2007, TALG.

[3]  Djamal Belazzougui Succinct Dictionary Matching with No Slowdown , 2010, CPM.

[4]  Enno Ohlebusch,et al.  Bidirectional search in a string with wavelet trees and bidirectional matching statistics , 2012, Inf. Comput..

[5]  Costas S. Iliopoulos,et al.  Fast algorithms for approximate circular string matching , 2014, Algorithms for Molecular Biology.

[6]  S Uliel,et al.  Naturally occurring circular permutations in proteins. , 2001, Protein engineering.

[7]  M. Lothaire,et al.  Applied Combinatorics on Words , 2005 .

[8]  Maxime Crochemore,et al.  Algorithms on strings , 2007 .

[9]  Richard C. T. Lee,et al.  Bit-Parallel Algorithms for Exact Circular String Matching , 2014, Comput. J..

[10]  Giovanni Manzini,et al.  Opportunistic data structures with applications , 2000, Proceedings 41st Annual Symposium on Foundations of Computer Science.

[11]  Volker Heun,et al.  Space-Efficient Preprocessing Schemes for Range Minimum Queries on Static Arrays , 2011, SIAM J. Comput..

[12]  Gad M. Landau,et al.  Construction of Aho Corasick automaton in linear time for integer alphabets , 2006, Inf. Process. Lett..

[13]  Costas S. Iliopoulos,et al.  Circular string matching revisited , 2013 .

[14]  Costas S. Iliopoulos,et al.  Indexing Circular Patterns , 2008, WALCOM.

[15]  Ronald L. Rivest,et al.  Partial-Match Retrieval Algorithms , 1976, SIAM J. Comput..

[16]  Wing-Kai Hon,et al.  Space-Efficient Construction Algorithm for the Circular Suffix Tree , 2013, CPM.

[17]  Costas S. Iliopoulos,et al.  REAL: an efficient REad ALigner for next generation sequencing reads , 2010, BCB '10.

[18]  Wing-Kai Hon,et al.  Approximate string matching using compressed suffix arrays , 2006, Theor. Comput. Sci..

[19]  Dan Gusfield,et al.  Algorithms on Strings, Trees, and Sequences - Computer Science and Computational Biology , 1997 .

[20]  Johannes Fischer,et al.  Inducing the LCP-Array , 2011, WADS.

[21]  Wing-Kai Hon,et al.  Faster Compressed Dictionary Matching , 2010, SPIRE.

[22]  Lucian Ilie,et al.  The longest common extension problem revisited and applications to approximate string searching , 2010, J. Discrete Algorithms.

[23]  Costas S. Iliopoulos,et al.  Fast circular dictionary-matching algorithm , 2015, Mathematical Structures in Computer Science.

[24]  Andrés Marzal,et al.  Speeding up the cyclic edit distance using LAESA with early abandon , 2015, Pattern Recognit. Lett..

[25]  Udi Manber,et al.  Fast text searching: allowing errors , 1992, CACM.

[26]  Szymon Grabowski,et al.  Average-optimal string matching , 2009, J. Discrete Algorithms.

[27]  Eugene W. Myers,et al.  Suffix arrays: a new method for on-line string searches , 1993, SODA '90.

[28]  Costas S. Iliopoulos,et al.  Average-Case Optimal Approximate Circular String Matching , 2014, LATA.

[29]  Alfred V. Aho,et al.  Efficient string matching , 1975, Commun. ACM.

[30]  Ge Nong,et al.  Linear Suffix Array Construction by Almost Pure Induced-Sorting , 2009, 2009 Data Compression Conference.

[31]  Peter Weiner,et al.  Linear Pattern Matching Algorithms , 1973, SWAT.

[32]  Esko Ukkonen,et al.  On-line construction of suffix trees , 1995, Algorithmica.

[33]  Wing-Kai Hon,et al.  Succinct Indexes for Circular Patterns , 2011, ISAAC.