skip to main content

Alignment-free sequence comparison—a review

Vinga, Susana ; Almeida, Jonas

Bioinformatics, 2003-03, Vol.19 (4), p.513-523 [Peer Reviewed Journal]

Oxford: Oxford University Press

Full text available

Citations Cited by
  • Title:
    Alignment-free sequence comparison—a review
  • Author: Vinga, Susana ; Almeida, Jonas
  • Subjects: Algorithms ; Animals ; Biological and medical sciences ; Fundamental and applied biological sciences. Psychology ; General aspects ; Humans ; Mathematics in biology. Statistical analysis. Models. Metrology. Data processing in biology (general aspects) ; Models, Genetic ; Models, Statistical ; Sequence Alignment - methods ; Sequence Analysis - methods ; Sequence Homology
  • Is Part Of: Bioinformatics, 2003-03, Vol.19 (4), p.513-523
  • Notes: local:190513
    ark:/67375/HXZ-M2XWQJ2N-3
    istex:248F4D3E263A60DF7BAADC1843ACA1A6F08BC3A7
    PII:1460-2059
    ObjectType-Article-2
    SourceType-Scholarly Journals-1
    ObjectType-Feature-1
    content type line 23
    ObjectType-Article-1
    ObjectType-Feature-3
    ObjectType-Review-2
  • Description: Motivation: Genetic recombination and, in particular, genetic shuffling are at odds with sequence comparison by alignment, which assumes conservation of contiguity between homologous segments. A variety of theoretical foundations are being used to derive alignment-free methods that overcome this limitation. The formulation of alternative metrics for dissimilarity between sequences and their algorithmic implementations are reviewed. Results: The overwhelming majority of work on alignment-free sequence has taken place in the past two decades, with most reports published in the past 5 years. Two main categories of methods have been proposed—methods based on word (oligomer) frequency, and methods that do not require resolving the sequence with fixed word length segments. The first category is based on the statistics of word frequency, on the distances defined in a Cartesian space defined by the frequency vectors, and on the information content of frequency distribution. The second category includes the use of Kolmogorov complexity and Chaos Theory. Despite their low visibility, alignment-free metrics are in fact already widely used as pre-selection filters for alignment-based querying of large applications. Recent work is furthering their usage as a scale-independent methodology that is capable of recognizing homology when loss of contiguity is beyond the possibility of alignment. Availability: Most of the alignment-free algorithms reviewed were implemented in MATLAB code and are available at http://bioinformatics.musc.edu/resources.html Contact: almeidaj@musc.edu; svinga@itqb.unl.pt * To whom correspondence should be addressed.
  • Publisher: Oxford: Oxford University Press
  • Language: English

Searching Remote Databases, Please Wait

  • Searching for
  • inscope:(USP_VIDEOS),scope:("PRIMO"),scope:(USP_FISICO),scope:(USP_EREVISTAS),scope:(USP),scope:(USP_EBOOKS),scope:(USP_PRODUCAO),primo_central_multiple_fe
  • Show me what you have so far