Markov models of genome segmentation

No Thumbnail Available
Date
2007-01-25
Authors
Thakur, Vivek
Azad, Rajeev K.
Ramaswamy, Ram
Journal Title
Journal ISSN
Volume Title
Publisher
Abstract
We introduce Markov models for segmentation of symbolic sequences, extending a segmentation procedure based on the Jensen-Shannon divergence that has been introduced earlier. Higher-order Markov models are more sensitive to the details of local patterns and in application to genome analysis, this makes it possible to segment a sequence at positions that are biologically meaningful. We show the advantage of higher-order Markov-model-based segmentation procedures in detecting compositional inhomogeneity in chimeric DNA sequences constructed from genomes of diverse species, and in application to the E. coli K12 genome, boundaries of genomic islands, cryptic prophages, and horizontally acquired regions are accurately identified. © 2007 The American Physical Society.
Description
Keywords
Citation
Physical Review E - Statistical, Nonlinear, and Soft Matter Physics. v.75(1)