Genome Clustering: From Linguistic Models to Classification of Genetic Texts
Springer| May 18, 2010 | ISBN-10: 364212951X | 206 pages | PDF | 3.64 MB
Springer| May 18, 2010 | ISBN-10: 364212951X | 206 pages | PDF | 3.64 MB
This book deals with the methods of text comparison which are based on different techniques of converting the text into a distribution on a certain finite support, be it a genetic text or a text of some other type. Such distribution is usually referred to as “spectrum”. The measure of dissimilarity of two texts is formally expressed as a certain “distance” between the spectra of these texts. Such definition implies that the similarity of the texts results from the similarity of the random processes generating the texts.