Genomic sequences are usually compared using evolutionarydistance, a procedure that implies the alignment of the sequences. Alignmentof long sequences is a long procedure and the obtained dissimilarityresults is not a metric. Recently the normalized compression distance wasintroduced as a method to calculate the distance between two generic digitalobjects, and it seems a suitable way to compare genomic strings. Inthis paper the clustering and the mapping, obtained using a SOM, withthe traditional evolutionary distance and the compression distance arecompared in order to understand if the two distances sets are similar.The first results indicate that the two distances catch different aspectsof the genomic sequences and further investigations are needed to obtaina definitive result.
|Numero di pagine||14|
|Rivista||LECTURE NOTES IN ARTIFICIAL INTELLIGENCE|
|Stato di pubblicazione||Published - 2008|
All Science Journal Classification (ASJC) codes