Comparison of genomic sequences clustering using Normalized Compression Distance and Evolutionary Distance

Salvatore Gaglio, Massimo La Rosa, Alfonso Urso, Salvatore Gaglio, Riccardo Rizzo, Massimo La Rosa

Risultato della ricerca: Article

7 Citazioni (Scopus)

Abstract

Genomic sequences are usually compared using evolutionarydistance, a procedure that implies the alignment of the sequences. Alignmentof long sequences is a long procedure and the obtained dissimilarityresults is not a metric. Recently the normalized compression distance wasintroduced as a method to calculate the distance between two generic digitalobjects, and it seems a suitable way to compare genomic strings. Inthis paper the clustering and the mapping, obtained using a SOM, withthe traditional evolutionary distance and the compression distance arecompared in order to understand if the two distances sets are similar.The first results indicate that the two distances catch different aspectsof the genomic sequences and further investigations are needed to obtaina definitive result.
Lingua originaleEnglish
Numero di pagine14
RivistaLECTURE NOTES IN ARTIFICIAL INTELLIGENCE
Volume2008
Stato di pubblicazionePublished - 2008

Fingerprint

Genomics
Compression
Clustering
Alignment
Strings
Imply
Calculate
Metric

All Science Journal Classification (ASJC) codes

  • Theoretical Computer Science
  • Computer Science(all)

Cita questo

Comparison of genomic sequences clustering using Normalized Compression Distance and Evolutionary Distance. / Gaglio, Salvatore; La Rosa, Massimo; Urso, Alfonso; Gaglio, Salvatore; Rizzo, Riccardo; La Rosa, Massimo.

In: LECTURE NOTES IN ARTIFICIAL INTELLIGENCE, Vol. 2008, 2008.

Risultato della ricerca: Article

Gaglio, Salvatore ; La Rosa, Massimo ; Urso, Alfonso ; Gaglio, Salvatore ; Rizzo, Riccardo ; La Rosa, Massimo. / Comparison of genomic sequences clustering using Normalized Compression Distance and Evolutionary Distance. In: LECTURE NOTES IN ARTIFICIAL INTELLIGENCE. 2008 ; Vol. 2008.
@article{f46c260449e7415eb58ce9670661a3d6,
title = "Comparison of genomic sequences clustering using Normalized Compression Distance and Evolutionary Distance",
abstract = "Genomic sequences are usually compared using evolutionarydistance, a procedure that implies the alignment of the sequences. Alignmentof long sequences is a long procedure and the obtained dissimilarityresults is not a metric. Recently the normalized compression distance wasintroduced as a method to calculate the distance between two generic digitalobjects, and it seems a suitable way to compare genomic strings. Inthis paper the clustering and the mapping, obtained using a SOM, withthe traditional evolutionary distance and the compression distance arecompared in order to understand if the two distances sets are similar.The first results indicate that the two distances catch different aspectsof the genomic sequences and further investigations are needed to obtaina definitive result.",
author = "Salvatore Gaglio and {La Rosa}, Massimo and Alfonso Urso and Salvatore Gaglio and Riccardo Rizzo and {La Rosa}, Massimo",
year = "2008",
language = "English",
volume = "2008",
journal = "LECTURE NOTES IN ARTIFICIAL INTELLIGENCE",

}

TY - JOUR

T1 - Comparison of genomic sequences clustering using Normalized Compression Distance and Evolutionary Distance

AU - Gaglio, Salvatore

AU - La Rosa, Massimo

AU - Urso, Alfonso

AU - Gaglio, Salvatore

AU - Rizzo, Riccardo

AU - La Rosa, Massimo

PY - 2008

Y1 - 2008

N2 - Genomic sequences are usually compared using evolutionarydistance, a procedure that implies the alignment of the sequences. Alignmentof long sequences is a long procedure and the obtained dissimilarityresults is not a metric. Recently the normalized compression distance wasintroduced as a method to calculate the distance between two generic digitalobjects, and it seems a suitable way to compare genomic strings. Inthis paper the clustering and the mapping, obtained using a SOM, withthe traditional evolutionary distance and the compression distance arecompared in order to understand if the two distances sets are similar.The first results indicate that the two distances catch different aspectsof the genomic sequences and further investigations are needed to obtaina definitive result.

AB - Genomic sequences are usually compared using evolutionarydistance, a procedure that implies the alignment of the sequences. Alignmentof long sequences is a long procedure and the obtained dissimilarityresults is not a metric. Recently the normalized compression distance wasintroduced as a method to calculate the distance between two generic digitalobjects, and it seems a suitable way to compare genomic strings. Inthis paper the clustering and the mapping, obtained using a SOM, withthe traditional evolutionary distance and the compression distance arecompared in order to understand if the two distances sets are similar.The first results indicate that the two distances catch different aspectsof the genomic sequences and further investigations are needed to obtaina definitive result.

UR - http://hdl.handle.net/10447/48454

M3 - Article

VL - 2008

JO - LECTURE NOTES IN ARTIFICIAL INTELLIGENCE

JF - LECTURE NOTES IN ARTIFICIAL INTELLIGENCE

ER -