10.6084/m9.figshare.7516742.v1
Leila Maria Ferreira
Leila Maria
Ferreira
Thelma Sáfadi
Thelma
Sáfadi
Juliano Lino Ferreira
Juliano Lino
Ferreira
Wavelet-domain elastic net for clustering on genomes strains
SciELO journals
2018
Elastic net
genome
GC-content
cluster analysis
wavelet transform
2018-12-26 05:37:15
Dataset
https://scielo.figshare.com/articles/dataset/Wavelet-domain_elastic_net_for_clustering_on_genomes_strains/7516742
<div><p>Abstract: We propose evaluating genome similarity by combining the non-decimated discrete wavelet transform (NDWT) with the elastic net. Wavelets represent a signal at several levels of detail: decomposing the signal reveals hidden components, and each level captures a different characteristic. The main feature of the elastic net is that it groups correlated variables in settings where the number of predictors exceeds the number of observations. Applied to cluster analysis of Mycobacterium tuberculosis genome strains, the combination of these two methods proved very effective, identifying clusters at each level of decomposition.</p></div>
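The wavelet step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: this record does not state the wavelet family, the window size, or the software used, so the Haar filter (in its unnormalized averaging/differencing form), the periodic boundary handling, and the function names `gc_content` and `haar_ndwt` are all assumptions made for illustration. The elastic net step is not shown.

```python
def gc_content(sequence, window):
    """GC fraction in consecutive non-overlapping windows (window size is an assumption)."""
    seq = sequence.upper()
    values = []
    for start in range(0, len(seq) - window + 1, window):
        chunk = seq[start:start + window]
        values.append(sum(base in "GC" for base in chunk) / window)
    return values


def haar_ndwt(signal, levels):
    """Non-decimated Haar transform (a trous scheme, periodic boundaries).

    Hypothetical helper: returns (details, approx), where details holds one
    list per level and every list has the same length as the input signal,
    because no downsampling is performed.
    """
    n = len(signal)
    approx = list(signal)
    details = []
    for level in range(levels):
        step = 2 ** level  # the filter is dilated at each level instead of decimating
        shifted = [approx[(i + step) % n] for i in range(n)]
        details.append([(a - b) / 2 for a, b in zip(approx, shifted)])
        approx = [(a + b) / 2 for a, b in zip(approx, shifted)]
    return details, approx


if __name__ == "__main__":
    toy_genome = "ATGCGCGCATATGGCC" * 4  # toy sequence, not real data
    gc = gc_content(toy_genome, window=4)
    details, approx = haar_ndwt(gc, levels=3)
    for level, coeffs in enumerate(details, start=1):
        print(level, [round(c, 3) for c in coeffs[:4]])
```

With this filter, the input equals the final smooth plus the sum of the detail series at every position, so each level isolates variation at one scale. In a workflow like the one the abstract outlines, the per-level coefficients for each strain could then serve as the features passed to the elastic net.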