10.6084/m9.figshare.7516742.v1 Leila Maria Ferreira Leila Maria Ferreira Thelma Sáfadi Thelma Sáfadi Juliano Lino Ferreira Juliano Lino Ferreira Wavelet-domain elastic net for clustering on genomes strains SciELO journals 2018 Elastic net genome GC-content cluster analysis wavelet transform 2018-12-26 05:37:15 Dataset https://scielo.figshare.com/articles/dataset/Wavelet-domain_elastic_net_for_clustering_on_genomes_strains/7516742 <div><p>Abstract We propose to evaluate genome similarity by combining discrete non-decimated wavelet transform (NDWT) and elastic net. The wavelets represent a signal with levels of detail, that is, hidden components are detected by means of the decomposition of this signal, where each level provides a different characteristic. The main feature of the elastic net is the grouping of correlated variables where the number of predictors is greater than the number of observations. The combination of these two methodologies applied in the clustering analysis of the Mycobacterium tuberculosis genome strains proved very effective, being able to identify clusters at each level of decomposition.</p></div>