10.6084/m9.figshare.8195177.v1 Carlos Alberto Araújo Júnior Carlos Alberto Araújo Júnior Pábulo Diogo de Souza Pábulo Diogo de Souza Adriana Leandra de Assis Adriana Leandra de Assis Christian Dias Cabacinha Christian Dias Cabacinha Helio Garcia Leite Helio Garcia Leite Carlos Pedro Boechat Soares Carlos Pedro Boechat Soares Antonilmar Araújo Lopes da Silva Antonilmar Araújo Lopes da Silva Renato Vinícius Oliveira Castro Renato Vinícius Oliveira Castro Artificial neural networks, quantile regression, and linear regression for site index prediction in the presence of outliers SciELO journals 2019 Eucalyptus artificial intelligence dominant height forest inventory forest modelling non-sampling errors 2019-05-29 02:45:17 Dataset https://scielo.figshare.com/articles/dataset/Artificial_neural_networks_quantile_regression_and_linear_regression_for_site_index_prediction_in_the_presence_of_outliers/8195177 <div><p>Abstract: The objective of this work was to compare methods of obtaining the site index for eucalyptus (Eucalyptus spp.) stands, as well as to evaluate their impact on the stability of this index in databases with and without outliers. Three methods were tested, using linear regression, quantile regression, and artificial neural network. Twenty-two permanent plots from a continuous forest inventory were used, measured in trees with ages from 23 to 83 months. The outliers were identified using a boxplot graphic. The artificial neural network showed better results than the linear and quantile regressions, both for dominant height and site index estimates. The stability obtained for the site index classification by the artificial neural network was also better than the one obtained by the other methods, regardless of the presence or the absence of outliers in the database. This shows that the artificial neural network is a solid modelling technique in the presence of outliers. When the cause of the presence of outliers in the database is not known, they can be kept in it if techniques as artificial neural networks or quantile regression are used.</p></div>