Three Approaches to supervised learning for compositional data with pairwise logratios
dc.contributor.author
dc.date.accessioned
2024-01-24T06:09:30Z
dc.date.available
2024-01-24T06:09:30Z
dc.date.issued
2023-11-09
dc.identifier.issn
0266-4763
dc.identifier.uri
dc.description.abstract
Logratios between pairs of compositional parts (pairwise logratios) are the easiest to interpret in compositional data analysis, and include the well-known additive logratios as particular cases. When the number of parts is large (sometimes even larger than the number of cases), some form of logratio selection is needed. In this article, we present three alternative stepwise supervised learning methods to select the pairwise logratios that best explain a dependent variable in a generalized linear model, each geared for a specific problem. The first method features unrestricted search, where any pairwise logratio can be selected. This method has a complex interpretation if some pairs of parts in the logratios overlap, but it leads to the most accurate predictions. The second method restricts parts to occur only once, which makes the corresponding logratios intuitively interpretable. The third method uses additive logratios, so that K−1 selected logratios involve a K-part subcomposition. Our approach allows logratios or non-compositional covariates to be forced into the models based on theoretical knowledge, and various stopping criteria are available based on information measures or statistical significance with the Bonferroni correction. We present an application on a dataset from a study predicting Crohn's disease
dc.format.extent
22 p.
dc.format.mimetype
application/pdf
dc.language.iso
eng
dc.relation.isformatof
Versió postprint del document publicat a: https://doi.org/10.1080/02664763.2022.2108007
dc.relation.ispartof
© Journal of Applied Statistics, 2023, vol. 50, núm. 16, p. 3272-3293
dc.relation.ispartofseries
Articles publicats (D-EC)
dc.rights
Tots els drets reservats
dc.source
Coenders, Germà Greenacre, Michael J. 2023 Three Approaches to supervised learning for compositional data with pairwise logratios Journal of Applied Statistics 50 16 3272 3293
dc.subject
dc.title
Three Approaches to supervised learning for compositional data with pairwise logratios
dc.type
info:eu-repo/semantics/article
dc.rights.accessRights
info:eu-repo/semantics/embargoedAccess
dc.embargo.lift
2024-11-09T00:00:00Z
dc.embargo.terms
2024-11-09T00:00:00Z
dc.date.embargoEndDate
info:eu-repo/date/embargoEnd/2024-11-09
dc.type.version
info:eu-repo/semantics/acceptedVersion
dc.identifier.doi
dc.identifier.idgrec
037467
dc.type.peerreviewed
peer-reviewed
dc.identifier.eissn
1360-0532