Compositional Data Analysis in E-Tourism Research
dc.contributor.author
dc.date.accessioned
2025-04-04T08:32:28Z
dc.date.available
2025-04-04T08:32:28Z
dc.date.issued
2022
dc.identifier.isbn
978-3-030-48651-8
dc.identifier.uri
dc.description.abstract
Compositional Data (CoDa) contain information about the relative importance of parts of a whole, which the researcher deems more interesting than overall size or volume. In web mining, for instance, the relative frequency of a term is normally given more importance than absolute frequency, which mostly tells about web size, in other words, the sheer volume of online content. Many research questions in e-tourism are either related to the distribution of a whole or relative importance: How do the most salient contents in hotel Facebook accounts relate to hotel characteristics? What are the dominant topics on TripAdvisor comments about fish freshness in seafood restaurants? How does the relative popularity of search terms in Google relate to destination market share?
In CoDa, most of the basic statistical notions, such as center, variation, association, and distance, are flawed unless they are re-expressed by means of logarithms of ratios. The appeal of log-ratios is that once they are computed, standard statistical methods can be used. On the other hand, since one part can only increase in relative terms if some other(s) decrease, statistics need to be multivariate.
This chapter uses an example based on TripAdvisor hotel reviews from one of the most visited cities worldwide, Barcelona, focusing on what users complain about, to illustrate the main multivariate exploratory and descriptive tools in CoDa, including imputation of zeros prior to computing the log-ratios, multivariate outlier detection, principal component analysis, cluster analysis, and multivariate data visualization tools. The use of CoDaPack, a popular CoDa freeware, is described in a step-by-step fashion
dc.description.sponsorship
The authors acknowledge the support of the Catalan Government for the accreditation as Consolidated Research Groups TURESCO (2017 SGR 49) and COSDA (2017 SGR656). This work was supported by the Spanish Ministry of Economy, Industry and Competitiveness (Grant id.: TURCOLAB ECO2017-88984-R), by the Spanish Ministry of Science, Innovation and Universities and FEDER (Grant id.: CODAMET RTI2018-095518-B-C21)
dc.format.mimetype
application/pdf
dc.language.iso
eng
dc.publisher
Springer
dc.relation
RTI2018-095518-B-C21
dc.relation.isformatof
Versió postprint del capítol de llibre publicat a: https://doi.org/10.1007/978-3-030-48652-5_136
dc.relation.ispartof
© Xiang, Z., Fuchs, M., Gretzel, U., Höpken, W. (eds). (2022). Handbook of e-Tourism. Springer, Cham. https://doi.org/10.1007/978-3-030-48652-5_136, p. 893-917
dc.relation.ispartofseries
Llibres / Capítols de LLibre (D-EC)
dc.rights
Tots els drets reservats
dc.subject
dc.title
Compositional Data Analysis in E-Tourism Research
dc.type
info:eu-repo/semantics/bookPart
dc.rights.accessRights
info:eu-repo/semantics/openAccess
dc.relation.projectID
info:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2017-2020/RTI2018-095518-B-C21/ES/METODOS DEL ANALISIS COMPOSICIONAL DE DATOS/
dc.type.version
info:eu-repo/semantics/acceptedVersion
dc.identifier.doi
dc.contributor.funder
dc.type.peerreviewed
peer-reviewed
dc.relation.FundingProgramme
dc.relation.ProjectAcronym
dc.identifier.eisbn
978-3-030-48652-5