Session 6: Bio and social compositional data

Session 6: Bio and social compositional data http://hdl.handle.net/10256/644 Wed, 13 Aug 2025 07:27:39 GMT 2025-08-13T07:27:39Z Scoring Methods for Ordinal Multidimensional Forced-Choice Items http://hdl.handle.net/10256/744 Scoring Methods for Ordinal Multidimensional Forced-Choice Items Vries, Anton L.M. de; Ark, L. Andries van der‏ Daunis-i-Estadella, Pepus; Martín Fernández, Josep Antoni In most psychological tests and questionnaires, a test score is obtained by taking the sum of the item scores. In virtually all cases where the test or questionnaire contains multidimensional forced-choice items, this traditional scoring method is also applied. We argue that the summation of scores obtained with multidimensional forced-choice items produces uninterpretable test scores. Therefore, we propose three alternative scoring methods: a weak and a strict rank preserving scoring method, which both allow an ordinal interpretation of test scores; and a ratio preserving scoring method, which allows a proportional interpretation of test scores. Each proposed scoring method yields an index for each respondent indicating the degree to which the response pattern is inconsistent. Analysis of real data showed that with respect to rank preservation, the weak and strict rank preserving method resulted in lower inconsistency indices than the traditional scoring method; with respect to ratio preservation, the ratio preserving scoring method resulted in lower inconsistency indices than the traditional scoring method Thu, 29 May 2008 00:00:00 GMT http://hdl.handle.net/10256/744 2008-05-29T00:00:00Z Coherent forecasting of multiple-decrement life tables: a test using Japanese cause of death data http://hdl.handle.net/10256/742 Coherent forecasting of multiple-decrement life tables: a test using Japanese cause of death data Oeppen, Jim Daunis-i-Estadella, Pepus; Martín Fernández, Josep Antoni Planners in public and private institutions would like coherent forecasts of the components of age-specic mortality, such as causes of death. This has been di cult to achieve because the relative values of the forecast components often fail to behave in a way that is coherent with historical experience. In addition, when the group forecasts are combined the result is often incompatible with an all-groups forecast. It has been shown that cause-specic mortality forecasts are pessimistic when compared with all-cause forecasts (Wilmoth, 1995). This paper abandons the conventional approach of using log mortality rates and forecasts the density of deaths in the life table. Since these values obey a unit sum constraint for both conventional single-decrement life tables (only one absorbing state) and multiple-decrement tables (more than one absorbing state), they are intrinsically relative rather than absolute values across decrements as well as ages. Using the methods of Compositional Data Analysis pioneered by Aitchison (1986), death densities are transformed into the real space so that the full range of multivariate statistics can be applied, then back-transformed to positive values so that the unit sum constraint is honoured. The structure of the best-known, single-decrement mortality-rate forecasting model, devised by Lee and Carter (1992), is expressed in compositional form and the results from the two models are compared. The compositional model is extended to a multiple-decrement form and used to forecast mortality by cause of death for Japan Thu, 29 May 2008 00:00:00 GMT http://hdl.handle.net/10256/742 2008-05-29T00:00:00Z Compositional amalgamations and balances: a critical approach http://hdl.handle.net/10256/738 Compositional amalgamations and balances: a critical approach Mateu i Figueras, Glòria; Daunis-i-Estadella, Pepus Daunis-i-Estadella, Pepus; Martín Fernández, Josep Antoni The amalgamation operation is frequently used to reduce the number of parts of compositional data but it is a non-linear operation in the simplex with the usual geometry, the Aitchison geometry. The concept of balances between groups, a particular coordinate system designed over binary partitions of the parts, could be an alternative to the amalgamation in some cases. In this work we discuss the proper application of both concepts using a real data set corresponding to behavioral measures of pregnant sows Thu, 29 May 2008 00:00:00 GMT http://hdl.handle.net/10256/738 2008-05-29T00:00:00Z Hardy-Weinberg Equilibrium and the Ternary Plot http://hdl.handle.net/10256/737 Hardy-Weinberg Equilibrium and the Ternary Plot Graffelman, Jan Daunis-i-Estadella, Pepus; Martín Fernández, Josep Antoni The Hardy-Weinberg law, formulated about 100 years ago, states that under certain assumptions, the three genotypes AA, AB and BB at a bi-allelic locus are expected to occur in the proportions p2, 2pq, and q2 respectively, where p is the allele frequency of A, and q = 1-p. There are many statistical tests being used to check whether empirical marker data obeys the Hardy-Weinberg principle. Among these are the classical xi-square test (with or without continuity correction), the likelihood ratio test, Fisher's Exact test, and exact tests in combination with Monte Carlo and Markov Chain algorithms. Tests for Hardy-Weinberg equilibrium (HWE) are numerical in nature, requiring the computation of a test statistic and a p-value. There is however, ample space for the use of graphics in HWE tests, in particular for the ternary plot. Nowadays, many genetical studies are using genetical markers known as Single Nucleotide Polymorphisms (SNPs). SNP data comes in the form of counts, but from the counts one typically computes genotype frequencies and allele frequencies. These frequencies satisfy the unit-sum constraint, and their analysis therefore falls within the realm of compositional data analysis (Aitchison, 1986). SNPs are usually bi-allelic, which implies that the genotype frequencies can be adequately represented in a ternary plot. Compositions that are in exact HWE describe a parabola in the ternary plot. Compositions for which HWE cannot be rejected in a statistical test are typically “close" to the parabola, whereas compositions that differ significantly from HWE are “far". By rewriting the statistics used to test for HWE in terms of heterozygote frequencies, acceptance regions for HWE can be obtained that can be depicted in the ternary plot. This way, compositions can be tested for HWE purely on the basis of their position in the ternary plot (Graffelman & Morales, 2008). This leads to nice graphical representations where large numbers of SNPs can be tested for HWE in a single graph. Several examples of graphical tests for HWE (implemented in R software), will be shown, using SNP data from different human populations Thu, 29 May 2008 00:00:00 GMT http://hdl.handle.net/10256/737 2008-05-29T00:00:00Z