<?xml version="1.0" encoding="UTF-8"?>
<feed xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns="http://www.w3.org/2005/Atom">
<title>Session 6: Bio and social compositional data</title>
<link href="http://hdl.handle.net/10256/644" rel="alternate"/>
<subtitle/>
<id>http://hdl.handle.net/10256/644</id>
<updated>2013-05-19T10:24:50Z</updated>
<dc:date>2013-05-19T10:24:50Z</dc:date>
<entry>
<title>Scoring Methods for Ordinal Multidimensional Forced-Choice Items</title>
<link href="http://hdl.handle.net/10256/744" rel="alternate"/>
<author>
<name>De Vries, Anton L.M.</name>
</author>
<author>
<name>Van der Ark, L. Andries</name>
</author>
<id>http://hdl.handle.net/10256/744</id>
<updated>2012-06-28T12:30:36Z</updated>
<published>2008-05-29T00:00:00Z</published>
<summary type="text">Scoring Methods for Ordinal Multidimensional Forced-Choice Items
De Vries, Anton L.M.; Van der Ark, L. Andries
Daunis i Estadella, Josep; Martín Fernández, Josep Antoni
In most psychological tests and questionnaires, a test score is obtained by&#13;
taking the sum of the item scores. In virtually all cases where the test or&#13;
questionnaire contains multidimensional forced-choice items, this traditional&#13;
scoring method is also applied. We argue that the summation of scores obtained with multidimensional forced-choice items produces uninterpretable&#13;
test scores. Therefore, we propose three alternative scoring methods: a weak&#13;
and a strict rank preserving scoring method, which both allow an ordinal&#13;
interpretation of test scores; and a ratio preserving scoring method, which&#13;
allows a proportional interpretation of test scores. Each proposed scoring&#13;
method yields an index for each respondent indicating the degree to which&#13;
the response pattern is inconsistent. Analysis of real data showed that with&#13;
respect to rank preservation, the weak and strict rank preserving method&#13;
resulted in lower inconsistency indices than the traditional scoring method;&#13;
with respect to ratio preservation, the ratio preserving scoring method resulted in lower inconsistency indices than the traditional scoring method
</summary>
<dc:date>2008-05-29T00:00:00Z</dc:date>
</entry>
<entry>
<title>Coherent forecasting of multiple-decrement life tables: a test using Japanese cause of death data</title>
<link href="http://hdl.handle.net/10256/742" rel="alternate"/>
<author>
<name>Oeppen, Jim</name>
</author>
<id>http://hdl.handle.net/10256/742</id>
<updated>2012-06-28T12:30:36Z</updated>
<published>2008-05-29T00:00:00Z</published>
<summary type="text">Coherent forecasting of multiple-decrement life tables: a test using Japanese cause of death data
Oeppen, Jim
Daunis i Estadella, Josep; Martín Fernández, Josep Antoni
Planners in public and private institutions would like coherent forecasts of the components of age-specic mortality, such as causes of death. This has been di cult to&#13;
achieve because the relative values of the forecast components often fail to behave in&#13;
a way that is coherent with historical experience. In addition, when the group forecasts are combined the result is often incompatible with an all-groups forecast. It has&#13;
been shown that cause-specic mortality forecasts are pessimistic when compared with&#13;
all-cause forecasts (Wilmoth, 1995). This paper abandons the conventional approach&#13;
of using log mortality rates and forecasts the density of deaths in the life table. Since&#13;
these values obey a unit sum constraint for both conventional single-decrement life tables (only one absorbing state) and multiple-decrement tables (more than one absorbing&#13;
state), they are intrinsically relative rather than absolute values across decrements as&#13;
well as ages. Using the methods of Compositional Data Analysis pioneered by Aitchison&#13;
(1986), death densities are transformed into the real space so that the full range of multivariate statistics can be applied, then back-transformed to positive values so that the&#13;
unit sum constraint is honoured. The structure of the best-known, single-decrement&#13;
mortality-rate forecasting model, devised by Lee and Carter (1992), is expressed in&#13;
compositional form and the results from the two models are compared. The compositional model is extended to a multiple-decrement form and used to forecast mortality&#13;
by cause of death for Japan
</summary>
<dc:date>2008-05-29T00:00:00Z</dc:date>
</entry>
<entry>
<title>Compositional amalgamations and balances: a critical approach</title>
<link href="http://hdl.handle.net/10256/738" rel="alternate"/>
<author>
<name>Mateu i Figueras, Glòria</name>
</author>
<author>
<name>Daunis i Estadella, Josep</name>
</author>
<id>http://hdl.handle.net/10256/738</id>
<updated>2012-06-28T12:30:36Z</updated>
<published>2008-05-29T00:00:00Z</published>
<summary type="text">Compositional amalgamations and balances: a critical approach
Mateu i Figueras, Glòria; Daunis i Estadella, Josep
Daunis i Estadella, Josep; Martín Fernández, Josep Antoni
The amalgamation operation is frequently used to reduce the number of parts of compositional data but it is a non-linear operation in the simplex with the usual geometry,&#13;
the Aitchison geometry. The concept of balances between groups, a particular coordinate system designed over binary partitions of the parts, could be an alternative to the&#13;
amalgamation in some cases. In this work we discuss the proper application of both&#13;
concepts using a real data set corresponding to behavioral measures of pregnant sows
</summary>
<dc:date>2008-05-29T00:00:00Z</dc:date>
</entry>
<entry>
<title>Hardy-Weinberg Equilibrium and the Ternary Plot</title>
<link href="http://hdl.handle.net/10256/737" rel="alternate"/>
<author>
<name>Graffelman, Jan</name>
</author>
<id>http://hdl.handle.net/10256/737</id>
<updated>2012-06-28T12:30:36Z</updated>
<published>2008-05-29T00:00:00Z</published>
<summary type="text">Hardy-Weinberg Equilibrium and the Ternary Plot
Graffelman, Jan
Daunis i Estadella, Josep; Martín Fernández, Josep Antoni
The Hardy-Weinberg law, formulated about 100 years ago, states that under certain&#13;
assumptions, the three genotypes AA, AB and BB at a bi-allelic locus are expected to occur in&#13;
the proportions p2, 2pq, and q2 respectively, where p is the allele frequency of A, and q = 1-p.&#13;
There are many statistical tests being used to check whether empirical marker data obeys the&#13;
Hardy-Weinberg principle. Among these are the classical xi-square test (with or without&#13;
continuity correction), the likelihood ratio test, Fisher's Exact test, and exact tests in combination&#13;
with Monte Carlo and Markov Chain algorithms. Tests for Hardy-Weinberg equilibrium (HWE)&#13;
are numerical in nature, requiring the computation of a test statistic and a p-value.&#13;
There is however, ample space for the use of graphics in HWE tests, in particular for the ternary&#13;
plot. Nowadays, many genetical studies are using genetical markers known as Single&#13;
Nucleotide Polymorphisms (SNPs). SNP data comes in the form of counts, but from the counts&#13;
one typically computes genotype frequencies and allele frequencies. These frequencies satisfy&#13;
the unit-sum constraint, and their analysis therefore falls within the realm of compositional data&#13;
analysis (Aitchison, 1986). SNPs are usually bi-allelic, which implies that the genotype&#13;
frequencies can be adequately represented in a ternary plot. Compositions that are in exact&#13;
HWE describe a parabola in the ternary plot. Compositions for which HWE cannot be rejected in&#13;
a statistical test are typically “close" to the parabola, whereas compositions that differ&#13;
significantly from HWE are “far". By rewriting the statistics used to test for HWE in terms of&#13;
heterozygote frequencies, acceptance regions for HWE can be obtained that can be depicted in&#13;
the ternary plot. This way, compositions can be tested for HWE purely on the basis of their&#13;
position in the ternary plot (Graffelman &amp; Morales, 2008). This leads to nice graphical&#13;
representations where large numbers of SNPs can be tested for HWE in a single graph. Several&#13;
examples of graphical tests for HWE (implemented in R software), will be shown, using SNP&#13;
data from different human populations
</summary>
<dc:date>2008-05-29T00:00:00Z</dc:date>
</entry>
<entry>
<title>“Unmixing” Tissue Gene Expression Signatures from Tumor Biopsies</title>
<link href="http://hdl.handle.net/10256/736" rel="alternate"/>
<author>
<name>Billheimer, Dean</name>
</author>
<id>http://hdl.handle.net/10256/736</id>
<updated>2012-06-28T12:30:36Z</updated>
<published>2008-05-29T00:00:00Z</published>
<summary type="text">“Unmixing” Tissue Gene Expression Signatures from Tumor Biopsies
Billheimer, Dean
Daunis i Estadella, Josep; Martín Fernández, Josep Antoni
Emergent molecular measurement methods, such as DNA microarray, qRTPCR, and&#13;
many others, offer tremendous promise for the personalized treatment of cancer. These&#13;
technologies measure the amount of specific proteins, RNA, DNA or other molecular&#13;
targets from tumor specimens with the goal of “fingerprinting” individual cancers. Tumor&#13;
specimens are heterogeneous; an individual specimen typically contains unknown&#13;
amounts of multiple tissues types. Thus, the measured molecular concentrations result&#13;
from an unknown mixture of tissue types, and must be normalized to account for the&#13;
composition of the mixture.&#13;
For example, a breast tumor biopsy may contain normal, dysplastic and cancerous&#13;
epithelial cells, as well as stromal components (fatty and connective tissue) and blood&#13;
and lymphatic vessels. Our diagnostic interest focuses solely on the dysplastic and&#13;
cancerous epithelial cells. The remaining tissue components serve to “contaminate”&#13;
the signal of interest. The proportion of each of the tissue components changes as&#13;
a function of patient characteristics (e.g., age), and varies spatially across the tumor&#13;
region. Because each of the tissue components produces a different molecular signature,&#13;
and the amount of each tissue type is specimen dependent, we must estimate the tissue&#13;
composition of the specimen, and adjust the molecular signal for this composition.&#13;
Using the idea of a chemical mass balance, we consider the total measured concentrations&#13;
to be a weighted sum of the individual tissue signatures, where weights&#13;
are determined by the relative amounts of the different tissue types. We develop a&#13;
compositional source apportionment model to estimate the relative amounts of tissue&#13;
components in a tumor specimen. We then use these estimates to infer the tissuespecific&#13;
concentrations of key molecular targets for sub-typing individual tumors. We&#13;
anticipate these specific measurements will greatly improve our ability to discriminate&#13;
between different classes of tumors, and allow more precise matching of each patient to&#13;
the appropriate treatment
</summary>
<dc:date>2008-05-29T00:00:00Z</dc:date>
</entry>
</feed>
