<?xml version="1.0" encoding="UTF-8"?>
<feed xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns="http://www.w3.org/2005/Atom">
<title>Session 0: About zeroes</title>
<link href="http://hdl.handle.net/10256/638" rel="alternate"/>
<subtitle/>
<id>http://hdl.handle.net/10256/638</id>
<updated>2013-05-26T08:08:46Z</updated>
<dc:date>2013-05-26T08:08:46Z</dc:date>
<entry>
<title>Bayesian tools for zero counts in compositional data</title>
<link href="http://hdl.handle.net/10256/713" rel="alternate"/>
<author>
<name>Daunis i Estadella, Josep</name>
</author>
<author>
<name>Martín Fernández, Josep Antoni</name>
</author>
<author>
<name>Palarea Albaladejo, Javier</name>
</author>
<id>http://hdl.handle.net/10256/713</id>
<updated>2012-11-19T08:54:39Z</updated>
<published>2008-05-27T00:00:00Z</published>
<summary type="text">Bayesian tools for zero counts in compositional data
Daunis i Estadella, Josep; Martín Fernández, Josep Antoni; Palarea Albaladejo, Javier
Daunis i Estadella, Josep; Martín Fernández, Josep Antoni
The log-ratio methodology makes available powerful tools for analyzing compositional&#13;
data. Nevertheless, the use of this methodology is only possible for those data sets&#13;
without null values. Consequently, in those data sets where the zeros are present, a&#13;
previous treatment becomes necessary. Last advances in the treatment of compositional&#13;
zeros have been centered especially in the zeros of structural nature and in the rounded&#13;
zeros. These tools do not contemplate the particular case of count compositional data&#13;
sets with null values. In this work we deal with \count zeros" and we introduce a&#13;
treatment based on a mixed Bayesian-multiplicative estimation. We use the Dirichlet&#13;
probability distribution as a prior and we estimate the posterior probabilities. Then we&#13;
apply a multiplicative modi¯cation for the non-zero values. We present a case study&#13;
where this new methodology is applied.&#13;
Key words: count data, multiplicative replacement, composition, log-ratio analysis
</summary>
<dc:date>2008-05-27T00:00:00Z</dc:date>
</entry>
<entry>
<title>Discrete and continuous compositions</title>
<link href="http://hdl.handle.net/10256/712" rel="alternate"/>
<author>
<name>Bacon Shone, John</name>
</author>
<id>http://hdl.handle.net/10256/712</id>
<updated>2012-06-28T12:30:36Z</updated>
<published>2008-05-27T00:00:00Z</published>
<summary type="text">Discrete and continuous compositions
Bacon Shone, John
Daunis i Estadella, Josep; Martín Fernández, Josep Antoni
This paper examines a dataset which is modeled well by the&#13;
Poisson-Log Normal process and by this process mixed with Log&#13;
Normal data, which are both turned into compositions. This&#13;
generates compositional data that has zeros without any need for&#13;
conditional models or assuming that there is missing or censored&#13;
data that needs adjustment. It also enables us to model dependence&#13;
on covariates and within the composition
</summary>
<dc:date>2008-05-27T00:00:00Z</dc:date>
</entry>
<entry>
<title>When zero doesn't mean it and other geomathematical mischief</title>
<link href="http://hdl.handle.net/10256/711" rel="alternate"/>
<author>
<name>Valls Alvarez, Ricardo.A.</name>
</author>
<id>http://hdl.handle.net/10256/711</id>
<updated>2013-01-08T10:53:15Z</updated>
<published>2008-05-27T00:00:00Z</published>
<summary type="text">When zero doesn't mean it and other geomathematical mischief
Valls Alvarez, Ricardo.A.
Daunis i Estadella, Josep; Martín Fernández, Josep Antoni
There is almost not a case in exploration geology, where the studied data doesn’t&#13;
includes below detection limits and/or zero values, and since most of the geological data&#13;
responds to lognormal distributions, these “zero data” represent a mathematical&#13;
challenge for the interpretation.&#13;
We need to start by recognizing that there are zero values in geology. For example the&#13;
amount of quartz in a foyaite (nepheline syenite) is zero, since quartz cannot co-exists&#13;
with nepheline. Another common essential zero is a North azimuth, however we can&#13;
always change that zero for the value of 360°. These are known as “Essential zeros”, but&#13;
what can we do with “Rounded zeros” that are the result of below the detection limit of&#13;
the equipment?&#13;
Amalgamation, e.g. adding Na2O and K2O, as total alkalis is a solution, but sometimes&#13;
we need to differentiate between a sodic and a potassic alteration. Pre-classification into&#13;
groups requires a good knowledge of the distribution of the data and the geochemical&#13;
characteristics of the groups which is not always available. Considering the zero values&#13;
equal to the limit of detection of the used equipment will generate spurious&#13;
distributions, especially in ternary diagrams. Same situation will occur if we replace the&#13;
zero values by a small amount using non-parametric or parametric techniques&#13;
(imputation).&#13;
The method that we are proposing takes into consideration the well known relationships&#13;
between some elements. For example, in copper porphyry deposits, there is always a&#13;
good direct correlation between the copper values and the molybdenum ones, but while&#13;
copper will always be above the limit of detection, many of the molybdenum values will&#13;
be “rounded zeros”. So, we will take the lower quartile of the real molybdenum values&#13;
and establish a regression equation with copper, and then we will estimate the&#13;
“rounded” zero values of molybdenum by their corresponding copper values.&#13;
The method could be applied to any type of data, provided we establish first their&#13;
correlation dependency.&#13;
One of the main advantages of this method is that we do not obtain a fixed value for the&#13;
“rounded zeros”, but one that depends on the value of the other variable.&#13;
Key words: compositional data analysis, treatment of zeros, essential zeros, rounded&#13;
zeros, correlation dependency
</summary>
<dc:date>2008-05-27T00:00:00Z</dc:date>
</entry>
<entry>
<title>Inference of distributional parameters from compositional samples containing nondetects</title>
<link href="http://hdl.handle.net/10256/708" rel="alternate"/>
<author>
<name>Olea, Ricardo A.</name>
</author>
<id>http://hdl.handle.net/10256/708</id>
<updated>2012-06-28T12:30:36Z</updated>
<published>2008-05-27T00:00:00Z</published>
<summary type="text">Inference of distributional parameters from compositional samples containing nondetects
Olea, Ricardo A.
Daunis i Estadella, Josep; Martín Fernández, Josep Antoni
Low concentrations of elements in geochemical analyses have the peculiarity of being&#13;
compositional data and, for a given level of significance, are likely to be beyond the&#13;
capabilities of laboratories to distinguish between minute concentrations and complete&#13;
absence, thus preventing laboratories from reporting extremely low concentrations of the&#13;
analyte. Instead, what is reported is the detection limit, which is the minimum&#13;
concentration that conclusively differentiates between presence and absence of the&#13;
element. A spatially distributed exhaustive sample is employed in this study to generate&#13;
unbiased sub-samples, which are further censored to observe the effect that different&#13;
detection limits and sample sizes have on the inference of population distributions&#13;
starting from geochemical analyses having specimens below detection limit (nondetects).&#13;
The isometric logratio transformation is used to convert the compositional data in the&#13;
simplex to samples in real space, thus allowing the practitioner to properly borrow from&#13;
the large source of statistical techniques valid only in real space. The bootstrap method is&#13;
used to numerically investigate the reliability of inferring several distributional&#13;
parameters employing different forms of imputation for the censored data. The case&#13;
study illustrates that, in general, best results are obtained when imputations are made&#13;
using the distribution best fitting the readings above detection limit and exposes the&#13;
problems of other more widely used practices. When the sample is spatially correlated, it&#13;
is necessary to combine the bootstrap with stochastic simulation
</summary>
<dc:date>2008-05-27T00:00:00Z</dc:date>
</entry>
</feed>
