where
is a parametric density function with location parameter
. We then derive the
corresponding reference posterior distribution
, and use this to obtain a
Bayesian kernel estimate
of the unknown distribution as the
corresponding reference predictive density
We discuss some simulated examples and gauge the corresponding results using the logarithmic divergence
as an appropriate loss function.
In this comunication we extend the previous characterization to sequences of exchangeable, arbitrary random elements.
Moreover we use this extension to revisit well known characterizations of the laws of a Dirichlet process and of a Pólya tree process.
To achieve this aim, each explanatory variable is associated to the partition of the observations induced by its levels. We then consider, conditionally on each partition, a hierarchical Bayesian model on the hazard rates. We then derive as a measure of the importance of each prognostic factor, the posterior probability of each partition. Such probabilities are finally employed to estimate the hazard functions by averaging the estimated conditional hazard over the set of all partitions.
More specifically, for a collection of individuals, , let
be a random variable
representing the
failure time of subject i, possibly censored by
, and let
.
In order to better illustrate and interpretate our proposed
methodology, we shall consider a collection of discrete failure times:
, as in Hjort (1990)
and let the individual hazard at time
be the following
Given a collection of observed discrete times, the likelihood of the cumulative
hazard functions
is:
where
,
and
.
Let now g indicate a partition of the index set
with
subsets
,
for
.
In other words, g defines a partition of the subjects,
assuming that the hazards of the individuals in the same subset,
,
are equal.
The consideration of a partition of the individuals
leads to a new likelihood for , which is conditional upon g:
where
is the number of failures in the interval
for group j,
is the number of subjects at risk in the interval
for group
j.
In our exploratory approach, we shall entertain several partition structures. Each partition corresponds to a specific stratification of a potential prognostic factor. This amounts to consider a collection of alternative partial exchangeability structures for the survival times, as suggested in the context of binomial studies by Consonni and Veronese (1995).
Our first aim is to
evaluate the importance of each prognostic factor.
This can be achieved calculating, given the observed
evidence D,
the posterior probability of each partition, p(g|D).
Our second aim is to estimate the
hazard function,
in order to make predictions on survival
times.
This task can be performed
in two steps: first we work conditionally
on a partition g, and determine a Bayesian estimate
of each individual hazard, by calculating
the posterior means .
The second step of the estimation procedure
involves using p(g|D) to calculate the marginal posterior
expectation
of each individual hazard via the law of total probabilities:
In order to perform the above computation two alternative
prior specifications on will be considered.
First we take, conditionally on a partition g,
each
as a stochastic process, with
independent summands, each of which is distributed as follows:
for
and for
and
with
and
positive constants.
We then consider a Markovian dependency structure among the hazards in
different time intervals, similarly to what suggested in Arjas (1994).
This prior specification has the advantage of requiring a fewer number of
hyperparameters
and to allow borrowing
strength between time intervals, thus
making the cumulative
hazards in marginally dependent.
The proposed class of
prior distribution corresponds to consider the hazard increments as
exchangeable.
The two resulting methodologies are applied and compared, using appropriate MCMC method on both simulated and observed survival data.
Essential References
Arjas, E. and Gasbarra, D. (1994). Nonparametric Bayesian inference from right censored survival data, using the Gibbs Sampler. Statistica Sinica 4, 505-524.
Consonni, G. and Veronese, P. (1995). A Bayesian method for combining results from several binomial experiments. Journal of the American Statistical Association 90, 935-944.
Hjort, N. L. (1990). Non parametric Bayes estimators based on beta processes in models for life history data. Annals of Statistics 18, 1259-1294.