|Authors:||Heather Battey and Oliver Linton|
|Date:||22 August 2013|
|Type:||cemmap Working Paper, CWP41/13|
This paper considers the class of p-dimensional elliptic distributions (p ≥ 1) satisfying the consistency property (Kano, 1994) and within this general frame work presents a two-stage semiparametric estimator for the Lebesgue density based on Gaussian mixture sieves. Under the online Exponentiated Gradient (EG) algorithm of Helmbold et al. (1997) and without restricting the mixing measure to have compact support, the estimator produces estimates converging uniformly in probability to the true elliptic density at a rate that is independent of the dimension of the problem, hence circumventing the familiar curse of dimensionality inherent to many semiparametric estimators. The rate performance of our estimator depends on the tail behaviour of the underlying mixing density (and hence that of the data) rather than smoothness properties. In fact, our method achieves a rate of at least Op(n-1/4), provided only some positive moment exists. When further moments exists, the rate improves reaching Op(n-3/8) as the tails of the true density converge to those of a normal. Unlike the elliptic density estimator of Liebscher (2005), our sieve estimator always yields an estimate that is a valid density, and is also attractive from a practical perspective as it accepts data as a stream, thus significantly reducing computational and storage requirements. Monte Carlo experimentation indicates encouraging finite sample performance over a range of elliptic densities. The estimator is also implemented in a binary classification task using the well-known Wisconsin breast cancer dataset.
This paper is forthcoming in the The Journal of Multivariate AnalysisDownload full version