copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Estimating the number of species to attain sufficient representation in a random sample

C. Deng, and A. Smith. ArXiv e-prints, (July 2016)

Abstract

The statistical problem of using an initial sample to estimate the number of unique species in a larger sample has found important applications in fields far removed from ecology. Here we address the general problem of estimating the number of species that will be represented at least r times, for any r ≥ 1, in a future sample. We derive a procedure to construct estimators that apply universally for a given population: once constructed, they can be evaluated as a simple function of r. Our approach is based on a relationship between the number of species represented at least r times and the higher derivatives of the number of unique species seen per unit of sampling. We further show the estimators retain asymptotic behaviors that are essential for applications on large-scale data sets and for large r. We validate practical performance of this approach by applying it to analyze Dickens’ vocabulary, the topology of a Twitter social network, and DNA sequencing data.

Links and resources

BibTeX key: deng2016estimating
entry type: article
year: 2016
month: jul
journal: ArXiv e-prints
eprint: 1607.02804
adsurl: http://adsabs.harvard.edu/abs/2016arXiv160702804D
archiveprefix: arXiv
adsnote: Provided by the SAO/NASA Astrophysics Data System
primaryclass: stat.ME

BibSonomy

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Estimating the number of species to attain sufficient representation in a random sample

Abstract

Links and resources

Tags

Cite this publication

More citation styles

search on

Meta data

Comments and Reviews
(0)

BibSonomy

copydeleteadd this publication to your clipboardcommunity posthistory of this postURLDOIBibTeXEndNoteAPAChicagoDIN 1505HarvardMSOffice XML Estimating the number of species to attain sufficient representation in a random sample

Abstract

Links and resources

Tags

Cite this publication

More citation styles

search on

Meta data

Comments and Reviews (0)

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Estimating the number of species to attain sufficient representation in a random sample

Comments and Reviews
(0)