Instance optimal learning of discrete distributions
暂无分享,去创建一个
[1] Alex Samorodnitsky,et al. Approximating entropy from sublinear samples , 2007, SODA '07.
[2] B. Efron,et al. Estimating the number of unseen species: How many words did Shakespeare know? Biometrika 63 , 1976 .
[3] Alon Orlitsky,et al. 25th Annual Conference on Learning Theory Competitive Classification and Closeness Testing , 2022 .
[4] James Zou,et al. Quantifying the unobserved protein-coding variants in human populations provides a roadmap for large-scale sequencing projects , 2015, bioRxiv.
[5] Daniel M. Kane,et al. Testing Identity of Structured Distributions , 2014, SODA.
[6] Dana Ron,et al. Property testing and its connection to learning and approximation , 1998, JACM.
[7] Sudipto Guha,et al. Streaming and sublinear approximation of entropy and information distances , 2005, SODA '06.
[8] Gabor T. Marth,et al. A global reference for human genetic variation , 2015, Nature.
[9] Alon Orlitsky,et al. Optimal Probability Estimation with Applications to Prediction and Classification , 2013, COLT.
[10] Moses Charikar,et al. On the Advantage over Random for Maximum Acyclic Subgraph , 2007, 48th Annual IEEE Symposium on Foundations of Computer Science (FOCS'07).
[11] Yanjun Han,et al. Minimax Estimation of Functionals of Discrete Distributions , 2014, IEEE Transactions on Information Theory.
[12] Ronitt Rubinfeld,et al. The complexity of approximating entropy , 2002, STOC '02.
[13] Alon Orlitsky,et al. Always Good Turing: Asymptotically Optimal Probability Estimation , 2003, Science.
[14] F ChenStanley,et al. An Empirical Study of Smoothing Techniques for Language Modeling , 1996, ACL.
[15] David A. McAllester,et al. On the Convergence Rate of Good-Turing Estimators , 2000, COLT.
[16] Liam Paninski,et al. Estimating entropy on m bins given fewer than m samples , 2004, IEEE Transactions on Information Theory.
[17] Ronitt Rubinfeld,et al. Testing random variables for independence and identity , 2001, Proceedings 2001 IEEE International Conference on Cluster Computing.
[18] Paul Valiant,et al. Estimating the Unseen , 2013, NIPS.
[19] I. Good. THE POPULATION FREQUENCIES OF SPECIES AND THE ESTIMATION OF POPULATION PARAMETERS , 1953 .
[20] William A. Gale,et al. Good-Turing Frequency Estimation Without Tears , 1995, J. Quant. Linguistics.
[21] Graham Cormode,et al. A near-optimal algorithm for computing the entropy of a stream , 2007, SODA '07.
[22] Ronitt Rubinfeld,et al. Testing that distributions are close , 2000, Proceedings 41st Annual Symposium on Foundations of Computer Science.
[23] I. Good,et al. THE NUMBER OF NEW SPECIES, AND THE INCREASE IN POPULATION COVERAGE, WHEN A SAMPLE IS INCREASED , 1956 .
[24] Rocco A. Servedio,et al. Explorer Efficient Density Estimation via Piecewise Polynomial Approximation , 2013 .
[25] Paul Valiant. Testing symmetric properties of distributions , 2008, STOC '08.
[26] Alon Orlitsky,et al. A Competitive Test for Uniformity of Monotone Distributions , 2013, AISTATS.
[27] P. McCullagh. Estimating the Number of Unseen Species: How Many Words did Shakespeare Know? , 2008 .
[28] Gregory Valiant,et al. An Automatic Inequality Prover and Instance Optimal Identity Testing , 2014, 2014 IEEE 55th Annual Symposium on Foundations of Computer Science.
[29] Ronitt Rubinfeld,et al. Sublinear algorithms for testing monotone and unimodal distributions , 2004, STOC '04.
[30] Alon Orlitsky,et al. Competitive Distribution Estimation: Why is Good-Turing Good , 2015, NIPS.
[31] Alon Orlitsky,et al. Competitive Closeness Testing , 2011, COLT.
[32] Gregory Valiant,et al. Estimating the unseen: an n/log(n)-sample estimator for entropy and support size, shown optimal via new CLTs , 2011, STOC '11.
[33] Gregory Valiant,et al. The Power of Linear Estimators , 2011, 2011 IEEE 52nd Annual Symposium on Foundations of Computer Science.
[34] Gregory Valiant,et al. Quantifying the unobserved protein-coding variants in human populations provides a roadmap for large-scale sequencing projects , 2015 .
[35] Dana Ron,et al. Strong Lower Bounds for Approximating Distribution Support Size and the Distinct Elements Problem , 2007, 48th Annual IEEE Symposium on Foundations of Computer Science (FOCS'07).
[36] Joseph K. Pickrell,et al. A Systematic Survey of Loss-of-Function Variants in Human Protein-Coding Genes , 2012, Science.
[37] Yishay Mansour,et al. Concentration Bounds for Unigrams Language Model , 2005, COLT.
[38] Ilias Diakonikolas,et al. Optimal Algorithms for Testing Closeness of Discrete Distributions , 2013, SODA.
[39] Rocco A. Servedio,et al. Testing k-Modal Distributions: Optimal Algorithms via Reductions , 2011, SODA.