论文信息 - Statistical Analysis and Design of Crowdsourcing Applications - 字舞流文

Statistical Analysis and Design of Crowdsourcing Applications

STATISTICAL ANALYSIS AND DESIGN OF CROWDSOURCING APPLICATIONS

Adam Kapelner | A. Kapelner

[1] Chris Callison-Burch,et al. Creating Speech and Language Data With Amazon’s Mechanical Turk , 2010, Mturk@HLT-NAACL.

[2] J. Friedman. Stochastic gradient boosting , 2002 .

[3] R. Larsen,et al. The Satisfaction with Life Scale , 1985, Journal of personality assessment.

[4] Adam Kapelner,et al. Bayesian Additive Regression Trees With Parametric Models of Heteroskedasticity , 2014 .

[5] Stanford E. Taylor. Eye Movements in Reading: Facts and Fallacies , 1965 .

[6] J. Angrist,et al. Estimation of Limited Dependent Variable Models With Dummy Endogenous Regressors , 2001 .

[7] C. Keyes,et al. The structure of psychological well-being revisited. , 1995, Journal of personality and social psychology.

[8] C. Assaid,et al. The Theory of Response-Adaptive Randomization in Clinical Trials , 2007 .

[9] Carlo Strapparava,et al. Learning to identify emotions in text , 2008, SAC '08.

[10] D. Rubin,et al. MULTIPLE IMPUTATIONS IN SAMPLE SURVEYS-A PHENOMENOLOGICAL BAYESIAN APPROACH TO NONRESPONSE , 2002 .

[11] R. Tibshirani,et al. Bayesian Backfitting , 1998 .

[12] Marion K Campbell,et al. The method of minimization for allocation to clinical trials. a review. , 2002, Controlled clinical trials.

[13] J. Rowe,et al. Human aging: usual and successful. , 1987, Science.

[14] S. Kruger. Design Of Observational Studies , 2016 .

[15] Susan Holmes,et al. Quantitative, Architectural Analysis of Immune Cell Subsets in Tumor-Draining Lymph Nodes from Breast Cancer Patients and Healthy Lymph Nodes , 2010, PloS one.

[16] S. Chib,et al. Bayesian analysis of binary and polychotomous response data , 1993 .

[17] Cindy K. Chung,et al. The development and psychometric properties of LIWC2007 , 2007 .

[18] Lydia B. Chilton,et al. Task search in a human computation market , 2010, HCOMP '10.

[19] Dana Chandler,et al. Breaking Monotony with Meaning: Motivation in Crowdsourcing Markets , 2012, ArXiv.

[20] Jeffrey H Silber,et al. Optimal multivariate matching before randomization. , 2004, Biostatistics.

[21] Moses Abramovitz. Economic Growth and Its Discontents , 1973 .

[22] Scott A. Golder,et al. Diurnal and Seasonal Mood Vary with Work, Sleep, and Daylength Across Diverse Cultures , 2011 .

[23] H. Chipman,et al. BART: Bayesian Additive Regression Trees , 2008, 0806.3286.

[24] Peter Bühlmann,et al. MissForest - non-parametric missing value imputation for mixed-type data , 2011, Bioinform..

[25] Wendy McColskey,et al. Measuring Student Engagement in Upper Elementary through High School: A Description of 21 Instruments. Summary. Issues & Answers. REL 2011-No. 098. , 2011 .

[26] T. Cook,et al. Quasi-experimentation: Design & analysis issues for field settings , 1979 .

[27] J. R. Landis,et al. The measurement of observer agreement for categorical data. , 1977, Biometrics.

[28] B. Efron. Forcing a sequential experiment to be balanced , 1971 .

[29] Adam D. I. Kramer. An unobtrusive behavioral model of "gross national happiness" , 2010, CHI.

[30] S. Raudenbush,et al. Strategies for Improving Precision in Group-Randomized Experiments , 2007 .

[31] R Core Team,et al. R: A language and environment for statistical computing. , 2014 .

[32] Mike Wells,et al. Structured Models for Fine-to-Coarse Sentiment Analysis , 2007, ACL.

[33] Anne Elizabeth Preston,et al. The Nonprofit Worker in a For-Profit World , 1989, Journal of Labor Economics.

[34] Mark Stevenson,et al. Introduction to the special issue on word sense disambiguation , 2004, Comput. Speech Lang..

[35] J. Krosnick. Response strategies for coping with the cognitive demands of attitude measures in surveys , 1991 .

[36] Susan Holmes,et al. An Interactive Java Statistical Image Segmentation System: GemIdent. , 2009, Journal of statistical software.

[37] James K. Harter,et al. INTERPERSONAL RELATIONS AND GROUP PROCESSES Wealth and Happiness Across the World : Material Prosperity Predicts Life Evaluation , Whereas Psychosocial Prosperity Predicts Positive Feeling , 2010 .

[38] Jeffrey S. Simonoff,et al. An Investigation of Missing Data Methods for Classification Trees , 2006, J. Mach. Learn. Res..

[39] H. Friedman. The Oxford Handbook of Health Psychology , 2013 .

[40] Pushmeet Kohli,et al. Personality and patterns of Facebook usage , 2012, WebSci '12.

[41] Adam Kapelner,et al. Prediction with missing data via Bayesian Additive Regression Trees , 2013, ArXiv.

[42] Wei-Yin Loh,et al. Classification and regression trees , 2011, WIREs Data Mining Knowl. Discov..

[43] Jerome H. Friedman. Multivariate adaptive regression splines (with discussion) , 1991 .

[44] E. M. Elderton. THE LANARKSHIRE MILK EXPERIMENT , 1933 .

[45] Nancy Ide,et al. Introduction to the Special Issue on Word Sense Disambiguation: The State of the Art , 1998, Comput. Linguistics.

[46] Hsin-Hsi Chen,et al. Emotion Modeling from Writer/Reader Perspectives Using a Microblog Dataset , 2011 .

[47] Edward I. George,et al. Variable selection for BART: An application to gene regulation , 2013, 1310.4887.

[48] James J. Appleton,et al. Student engagement with school: Critical conceptual and methodological issues of the construct. , 2008 .

[49] Shein-Chung Chow,et al. Adaptive design methods in clinical trials – a review , 2008, Orphanet journal of rare diseases.

[50] David G. Rand,et al. The online laboratory: conducting experiments in a real labor market , 2010, ArXiv.

[51] K. Mealey,et al. Clinical pharmacology and therapeutics. , 2013, The Veterinary clinics of North America. Small animal practice.

[52] Christiane Fellbaum,et al. Analysis of a Hand-Tagging Task , 1997, Workshop On Tagging Text With Lexical Semantics: Why, What, And How?.

[53] R. Tibshirani. Regression Shrinkage and Selection via the Lasso , 1996 .

[54] Martin E. P. Seligman,et al. Doing the right thing: Measuring wellbeing for public policy , 2011 .

[55] W. K. Hastings,et al. Monte Carlo Sampling Methods Using Markov Chains and Their Applications , 1970 .

[56] Sam K. Hui,et al. Green-lighting Movie Scripts : Revenue Forecasting and Risk Management , 2010 .

[57] Panagiotis G. Ipeirotis. Demographics of Mechanical Turk , 2010 .

[58] 清川英男,et al. CHALL, J. S. and DALE, E. (1995) Readability Revisited : The New Dale-Chall Readability Formula., Brookline Books , 1996 .

[59] Adam Kapelner,et al. bartMachine: A Powerful Tool for Machine Learning , 2013, ArXiv.

[60] S. Pocock,et al. Sequential treatment assignment with balancing for prognostic factors in the controlled clinical trial. , 1975, Biometrics.

[61] Eli Biham,et al. Advances in Cryptology — EUROCRYPT 2003 , 2003, Lecture Notes in Computer Science.

[62] David A. Forsyth,et al. Utility data annotation with Amazon Mechanical Turk , 2008, 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops.

[63] J. Friedman. Greedy function approximation: A gradient boosting machine. , 2001 .

[64] B. Hansen,et al. Optimal Full Matching and Related Designs via Network Flows , 2006 .

[65] Ronald Inglehart,et al. Theory and Validity of Life Satisfaction Scales , 2013 .

[66] Jun S. Liu,et al. Extracting sequence features to predict protein–DNA interactions: a comparative study , 2008, Nucleic acids research.

[67] KyungMann Kim. Group Sequential Methods with Applications to Clinical Trials , 2001 .

[68] M. Seligman. Flourish: A Visionary New Understanding of Happiness and Well-being , 2011 .

[69] Avalanche Forecasting: Using Bayesian Additive Regression Trees (BART) , 2014 .

[70] Bo Pang,et al. Thumbs up? Sentiment Classification using Machine Learning Techniques , 2002, EMNLP.

[71] C. B. Colby. The weirdest people in the world , 1973 .

[72] E. George. Capture—recapture estimation via Gibbs sampling , 1992 .

[73] Rada Mihalcea,et al. Amazon Mechanical Turk for Subjectivity Word Sense Disambiguation , 2010, Mturk@HLT-NAACL.

[74] A. Bakker,et al. The Measurement of Work Engagement With a Short Questionnaire , 2006 .

[75] L. Eyde,et al. Psychological testing and psychological assessment. A review of evidence and issues. , 2001, The American psychologist.

[76] Deborah M. Gordon,et al. The effect of individual variation on the structure and function of interaction networks in harvester ants , 2011, Journal of The Royal Society Interface.

[77] C B Begg,et al. The impact of treatment allocation procedures on nominal significance levels and bias. , 1987, Controlled clinical trials.

[78] Klaus Krippendorff,et al. Estimating the Reliability, Systematic Error and Random Error of Interval Data , 1970 .

[79] Shalabh. Design of Experiments: an Introduction based on Linear Models , 2012 .

[80] Steven D. Levitt,et al. What Do Laboratory Experiments Measuring Social Preferences Reveal About the Real World , 2007 .

[81] R. McCrae,et al. An introduction to the five-factor model and its applications. , 1992, Journal of personality.

[82] Steven D. Levitt,et al. FIELD EXPERIMENTS IN ECONOMICS : THE PAST , THE PRESENT , AND THE FUTURE , 2008 .

[83] Damian McEntegart,et al. Randomization by minimization for unbalanced treatment allocation , 2009, Statistics in medicine.

[84] Jon Oberlander,et al. What Are They Blogging About? Personality, Topic and Motivation in Blogs , 2009, ICWSM.

[85] Nancy Cartwright,et al. Are RCTs the Gold Standard? , 2007 .

[86] E. Diener,et al. Social relations, health behaviors, and health outcomes: a survey and synthesis. , 2013, Applied psychology. Health and well-being.

[87] Lydia B. Chilton,et al. The labor economics of paid crowdsourcing , 2010, EC '10.

[88] L. Pearlin,et al. The structure of coping. , 1978, Journal of health and social behavior.

[89] E. Deci,et al. Self-determination theory and the facilitation of intrinsic motivation, social development, and well-being. , 2000, The American psychologist.

[90] Shelley E. Taylor. Social Support: A Review , 2011 .

[91] Brent D. Rosso,et al. On the meaning of work: A theoretical integration and review , 2010 .

[92] David J. Hand,et al. Good methods for coping with missing data in decision trees , 2008, Pattern Recognit. Lett..

[93] Russ B. Altman,et al. Missing value estimation methods for DNA microarrays , 2001, Bioinform..

[94] H. Simon,et al. A Behavioral Model of Rational Choice , 1955 .

[95] Jonathon Read,et al. Using Emoticons to Reduce Dependency in Machine Learning Techniques for Sentiment Classification , 2005, ACL.

[96] G. Harrison,et al. Field experiments , 1924, The Journal of Agricultural Science.

[97] Y. Hayakawa,et al. A BETA‐BINOMIAL MODEL FOR ESTIMATING THE SIZE OF A HETEROGENEOUS POPULATION , 2005 .

[98] Daniel M. Oppenheimer,et al. Instructional Manipulation Checks: Detecting Satisficing to Increase Statistical Power , 2009 .

[99] Martha Palmer,et al. Number or Nuance: Which Factors Restrict Reliable Word Sense Annotation? , 2010, LREC.

[100] Emil Pitkin,et al. Peeking Inside the Black Box: Visualizing Statistical Learning With Plots of Individual Conditional Expectation , 2013, 1309.6392.

[101] Erik T. Mueller,et al. Open Mind Common Sense: Knowledge Acquisition from the General Public , 2002, OTM.

[102] C B Begg,et al. A treatment allocation procedure for sequential clinical trials. , 1980, Biometrics.

[103] T. Kashdan,et al. Understanding the search for meaning in life: personality, cognitive style, and the dynamic between seeking and experiencing meaning. , 2008, Journal of personality.

[104] D R Taves,et al. Minimization: A new method of assigning patients to treatment and control groups , 1974, Clinical pharmacology and therapeutics.

[105] James R. Gattiker,et al. Parallel Bayesian Additive Regression Trees , 2013, 1309.1906.

[106] Damian McEntegart,et al. The Pursuit of Balance Using Stratified and Dynamic Randomization Techniques: An Overview , 2003 .

[107] A. Kapelner,et al. An Interactive Statistical Image Segmentation and Visualization System , 2007, International Conference on Medical Information Visualisation - BioMedical Visualisation (MediVis 2007).

[108] Gregory J. Park,et al. Predicting Dark Triad Personality Traits from Twitter Usage and a Linguistic Analysis of Tweets , 2012, 2012 11th International Conference on Machine Learning and Applications.

[109] M. Csíkszentmihályi. Creativity: Flow and the Psychology of Discovery and Invention , 1996 .

[110] Rada Mihalcea,et al. Exploiting Agreement and Disagreement of Human Annotators for Word Sense Disambiguation , 2003 .

[111] A. Atkinson. Optimum biased coin designs for sequential clinical trials with prognostic factors , 1982 .

[112] R. Little. Pattern-Mixture Models for Multivariate Incomplete Data , 1993 .

[113] Ananthanarayanan Parasuraman,et al. Assessing response quality. A self‐disclosure approach to assessing response quality in mall intercept and telephone interviews , 1984 .

[114] J. Cacioppo,et al. The need for cognition. , 1982 .

[115] Bo Pang,et al. A Sentimental Education: Sentiment Analysis Using Subjectivity Summarization Based on Minimum Cuts , 2004, ACL.

[116] David W. Aha,et al. Instance‐based prediction of real‐valued attributes , 1989, Comput. Intell..

[117] H Merabet,et al. The design and analysis of sequential clinical trials , 2013 .

[118] Martha Palmer,et al. SemEval-2007 Task-17: English Lexical Sample, SRL and All Words , 2007, Fourth International Workshop on Semantic Evaluations (SemEval-2007).

[119] Bhekisipho Twala,et al. AN EMPIRICAL COMPARISON OF TECHNIQUES FOR HANDLING INCOMPLETE DATA USING DECISION TREES , 2009, Appl. Artif. Intell..

[120] James W. Pennebaker,et al. Linguistic Inquiry and Word Count (LIWC2007) , 2007 .

[121] Tal Yarkoni. Personality in 100,000 Words: A large-scale analysis of personality and word use among bloggers. , 2010, Journal of research in personality.

[122] Rachel T. A. Croson,et al. Gender Differences in Preferences , 2009 .

[123] Andy Liaw,et al. Classification and Regression by randomForest , 2007 .

[124] D. Rubin,et al. Using Multivariate Matched Sampling and Regression Adjustment to Control Bias in Observational Studies , 1978 .

[125] Panagiotis G. Ipeirotis,et al. Running Experiments on Amazon Mechanical Turk , 2010, Judgment and Decision Making.

[126] Antonio Torralba,et al. LabelMe: A Database and Web-Based Tool for Image Annotation , 2008, International Journal of Computer Vision.

[127] Richard H. Thaler,et al. Mental Accounting and Consumer Choice , 1985, Mark. Sci..

[128] J. Stiglitz,et al. Report by the commission on the measurement of economic performance and social progress , 2011 .

[129] R. Simon,et al. Restricted randomization designs in clinical trials. , 1979, Biometrics.

[130] Roger Tourangeau,et al. Taking the Audio Out of Audio-CASI , 2009 .

[131] Roberto Navigli,et al. Word sense disambiguation: A survey , 2009, CSUR.

[132] David A. Freedman,et al. On regression adjustments to experimental data , 2008, Adv. Appl. Math..

[133] James J. Appleton,et al. Measuring cognitive and psychological engagement: Validation of the Student Engagement Instrument , 2006 .

[134] D. Ariely,et al. Man's search for meaning: The case of Legos , 2008 .

[135] E. Lukacz,et al. Effect of amitriptyline on symptoms in treatment naïve patients with interstitial cystitis/painful bladder syndrome. , 2010, The Journal of urology.

[136] W. Rosenberger,et al. The theory of response-adaptive randomization in clinical trials , 2006 .

[137] Nathan Halko,et al. Finding Structure with Randomness: Probabilistic Algorithms for Constructing Approximate Matrix Decompositions , 2009, SIAM Rev..

[138] E. Diener,et al. Review of the Satisfaction with Life Scale , 1993 .

[139] Adam J. Berinsky,et al. Evaluating Online Labor Markets for Experimental Research: Amazon.com's Mechanical Turk , 2012, Political Analysis.

[140] Cecilia Ovesdotter Alm,et al. Emotions from Text: Machine Learning for Text-based Emotion Prediction , 2005, HLT.

[141] Brendan T. O'Connor,et al. Cheap and Fast – But is it Good? Evaluating Non-Expert Annotations for Natural Language Tasks , 2008, EMNLP.

[142] K. Anders Ericsson,et al. Attaining Excellence Through Deliberate Practice: Insights from the Study of Expert Performance , 2008 .

[143] Hugo Liu,et al. A Corpus-based Approach to Finding Happiness , 2006, AAAI Spring Symposium: Computational Approaches to Analyzing Weblogs.

[144] Siddharth Suri,et al. Conducting behavioral research on Amazon’s Mechanical Turk , 2010, Behavior research methods.

[145] Adam Kapelner,et al. Matching on‐the‐fly: Sequential allocation with higher power and efficiency , 2014, Biometrics.

[146] Kari Lock Morgan,et al. Rerandomization to improve covariate balance in experiments , 2012, 1207.5625.

[147] P. Rothwell,et al. External validity of randomised controlled trials: “To whom do the results of this trial apply?” , 2005, The Lancet.

[148] Donald Geman,et al. Stochastic Relaxation, Gibbs Distributions, and the Bayesian Restoration of Images , 1984, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[149] Richard E. Lucas,et al. Predictors of Regional Well-Being: A County Level Analysis , 2011 .

[150] Maxine Eskénazi,et al. Clustering dictionary definitions using Amazon Mechanical Turk , 2010, Mturk@HLT-NAACL.

[151] Marilyn A. Walker,et al. Using Linguistic Cues for the Automatic Recognition of Personality in Conversation and Text , 2007, J. Artif. Intell. Res..

[152] Richard Layard,et al. Rethinking public economics: the implications of rivalry and habit , 2003 .

[153] Peter D. Turney. Thumbs Up or Thumbs Down? Semantic Orientation Applied to Unsupervised Classification of Reviews , 2002, ACL.

[154] Nicole A. Lazar,et al. Statistical Analysis With Missing Data , 2003, Technometrics.

[155] Stan Szpakowicz,et al. Identifying Expressions of Emotion in Text , 2007, TSD.

[156] Christopher M. Danforth,et al. Temporal Patterns of Happiness and Information in a Global Social Network: Hedonometrics and Twitter , 2011, PloS one.

[157] Damaraju Raghavarao. Use of Distance Function in Sequential Treatment Assignment for Prognostic Factors in the Controlled Clinical Trial , 1980 .

[158] J. Fleiss,et al. Intraclass correlations: uses in assessing rater reliability. , 1979, Psychological bulletin.

[159] J. Hoffman,et al. The role of visual attention in saccadic eye movements , 1995, Perception & psychophysics.

[160] Dana Chandler,et al. Preventing Satisficing in Online Surveys: A "Kapcha" to Ensure Higher Quality Data , 2010 .

[161] M. Seligman,et al. Pursuit of pleasure, engagement, and meaning: Relationships to subjective and objective measures of well-being , 2010 .

[162] David E. Booth,et al. Analysis of Incomplete Multivariate Data , 2000, Technometrics.

[163] William F. Rosenberger,et al. Handling Covariates in the Design of Clinical Trials. , 2008, 1102.3773.

[164] C. F. Kao,et al. The efficient assessment of need for cognition. , 1984, Journal of personality assessment.

[165] Brent Simpson,et al. Emotional reactions to losing explain gender differences in entering a risky lottery , 2010, Judgment and Decision Making.

[166] David R. Anderson,et al. Statistical inference from capture data on closed animal populations , 1980 .

[167] B. Kindo,et al. MBACT - Multiclass Bayesian Additive Classication Trees , 2013 .

[168] Jon Oberlander,et al. Identifying more bloggers: Towards large scale personality classification of personal weblogs , 2007, ICWSM.

[169] John Langford,et al. CAPTCHA: Using Hard AI Problems for Security , 2003, EUROCRYPT.

[170] Ari Rappoport,et al. Enhanced Sentiment Learning Using Twitter Hashtags and Smileys , 2010, COLING.

[171] M. Larsen,et al. The Psychology of Survey Response , 2002 .

[172] Robert B. Gramacy,et al. Dynamic Trees for Learning and Design , 2009, 0912.1586.

[173] P. McCullagh. Estimating the Number of Unseen Species: How Many Words did Shakespeare Know? , 2008 .

[174] J. Chall,et al. Readability revisited : the new Dale-Chall readability formula , 1995 .

[175] Peter Green,et al. Markov chain Monte Carlo in Practice , 1996 .

[176] Christian P. Robert,et al. The Bayesian choice : from decision-theoretic foundations to computational implementation , 2007 .

[177] J. Krosnick,et al. Survey research. , 1999, Annual review of psychology.

[178] Nansook Park,et al. Positive Psychology as the Evenhanded Positive Psychologist Views It. , 2003 .

[179] Dean P. Foster,et al. New Insights from Coarse Word Sense Disambiguation in the Crowd , 2012, COLING.

[180] Chih-Jen Lin,et al. LIBLINEAR: A Library for Large Linear Classification , 2008, J. Mach. Learn. Res..

[181] Leo Breiman,et al. Statistical Modeling: The Two Cultures (with comments and a rejoinder by the author) , 2001, Statistical Science.

[182] Ed Diener,et al. Very Happy People , 2002, Psychological science.

[183] P. Costa,et al. Validation of the five-factor model of personality across instruments and observers. , 1987, Journal of personality and social psychology.

[184] Jon Sprouse. A validation of Amazon Mechanical Turk for the collection of acceptability judgments in linguistic theory , 2010, Behavior research methods.

[185] Rohit Prasad,et al. Automatic Detection of Psychological Distress Indicators and Severity Assessment from Online Forum Posts , 2012, COLING.

[186] Jon A. Krosnick,et al. Satisficing in surveys: Initial evidence , 1996 .

[187] Leo Breiman,et al. Random Forests , 2001, Machine Learning.

[188] T. Therneau,et al. An Introduction to Recursive Partitioning Using the RPART Routines , 2015 .

[189] Barry Schwartz,et al. Jobs, Careers, and Callings: People's Relations to Their Work , 1997 .

[190] Arthur E. Hoerl,et al. Ridge Regression: Biased Estimation for Nonorthogonal Problems , 2000, Technometrics.

[191] John K Kruschke,et al. Bayesian data analysis. , 2010, Wiley interdisciplinary reviews. Cognitive science.

[192] George Metakides,et al. Web Science , 2008, ECAI.

[193] Stephen Senn. Consensus and Controversy in Pharmaceutical Statistics , 2000 .