Inherent Measurement Challenges in the Next Generation Science Standards for Both Formative and Summative Assessment

[1]  T. Haladyna Developing and validating multiple-choice test items, 3rd ed. , 2004 .

[2]  R. Mislevy Evidence and inference in educational assessment , 1994 .

[3]  J. H. McMillan Annual Meeting of the American Educational Research , 2001 .

[4]  K. Rust,et al.  Population Inferences and Variance Estimation for NAEP Data , 1992 .

[5]  Lihua Yao,et al.  Methods and Models for Vertical Scaling , 2007 .

[6]  Robert J. Mislevy,et al.  Psychometric and Evidentiary Approaches to Simulation Assessment in Packet Tracer Software , 2009, 2009 Fifth International Conference on Networking and Services.

[7]  Margaret Heritage,et al.  Formative Assessment: What Do Teachers Need to Know and Do? , 2007 .

[8]  R. Bennett,et al.  Transforming K–12 Assessment: Integrating Accountability Testing, Formative Assessment and Professional Support , 2009 .

[9]  Helen R. Quinn,et al.  A Framework for K-12 Science Education: Practices, Crosscutting Concepts, and Core Ideas , 2013 .

[10]  William H. Schmidt,et al.  A Coherent Curriculum: The Case of Mathematics. , 2002 .

[11]  Mark Wilson Saltus: A psychometric model of discontinuity in cognitive development. , 1989 .

[12]  James E. Carlson Statistical Models for Vertical Linking , 2009 .

[13]  Robert J. Mislevy,et al.  Automated scoring of complex tasks in computer-based testing , 2006 .

[14]  Seock-Ho Kim,et al.  A Comparison of Linking and Concurrent Calibration Under the Graded Response Model , 1997 .

[15]  V. Shute Stealth Assessment in Computer-Based Games to Support Learning , 2011 .

[16]  Robert J. Mislevy,et al.  An evidence centered design for learning and assessment in the digital world , 2010 .

[17]  Robert J. Mislevy,et al.  Intuitive Test Theory , 2005 .

[18]  Eugene G. Johnson,et al.  Scaling Procedures in NAEP , 1992 .

[19]  Finn V. Jensen,et al.  Bayesian Networks and Decision Graphs , 2001, Statistics for Engineering and Information Science.

[20]  Karen Barton,et al.  Using Technology to Assess Hard-to-Measure Constructs in the Common Core State Standards and to Expand Accessibility: English Language Arts , 2012 .

[21]  Joanna S. Gorin Test Design with Cognition in Mind , 2007 .

[22]  R. Sternberg,et al.  Complex Problem Solving : Principles and Mechanisms , 1992 .

[23]  Effect of Examinee Ability on Test Equating Invariance , 1988 .

[24]  R. Almond,et al.  Focus Article: On the Structure of Educational Assessments , 2003 .

[25]  W. M. Yen Vertical Scaling and No Child Left Behind , 2007 .

[26]  R. Shavelson,et al.  Rhetoric and reality in science performance assessments: An update. , 1996 .

[27]  Eric T. Bradlow,et al.  Testlet Response Theory and Its Applications , 2007 .

[28]  B. Junker,et al.  Cognitive Assessment Models with Few Assumptions, and Connections with Nonparametric Item Response Theory , 2001 .

[29]  M. Oliveri,et al.  The Learning Sciences in Educational Assessment: The Role of Cognitive Models , 2011, Alberta Journal of Educational Research.

[30]  William Stout,et al.  The theoretical detect index of dimensionality and its application to approximate simple structure , 1999 .

[31]  Robert J. Mislevy,et al.  Evidence-Centered Design of Epistemic Games: Measurement Principles for Complex Learning Environments. , 2010 .

[32]  Deborah L Thurston,et al.  Meta-level strategies for reformulation of evaluation function during iterative design , 2001 .

[33]  Randy Elliot Bennett,et al.  Cognitively Based Assessment of, for, and as Learning (CBAL): A Preliminary Theory of Action for Summative and Formative Assessment , 2010 .

[34]  Mark J. Gierl,et al.  Evaluating DETECT Classification Accuracy and Consistency When Data Display Complex Structure , 2006 .

[35]  Dubravka Svetina Assessing Dimensionality of Noncompensatory Multidimensional Item Response Theory With Complex Structures , 2013 .

[36]  R. Almond,et al.  Making Sense of Data From Complex Assessments , 2002 .

[37]  Ronald J. Ziegler Complexity reduction in automotive design and development , 2005 .

[38]  Andrew Thomas,et al.  WinBUGS - A Bayesian modelling framework: Concepts, structure, and extensibility , 2000, Stat. Comput..

[39]  Brian C. Nelson,et al.  Evidence-centered Design for Diagnostic Assessment within Digital Learning Environments: Integrating Modern Psychometrics and Educational Data Mining , 2012, EDM 2012.

[40]  Mark R. Wilson,et al.  Towards coherence between classroom assessment and accountability , 2004 .

[41]  Matthias von Davier Mixture Distribution Diagnostic Models. Research Report. ETS RR-07-32. , 2007 .

[42]  Allen Newell,et al.  Human Problem Solving. , 1973 .

[43]  David Hammer,et al.  A critique of how learning progressions research conceptualizes sophistication and progress , 2010, ICLS.

[44]  Hong Jiao,et al.  Construct Equivalence Across Grades in a Vertical Scale for a K-12 Large-Scale Reading Assessment , 2009 .

[45]  R. Mayer Thinking, problem solving, cognition, 2nd ed. , 1992 .

[46]  Kentaro Yamamoto,et al.  Item Response Theory Scale Linking in NAEP , 1992 .

[47]  Meryl W. Bertenthal,et al.  Systems for state science assessment , 2005 .

[48]  Rebecca Zwick,et al.  Overview of the National Assessment of Educational Progress , 1992 .

[49]  Ying Li,et al.  Exploring the Full-Information Bifactor Model in Vertical Scaling With Construct Shift , 2012 .

[50]  Bert F. Green,et al.  In defense of measurement. , 1978 .

[51]  Calibration of Response Data Using MIRT Models With Simple and Mixed Structures , 2012 .

[52]  David A. Gillam,et al.  A Framework for K-12 Science Education: Practices, Crosscutting Concepts, and Core Ideas , 2012 .

[53]  Derek C. Briggs Making Inferences about Growth and Value-Added: Design Issues for the PARCC Consortium. A White Paper. , 2011 .

[54]  Robert J. Mislevy,et al.  Putting ECD into Practice: The Interplay of Theory and Data in Evidence Models within a Digital Learning Environment , 2012, EDM 2012.

[55]  Mike U. Smith A View from Biology , 2012 .

[56]  Aaron Rogat,et al.  Learning Progressions in Science: An Evidence-Based Approach to Reform. CPRE Research Report # RR-63. , 2009 .

[57]  Eugene G. Johnson,et al.  Sampling and Weighting in the National Assessment. , 1992 .

[58]  A. Béguin,et al.  MCMC estimation and some model-fit analysis of multidimensional IRT models , 2001 .

[59]  Roy Levy,et al.  A generalized dimensionality discrepancy measure for dimensionality assessment in multidimensional item response theory. , 2011, The British journal of mathematical and statistical psychology.

[60]  Joan L. Herman,et al.  Coherence: Key to Next Generation Assessment Success. AACC Report. , 2010 .

[61]  M. Heritage,et al.  Learning Progressions: Supporting Instruction and Formative Assessment Margaret Heritage the Council of Chief State School Officers T He Fast Scass ƒ Formative Assessment for Teachers and Learners Learning Progressions: Supporting Instruction and Formative Assessment , 2008 .

[62]  William F. McComas,et al.  The Atlas of Science Literacy , 2014 .

[63]  Bernard R. Gifford,et al.  Computer-Based Assessment in E-Learning: A Framework for Constructing "Intermediate Constraint" Questions and Tasks for Technology Platforms , 2006 .

[64]  Dubravka Svetina Assessing Dimensionality in Complex Data Structures: A Performance Comparison of DETECT and NOHARM Procedures , 2011 .

[65]  R. Brennan,et al.  Test Equating, Scaling, and Linking: Methods and Practices , 2004 .