Educational Statistics, 17(2), 191-204. https://doi.org/10.2307/1165169
Brookhart, S. M. (2010). How to assess higher-order thinking skills in your classroom. ASCD.
Brookhart, S. M., & Chen, F. (2015). The quality and effectiveness of descriptive rubrics.
Educational Review, 67(3), 343-368. https://doi.org/10.1080/00131911.2014.929565
Chall, J. S. (1983). Stages of reading development. McGraw-Hill.
Chall, J. S. (1996). Stages of reading development (2nd ed.). Harcourt Brace College Publishers.
Chen, K.-M., & Jen, T.-H. (2012, March 5-7). Positional effect of item blocks in international
large-scale assessment caused the item difficulty different between Taiwan and the USA
[Paper presentation]. The 6th International Technology, Education and Development
Conference, Valencia, Spain.
Cizek, G. J. (2012). An introduction to contemporary standard setting: Concepts, characteristics,
and contexts. In G. J. Cizek (Ed.), Setting performance standards: Foundations, methods,
and innovations (2nd ed., pp. 3-14). Taylor & Francis.
Common Core State Standards Initiative. (2009). Common core state standards initiative
standards-setting criteria. http://www.corestandards.org/about-the-standards/developmentprocess/
Common Core State Standards Initiative. (2022). Key shifts in English language arts. http://
www.corestandards.org/other-resources/key-shifts-in-english-language-arts/
Council of Europe. (2001). Common European framework of reference for languages.
Cambridge University Press.
Council of Europe. (2020). Common European framework of reference for languages: Learning,
teaching, assessment– Companion volume. Council of Europe Publishing.
Dijk, T. A. v., & Kintsch, W. (1983). Strategies of discourse comprehension. Academic Press.
DJEFAFLIA, K. (2018). Developing critical thinking skill through reading: Both teachers and
first year master students’ perspectives. Guelma University.
Gonzalez, E. J., Joseph Galia, A. A., Erberber, E., & Diaconu, D. (2004). Reporting student
achievement in mathematics and science. In M. O. Martin, I. V. S. Mullis, & S. J.
Chrostowski (Eds.), TIMSS 2003 technical report (pp. 275-307). TIMSS & PIRLS
International Study Center.
Gouëdard, P., Pont, B., Hyttinen, S., & Huang, P. (2020). Curriculum reform: A literature review
to support effective implementation (OECD Education Working Paper No. 239). OECD.
https://doi.org/10.1787/efe8a48c-en
Haladyna, T. M., & Rodriguez, M. C. (2013). Developing and validating test items. Routledge.
James, Z., & Rossiter, J. (2018). Using scale-anchoring to interpret the young lives 2016-17
achievement scale. Young Lives.
Kintsch, W. (1988). The role of knowledge in discourse comprehension: A constructionintegration
model. Psychotocical Review, 95(2), 163-182. https://doi.org/10.1016/S0166-
4115(08)61551-4
Kintsch, W. (2019). Revisiting the construction-integration model of text comprehension and its
implications for instruction. In D. E. Alvermann, N. J. Unrau, M. Sailors, & R. B. Ruddell
(Eds.), Theoretical models and processes of literacy (7th ed., pp. 178-203). Routledge.
Kintsch, W., & Rawson, K. A. (2005). Comprehension. In M. J. Snowling & C. Hulme
(Eds.), The science of reading: A handbook (pp. 209-226). Blackwell. https://doi.org/
10.1002/9780470757642.ch12
Kolen, M. J., & Brennan, R. L. (2004). Test equating, scaling, and linking: Methods and
practices (3rd ed.). Springer.
Masters, G. N. (1982). A Rasch model for partial credit scoring. Psychometrika, 47, 149-174.
https://doi.org/10.1007/BF02296272
New Zealand Ministry of Education. (2015). The New Zealand curriculum. Crown.
New Zealand Ministry of Education. (2025a). The learning progression frameworks. https://
curriculumprogresstools.education.govt.nz/lpf-tool/
New Zealand Ministry of Education. (2025b). Understanding the reading framework. https://
curriculumprogresstools.education.govt.nz/lpfs/understanding-the-reading-framework/
Organisation for Economic Co-operation and Development. (2017). Proficiency scale
construction. In OECD (Ed.), PISA 2015 technical report (pp. 275-287).
Organisation for Economic Co-operation and Development. (2019). PISA 2018 assessment and
analytical framework.
Organisation for Economic Co-operation and Development. (2020). PISA 2018 technical report.
Organisation for Economic Co-operation and Development. (2021). 21st-century readers:
Developing literacy skills in a digital world.
Olsen, R. V., & Nilsen, T. (2017). Standard setting in PISA and TIMSS and how these procedures
can be used nationally. In S. Blömeke & J.-E. Gustafsson (Eds.), Standard setting in
education: The Nordic countries in an international perspective (pp. 69-84). Springer.
Perie, M. (2008). A guide to understanding and developing performance-level descriptors.
Educational Measurement: Issues and Practice, 27(4), 15-29. https://doi.org/10.1111/
j.1745-3992.2008.00135.x
Phillips, G. W. (2012). The benchmark method of standard setting. In G. J. Cizek (Ed.), Setting
performance standards: Foundations, methods, and innovations (2nd ed., pp. 323-345).
Routledge.
Schneider, M. C., Huff, K. L., Egan, K. L., Gaines, M. L., & Ferrara, S. (2013). Relationships
among item cognitive complexity, contextual demands, and item difficulty: Implications
for achievement-level descriptors. Educational Assessment, 18(2), 99-121. https://doi.org/
10.1080/10627197.2013.789296
Shalihin, R. R. (2024). Higher-order versus lower-order thinking skills: How much does the
hierarchy matter? https://www.bera.ac.uk/blog/higher-order-versus-lower-order-thinkingskills-
how-much-does-the-hierarchy-matter
Smith, R., Snow, P., Serry, T., & Hammond, L. (2021). The role of background knowledge in
reading comprehension: A critical review. Reading Psychology, 42(3), 214-240. https://doi.
org/10.1080/02702711.2021.1888348
Stanley, T. (2019). Using rubrics for performance-based assessment: A practical guide to
evaluating student work. Routledge.
Stevens, D. D., & Levi, A. J. (2013). Introduction to rubrics: An assessment tool to save grading
time, convey effective feedback, and promote student learning (2nd ed.). Stylus.
U.S. Department of Education. (2025). Distribution of reading questions: 2019 and 2024.
https://nces.ed.gov/nationsreportcard/reading/distributequest.aspx
von Davier, M., Kennedy, A., Reynolds, K., Fishbein, B., Khorramdel, L., Aldrich, C.,
Bookbinder, A., Bezirhan, U., & Yin, L. (2024). TIMSS 2023 international results in
mathematics and science. Boston College, TIMSS & PIRLS International Study Center.
https://doi.org/10.6017/lse.tpisc.timss.rs6460
Wagner, J.-P., & Hastedt, D. (2022). Valuing curriculum-based international large-scale
assessments: Ensuring alignment with national curricula in IEA studies. IEA Compass:
Briefs in Education, 16, 1-7.
Yan, Z., & Yang, L. (2022). Assessment-as-learning in the global assessment reforms. In Z. Yan
& L. Yang (Eds.), Assessment as learning: Maximising opportunities for student learning
and achievement (pp. 1-7). Routledge.