Javascript is required
Search
Volume 3, Issue 2, 2025

Abstract

Full Text|PDF|XML

The psychometric validity of multiple-choice questions (MCQs) generated by an advanced Artificial Intelligence (AI) language model (ChatGPT) was evaluated in comparison with those developed by experienced human instructors, with a focus on mathematics teacher education. Two parallel 30-item MCQ tests—one human-designed and one AI-generated—were administered to 30 mathematics teacher trainees. A comprehensive psychometric analysis was conducted using six metrics: item difficulty index (Pi), discrimination index (D), point-biserial correlation, item-test correlation (Rit), Cronbach’s alpha (α) for internal consistency, and score variance. The analysis was facilitated by the Analysis of Didactic Items with Excel (AnDIE) tool. Results indicated that the human-authored MCQs exhibited acceptable difficulty (mean Pi = 0.55), moderate discrimination power (mean D = 0.31), and strong internal consistency (Cronbach’s α = 0.752). In contrast, the AI-generated MCQs were found to be substantially more difficult (mean Pi = 0.22), demonstrated weak discrimination (mean D = 0.16), and yielded negative internal consistency reliability (Cronbach’s α = −0.1), raising concerns about their psychometric quality. While AI-generated assessments offer advantages in terms of scalability and speed, the findings underscore the necessity of expert human review to ensure content validity, construct alignment, and pedagogical appropriateness. These results suggest that AI, in its current form, is not yet equipped to autonomously generate assessment instruments of sufficient quality for high-stakes educational settings. A hybrid test design model is therefore advocated, wherein AI is leveraged for initial item drafting, followed by rigorous human refinement. This approach may enhance both efficiency and quality in the development of educational assessments. The implications extend to educators, assessment designers, and developers of educational AI systems, highlighting the need for collaborative human-AI frameworks to achieve reliable, valid, and pedagogically sound testing instruments.

Abstract

Full Text|PDF|XML

The reduction of excessive academic burden in China’s basic education system has been established as a central objective of national education reform and has become a subject of intense policy debate. To elucidate the complex strategic interactions that shape the implementation of the “Double Reduction” policy, a multi-agent evolutionary game model was constructed incorporating three principal stakeholder groups: government authorities, schools and teachers, and students and parents. Replicator dynamic equations were employed to examine the evolutionary stability of stakeholder strategies and the conditions under which equilibrium outcomes emerge. Through numerical simulations, the influence of regulatory enforcement intensity on behavioral trajectories and convergence patterns was evaluated. The results reveal that asymptotically stable equilibria exist, with optimal system performance achieved when government bodies maintain active and credible regulatory oversight, educational institutions engage in substantive and sustained burden-reduction efforts, and families adopt cooperative and adaptive responses. By clarifying the mechanisms through which stakeholder interactions determine collective outcomes, this study provides theoretical support for the refinement of policy coordination and the long-term enhancement of education governance capacity. These findings contribute not only to the understanding of the “Double Reduction” policy’s systemic impact but also to broader discussions on the role of evolutionary game theory in evaluating multi-agent policy interventions in education systems.

Abstract

Full Text|PDF|XML
In the context of the digital era and ongoing reforms in higher education, how to cultivate and enhance university teachers’ digital teaching competence has become a prominent topic among scholars. Accordingly, this study examined the relationships between university assessment mechanisms and teachers’ digital teaching competence based on 422 valid questionnaires. The results indicate that instructional assessment, research assessment, administrative assessment, and qualification assessment all exert positive effects on teachers’ digital teaching competence. Social support moderates the negative relationship between work stress and digital teaching competence, and further moderates the mediating role of work stress in the relationships between the four dimensions of assessment mechanism rationality and digital teaching competence. The findings provide insights and recommendations for optimizing assessment mechanisms and promoting the modern professional development of university teachers.

Abstract

Full Text|PDF|XML
Significant regional imbalances have long been observed in China’s development, with the level of educational development in the western region consistently lagging behind the national average. Structural disparities are also evident among the provinces within the region. To systematically identify the determinants of these disparities and to characterize the spatial development patterns, a multidimensional evaluation framework was constructed using six indicators: number of schools, number of teachers, average years of schooling, public library collection size, governmental fiscal education expenditure, and number of internet users. Panel data from 12 western provinces (including municipalities and autonomous regions) for 2008 and 2017 were employed. Indicator weights were determined using the entropy method, followed by cluster analysis to classify the levels of educational development across the region. The findings indicate a steady overall improvement in educational development in western China, although substantial interprovincial disparities persist. Based on these results, policy recommendations are presented, including the optimization of the education policy system, improvement of resource allocation structures, strengthening of high-quality talent recruitment and incentive mechanisms, and coordinated planning of educational resources. The conclusions provide empirical support and policy guidance for enhancing educational equity and promoting balanced development in western China.

Abstract

Full Text|PDF|XML
Artificial intelligence (AI) changes the way university students learn and improve skills. A total of 476 valid responses were received. This study examined AI anxiety’s dual effect on university students’ proactive skill development and the related mechanisms. The results show that AI anxiety promotes proactive skill development through challenge appraisal and inhibits it through threat appraisal. AI literacy strengthens the positive challenge appraisal and weakens the negative threat appraisal. A high self-driven profile and other profiles driven by resource management, crisis response, and competence are conducive to developing skills proactively. AI anxiety has a complex influence on students’ proactive skill development. At the same time, this provides guidance for universities’ digital transformation and talent cultivation, underscoring the importance of improving AI literacy and fostering constructive mindsets.
- no more data -