Secciones
Referencias
Resumen
Servicios
Buscar
Fuente


Development and validation of the AI-enhanced Self-Regulated Learning (AI-SRL) scale
Desarrollo y validación de la escala de Aprendizaje Autorregulado con IA(AI-SRL)
RIED-Revista Iberoamericana de Educación a Distancia, vol. 29, núm. 1, pp. 185-216, 2026
Asociación Iberoamericana de Educación Superior a Distancia

Estudios e investigaciones


Recepción: 01 Junio 2025

Aprobación: 27 Agosto 2025

DOI: https://doi.org/10.5944/ried.45452

Abstract: The integration of Artificial Intelligence (AI) into educational settings offers new opportunities for enhancing self-regulated learning (SRL) among students. However, current tools lack precision in measuring how AI influences learners' self-regulatory capabilities. This study introduces the AI-Enhanced Self-Regulated Learning (AI-SRL) Scale, designed to assess SRL in AI-supported educational environments. Through a rigorous scale development process involving a literature review, expert consultation, and systematic validation, we developed a comprehensive measurement instrument. The development employed a two-phase validation approach with separate samples for exploratory and confirmatory factor analyses. The final validated scale comprises 22 items organized into five dimensions: AI Competence, Learning Awareness, Learning Strategies, Engagement and Efficiency, and Ethical Collaboration. These factors capture the multifaceted nature of how students regulate their learning when using AI tools, from technical proficiency to ethical considerations. The scale demonstrated strong psychometric properties, with excellent internal consistency and robust construct validity. This validated instrument has practical applications for educators aiming to optimize AI integration in their classrooms, researchers investigating the intersection of AI and SRL, and institutions developing AI-enhanced curricula. The AI-SRL Scale provides a reliable framework for assessing how effectively students leverage AI tools while maintaining their self-regulatory capabilities, thereby contributing to more effective and responsible AI implementation.

Keywords: AI, AIED, self-regulated learning, AI-SRL, AI-SRL scale.

Resumen: La integración de la Inteligencia Artificial (IA) en entornos educativos brinda nuevas oportunidades para mejorar el aprendizaje autorregulado (SRL) en los estudiantes. Sin embargo, las herramientas actuales carecen de precisión para evaluar cómo la IA influye en las capacidades autorregulatorias de los estudiantes. Este estudio presenta la Escala de Aprendizaje Autorregulado Mejorado por IA (AI-SRL), diseñada para evaluar el aprendizaje autorregulado en entornos educativos asistidos por IA. A través de un riguroso proceso de desarrollo de escala, que incluyó revisión de literatura, consulta con expertos y validación sistemática, se construyó un instrumento de medición integral. En el desarrollo se empleó un enfoque de validación en dos fases con muestras separadas para análisis factoriales exploratorios y confirmatorios. La escala final validada consta de 22 ítems organizados en cinco dimensiones: Competencia en IA, Conciencia de Aprendizaje, Estrategias de Aprendizaje, Implicación y Eficiencia, y Colaboración Ética. Estos factores capturan la naturaleza multifacética de cómo los estudiantes regulan su aprendizaje al usar herramientas de IA, desde competencia técnica hasta consideraciones éticas. La escala demostró sólidas propiedades psicométricas, con excelente consistencia interna y robusta validez de constructo. Este instrumento validado tiene aplicaciones prácticas para docentes que buscan optimizar la integración de IA en sus aulas, investigadores que investigan la intersección de IA y aprendizaje autorregulado, e instituciones desarrollando currículos mejorados con IA. La Escala AI-SRL proporciona un marco confiable para evaluar en qué medida los estudiantes aprovechan las herramientas de IA mientras mantienen sus capacidades autorregulatorias, contribuyendo así a una implementación de IA más efectiva y responsable.

Palabras clave: IA, AIED, aprendizaje autorregulado, AI-SRL, escala AI-SRL.

INTRODUCTION

Self-regulated learning (SRL), a critical component of success in digital and lifelong learning, involves learners actively setting goals, monitoring progress, and adjusting behaviors to achieve academic success (Hilpert et al., 2023; Lai, 2024). In the digital age, SRL has become increasingly crucial for lifelong learning, enabling individuals to adapt to changing educational needs while promoting problem-solving abilities (Adams et al., 2024; Hilpert et al., 2023; Kong & Yang, 2024). As educational practices evolve, integrating advanced technologies like AI into learning environments offers opportunities to enhance SRL while addressing associated challenges.

The theoretical foundations of SRL are primarily based on two comprehensive frameworks that have shaped educational psychology for over three decades. Zimmerman's cyclical phase model, grounded in social cognitive theory, conceptualizes SRL as structured into three recursive phases: forethought, performance, and self-reflection (Panadero, 2017; Schunk & Zimmerman, 2012; Zimmerman, 1989). The forethought phase encompasses task analysis, including goal setting and strategic planning, alongside self-motivational beliefs such as self-efficacy, outcome expectations, and intrinsic interest. The performance phase involves self-control strategies such as focused attention, task strategies, and self-instruction, combined with self-observation through metacognitive monitoring of one's progress and strategy effectiveness. The self-reflection phase includes self-judgment, where learners evaluate their performance against standards, and self-reaction processes involving satisfaction responses and adaptive or defensive reactions that influence future learning efforts (Panadero, 2017; Tinajero et al., 2024).

Complementing this temporal view, Pintrich's (2000) framework offers a taxonomy with four phases —forethought/planning/activation, monitoring, control, and reaction/reflection—applied across four regulatory areas: cognition, motivation/affect, behavior, and context (Tinajero et al., 2024; Yot-Domínguez & Marcelo, 2017). This framework is particularly valuable for its explicit treatment of motivational regulation and contextual factors, offering a 16-cell matrix of specific regulatory processes that provides exhaustive coverage of SRL components. Pintrich's model emphasizes that SRL is an active, constructive process whereby learners set goals for their learning and then attempt to monitor, regulate, and control their cognition, motivation, and behavior, guided and constrained by their goals and the contextual features in the environment (Panadero, 2017). These foundational models, while developed for conventional learning formats, provide the essential theoretical architecture for investigating how AI transforms established SRL processes.

The integration of AI in educational settings is transforming the structure of learning, bringing both opportunities and challenges in fostering SRL (Chang et al., 2023; Grüneke et al., 2024). AI technologies offer innovative ways to enhance traditional SRL strategies, providing personalized real-time feedback, flexible support, and real-time observing that supports learners in managing their learning processes effectively (Huang et al., 2023; Molenaar et al., 2023; Ng et al., 2024). Research demonstrates that AI applications efectively support metacognitive, cognitive, and behavioral regulation across Pintrich's four phases, with learning analytics dashboards significantly enhancing metacognitive awareness through real-time progress tracking and performance visualization (Gkintoni et al., 2025; Seo et al., 2021; Wang & Lin, 2023). Additionally, AI tools support metacognitive strategies through goal-setting support and strategic planning assistance, aligning with Zimmerman's forethought phase (Anggoro & Pratiwi, 2023; Liu et al., 2024). These systems can trace learning behaviors and provide insights into SRL processes, facilitating solutions that support the development of SRL (Chang et al., 2023; Jin et al., 2023; Molenaar et al., 2023).

Though AI tools have potential to support metacognitive skills and providing real-time monitoring (Dahri et al., 2024; Jin et al., 2023), researchers express concerns about overreliance on AI hindering essential skill development (Boguslawski et al., 2025). The literature highlights disagreement between AI's capacity to provide personalized support and the risk of diminishing learner autonomy through cognitive offloading effects, where frequent AI usage is negatively associated with critical thinking abilities (Chiu, 2024; Gerlich, 2025; Kong & Yang, 2024). Studies reveal that while learners perceive AI as useful for metacognitive, cognitive, and behavioral regulation, they show a strong preference for human support when addressing motivational needs, suggesting that AI applications consistently struggle to support motivational regulation—a critical component of Pintrich's framework (Lan & Zhou, 2025; Seo et al., 2021). Another critical aspect emphasized in studies is that studies emphasize the importance of maintaining a balance between AI support and human guidance in educational settings (Kong & Yang, 2024; Lai, 2024). While AI can provide detailed oversight and feedback mechanisms, educators play a crucial role in designing learning activities and providing emotional support (Chiu et al., 2023). Studies suggest that effective implementation of AI in SRL requires careful consideration of how these tools can complement rather than replace human interaction, particularly in supporting students' metacognitive development and motivation (Wang & Lin, 2023; Xia et al., 2023).

However, the potential of AI to fully enhance SRL has not yet been fully realized due to several barriers. While some studies highlight AI's potential to alleviate learning anxiety and enhance motivation (Biju et al., 2024), others raise concerns about the need for balanced integration to maintain active student participation (Grüneke et al., 2024). Furthermore, significant ethical concerns have arisen in multiple domains, including data privacy and surveillance issues, algorithmic bias that may embed educational inequalities, and academic integrity challenges as AI-generated content complicates the assessment of authentic learning (Gkintoni et al., 2025; Walter, 2024). To address these challenges, researchers emphasize the critical need for developing reliable measurement instruments to evaluate AI's impact on SRL processes (Baker, 2023; Grüneke et al., 2024; Jin et al., 2023; Molenaar et al., 2023).

Building on these theoretical foundations, the present study conceptualizes AI-enhanced SRL through five dimensions, extending and operationalizing Zimmerman's and Pintrich's frameworks for contemporary technological contexts. The AI Competence factor aligns with the forethought phase of both models, encompassing self-efficacy beliefs specifically related to AI tool usage and strategic planning for their deployment. Learning Awareness corresponds to metacognitive monitoring central to Pintrich's monitoring phase and Zimmerman's self-observation subprocess, capturing learners' awareness of their understanding and progress while interacting with AI. Learning Strategies and Engagement and Efficiency factors map onto the performance and control phases of both frameworks, reflecting cognitive strategy selection, adaptation, and the behavioral and motivational regulation necessary to persist in learning tasks. Critically, the Ethical Collaboration factor represents a necessary extension to classical models, operationalizing modern context regulation (Pintrich's fourth regulatory area) that addresses algorithmic bias, data privacy, and academic integrity concerns-regulatory demands unique to the AI era not anticipated in traditional SRL frameworks (Gkintoni et al., 2025; Walter, 2024).

The development of the AI-SRL Scale has important implications for evaluating diverse educational technology environments. In platforms utilizing real-time formative assessment tools like Socrative, the scale's Learning Awareness and Learning Strategies dimensions can measure whether instant feedback effectively prompts metacognitive monitoring and strategy adjustment (Bauer et al., 2025). Within Learning Management Systems such as Moodle featuring AI assistants like Corolair or CodeRunner Agent, the AI Competence and Engagement and Efficiency subscales can assess whether these 24/7 AI tutors enhance students' self-regulatory capabilities outside direct instructor supervision. For Intelligent Tutoring Systems providing adaptive scaffolding and personalized learning paths, the complete scale offers holistic evaluation of the entire regulatory cycle, from planning and strategy use to ethical considerations, offering valuable feedback for designing systems that are both intelligent and pedagogically sound (Wu & Chiu, 2025). As students increasingly engage with generative AI tools for academic work, the scale provides structured methods to investigate their regulation of inquiry, critical evaluation, and ethical AI use, addressing key gaps in understanding modern digital learning ecosystems (Li et al., 2025; Mohamed et al., 2025). Therefore, this study addresses the following research questions:

RQ1: How can AI-enhanced self-regulated learning be comprehensively measured through a scale that operationalizes and extends the established theoretical frameworks of Zimmerman and Pintrich for contemporary technological contexts?

RQ2: What is the factorial structure and psychometric properties of AI-SRL Scale when validated through exploratory and confirmatory factor analysis procedures?

RQ3: To what extent does the AI-SRL Scale function as a reliable and valid instrument for capturing the unique dimensions of SRL in AI-supported educational environments?

Literature review

Recent advancements in AI have fundamentally transformed the landscape of SRL in educational settings, creating both unprecedented opportunities and complex challenges that demand careful theoretical and empirical examination. The literature reveals a nuanced picture where AI technologies demonstrate strong potential for supporting cognitive and metacognitive dimensions of SRL (Seo et al., 2021; Wang & Lin, 2023), while simultaneously presenting significant challenges for motivational regulation and raising critical ethical concerns about learner agency and digital well-being (Lan & Zhou, 2025).

The theoretical foundations of SRL, particularly Zimmerman's cyclical model and Pintrich's comprehensive framework, have proven remarkably robust when applied to AI-enhanced learning contexts. Zimmerman's three-phase cyclical structure—forethought, performance, and self-reflection—maps effectively onto AI applications (Chiu, 2024), with AI systems providing differential support across these phases. Research by Lan and Zhou (2025) found that while 50% of AI applications primarily support the performance phase, comprehensive support across all three phases remains rare (only 21% of applications), with the self-reflection phase receiving the least AI support. This suggests a continued need for human guidance in deeper reflective processes. Similarly, Pintrich's framework, with its four areas of regulation (cognitive, motivational/affective, behavioral, and contextual), finds clear parallels in AI-mediated learning environments (Yot-Domínguez & Marcelo, 2017), though motivational regulation remains particularly problematic (Jin et al., 2023; Seo et al., 2021).

AI technologies have demonstrated significant support for core SRL components across disciplines, particularly in cognitive and metacognitive domains. Recent empirical studies reveal significant relationships between AI self-efficacy and academic performance, with practical application of AI tools serving as the strongest predictor of self-efficacy (Bećirović et al., 2025). Research by Liang et al. (2023) found that student-AI interaction significantly improved learning achievement through enhanced cognitive engagement (β = 0.046), while AI-powered feedback systems enhance students' metacognitive awareness through real-time progress tracking and performance visualization (Gkintoni et al., 2025; Molenaar et al., 2023). The implementation of AI in SRL reveals complex support-hindrance dynamics that require careful consideration. While AI tools like ChatGPT have enhanced students' research skills and supported self-directed learning through autonomy and engagement (Li et al., 2025), studies reveal concerning negative correlations (r = −0.39) between over-reliance on AI tools and critical thinking abilities, particularly among younger learners aged 17-25 (Gerlich, 2025). This "cognitive offloading" effect suggests that frequent AI usage may prevent the development of independent metacognitive skills, with learners becoming overly reliant on AI systems rather than developing autonomous self-regulation capabilities (Gerlich, 2025). Research by Glick et al. (2024) demonstrated that while AI-powered training modules significantly increased use of planning tools and goal-setting activities, the balance between AI support and learner agency remains a critical concern.

Perhaps the most significant finding across the literature is that AI applications consistently struggle to support motivational regulation, a critical component of effective SRL (Seo et al., 2021). Multiple systematic reviews confirm that while learners perceive AI as useful for metacognitive, cognitive, and behavioral regulation, they show a strong preference for human support when addressing motivational needs (Jin et al., 2023; Lan & Zhou, 2025). From a Self-Determination Theory perspective, while AI tools can support competence through personalized feedback and autonomy through learning pathway control (Chiu, 2024; Li et al., 2025), relatedness remains the weakest aspect of AI support.

Recent empirical evidence from 2020-2025 increasingly points toward hybrid human-AI regulatory systems as optimal approaches for supporting SRL. Research distinguishes between human-centered self-regulation, where AI serves as a facilitator providing tools and data for learner control, and AI-centered self-regulation, where AI generates SRL cycles based on data analysis (Lan & Zhou, 2025). The most effective implementations appear to involve human-AI interaction where AI engages with human-centered SRL cycles across behavioral, emotional, and cognitive dimensions while maintaining human agency (Wu & Chiu, 2025).

Overall, while AI presents transformative potential for SRL, the evidence suggests that effective implementation requires a carefully balanced approach that harnesses technological capabilities while preserving fundamental human elements of SRL. The development of new measurement tools like the AI-SRL Scale becomes crucial for providing deeper insights into the effectiveness and impact of AI tools, addressing the complex interplay between cognitive enhancement and potential motivational deficits, and guiding future integrations that maintain learner agency while leveraging AI's strengths in personalization and cognitive load management.

METHOD AND FINDINGS

In this study, we aimed to develop a scale to measure participants’ SRL levels in the context of AI use. Despite established guidelines, researchers often face limitations in scale development, including sample characteristics, methodological issues, and psychometric challenges. To improve future practices, researchers should carefully consider these potential limitations and make informed methodological choices throughout the scale development process (Morgado et al., 2017). Therefore, acknowledging these limitations, we employed a robust methodology for the development of a reliable and valid scale in this study.

Participants

Two separate samples were used for the exploratory factor analysis (EFA) and confirmatory factor analysis (CFA). The EFA sample consisted of 504 participants, while the CFA sample included 424 participants. Participants were recruited online through convenience and purposeful sampling (Creswell, 2014). Convenience sampling was used in a way that enabled us to reach participants more easily. We found participants from the universities that we somehow had connections with. Our sampling was purposeful, because we selected participants who could readily understand the scale items. Most of them were from the foreign language teaching or literature departments of universities. For those who were from other departments, we made sure they could easily understand the items on our scale. We preferred to collect data mainly from these departments to ensure clarity in item comprehension, which were in English. This might have affected the generalizability of the responses to some extent and may have influenced factor interpretation across some constructs. Future studies to validate the scale could collect data across diverse academic populations to enhance generalizability and statistics. The demographics of our participants are summarized in the table below.

Table 1
Demographics

As Table 1 indicates, we had a total of 928 participants for the first and second steps (EFA and CFA) of the study. 594 participants (64.0%) were female, while 334 (36.0%) were male. Among departments, 288 participants (31.0%) were from English Language Teaching, 262 (28.2%) from English Language and Literature, and 149 (16.1%) from Pharmacy. Class distribution included 212 participants (22.8%) from preparatory English classes, 184 (19.8%) freshmen, 260 (28.0%) sophomores, 180 (19.4%) juniors, and 92 (9.9%) seniors. We collected some other information related to the participants’ use of technology and AI in learning. Table 2 summarizes this information.

Table 2
Participants’ use of technology and AI in learning

As seen from Table 2, the participants indicated their level of using technology in learning, with a vast majority reporting moderate (395, 42.5%), high (338, 36.4%) and very high levels (141, 14%). The ones whose level was low and very low were 5.8% of the participants. The participants also provided information on how much experience they had with AI. 543 of them (58.5%) had one year of experience, 239 had 2 years (25.7%), 81 had 3 years (8.7%), and the others had 4, 5, 6 or more years of experience with AI. Lastly, we gathered input from the participants on how often they used AI for learning. The number of participants who reported to be using it every day was 104 (11.2%), 212 (almost 22.8%) used it several times a week, 141 (15.2%) once a week, 180 (19.3%) several times a month, and 189 (20.3%) once a month. Also, 102 (almost 11%) participants reported using AI for learning when they needed it.

Item generation

Scale development is a critical process in social and behavioral research, typically involving three main phases: item generation, scale construction, and scale evaluation (Boateng et al., 2018). The item generation phase begins with clearly defining the construct to be measured and creating an initial item pool (DeVellis, 1991). In this study, the initial pool of items was generated through a comprehensive review of the literature on SRL and AI. Content validity is assessed, often through expert review (Teeluckdharry et al., 2021). The items in our scale were designed to capture key dimensions of SRL in the context of AI, ensuring alignment with existing theoretical frameworks. The initial item pool underwent a rigorous content validation process by a panel of seven experts. These experts were selected based on specific criteria: (1) possession of a Ph.D. in Educational Technology, Educational Psychology, or a related field; (2) a minimum of five years of research experience; and (3) at least two peer-reviewed publications on either SRL or the application of AI in education. This purposive selection ensured substantial theoretical and practical expertise. The panel provided feedback on item clarity, relevance, and representativeness. Content validity was assessed using the Content Validity Index (CVI). Items achieving CVI ≥ .80 from at least 6 of 7 experts were retained. The Scale-level CVI average (S-CVI/Ave) was .92, exceeding the recommended threshold of .90. Items were revised or removed based on this feedback, yielding a preliminary scale for pilot testing. The literature suggests various sample sizes for pilot studies. Johanson and Brooks (2009) reported that Isaac and Michael (1995) and Hill (1998) suggested 10 to 30 samples for pilot studies, and they also stated that Mooney and Duval (1993) offered 30-50, and Hertzog (2008) suggested 25-40 for scale development. In our case, the pilot test was conducted with a sample of 33 participants representing similar characteristics of the target study groups. The pilot study aimed to identify any issues with item wording, response options, or overall scale length. Based on the pilot test results, minor revisions were made to enhance item clarity and reliability.

Exploratory Factor Analysis (EFA)

The scale construction phase includes pre-testing, survey administration, item reduction, and factor extraction (Boateng et al., 2018). After administering the scale to the first study group, we proceeded to the next step. The scale evaluation phase involves testing dimensionality, reliability, and validity (Teeluckdharry et al., 2021). The suitability of the data from the first sample (n = 504) for factor analysis was assessed. The Kaiser-Meyer-Olkin (KMO) Measure of Sampling Adequacy was .96, indicating marvelous suitability for factor analysis (Kaiser & Rice, 1974). Additionally, Bartlett’s Test of Sphericity was significant, χ² = 9846.80, p < .001, confirming that the correlation matrix was not an identity matrix and was appropriate for factor analysis. Following this, Exploratory Factor Analysis (EFA) was carried out using SPSS 22 to examine and confirm the factor structure of the AI-SRL Scale. EFA was conducted to reveal the underlying factor structure and to identify clusters of related items, thereby establishing the construct validity of the scale by reducing the data into a smaller number of meaningful factors (Field, 2018). Principal Component Analysis (PCA) was employed as the extraction method, and promax rotation - one of the oblique rotation methods - was used to allow for correlations between factors (Kline, 2014; Muthén & Muthén, 2017). While Principal Axis Factoring (PAF) and Maximum Likelihood (ML) were tested, they did not yield stable or interpretable factor solutions. Given our aim of exploring the underlying dimensions and reducing the item pool into coherent components, PCA was selected for its robustness and interpretability in this context. This approach aligns with prior recommendations for early-stage scale development (Fabrigar & Wegener, 2012; Jolliffe, 2014). To further ensure validity, we used a separate dataset to conduct a Confirmatory Factor Analysis (CFA) via AMOS, which confirmed the PCA-derived factor structure with satisfactory model fit indices. The EFA most often yielded four factors, and in some cases five, based on eigenvalues larger than one. After trying a very large number of models with EFA and CFA, we chose the five-factor solution, because it yielded the best model fit. At the EFA stage, items with factor loadings of 0.4 or higher were retained, as this threshold was considered significant (Floyd & Widaman, 1995; Hair et al., 2010). During the EFA stage, three items were deleted due to cross-loadings below the 0.10 difference threshold (Büyüköztürk, 2018): “I do not share my personal information when using AI tools,” “I check the originality of AI-generated work before using it,” and “I ensure that my peers also use AI tools ethically.” At the CFA stage, nine additional items were removed due to low or marginal standardized loadings, redundancy with higher-loading items, or problematic cross-factor modification indices. These included items related to AI Competence (e.g., “I can quickly learn to use new AI tools”), reflective and evaluative Learning Awareness (e.g., “I evaluate whether AI tools give me accurate information”), and Ethical Collaboration (e.g., “I discuss AI-related ethical issues with peers”). Removal decisions were based on both statistical and conceptual considerations to ensure the final scale retained psychometric rigor and theoretical coherence. The items removed and the rationale for their exclusion are presented in Appendix B. The EFA revealed a five-factor structure, accounting for 62.42% of the total variance. Items loaded strongly on their respective factors, with factor loadings ranging from 0.44 to 0.96.

Table 3
Eigenvalues of parallel analysis

As seen from Table 3, the eigenvalues for the first four factors are higher than one, and the fifth one is 0.99, just below 1.00. Because the fifth factor was supported by the previous literature and the upcoming analyses, we decided that it stay as a factor in the final scale. Parallel analysis was conducted using 1000 randomly generated data matrices with the same number of variables (31) and participants (504) as the actual dataset. The first three factors clearly exceeded the 95th percentile of random eigenvalues, strongly supporting their retention. Factors 4 and 5 had eigenvalues slightly below the 95th percentile threshold but were retained based on (a) theoretical alignment with established SRL frameworks, (b) adequate factor loadings (all > .40), and (c) improved model fit in subsequent CFA. This decision aligns with recommendations to consider multiple criteria beyond eigenvalues alone when determining factor retention (Fabrigar & Wegener, 2012). The five-factor solution was also supported by a scree plot, which illustrates the potential factors within the scale (Eisinga et al., 2013).


Figure 1
Scree plot for the initial Exploratory Factor Analysis

The scree plot also supported a five-factor solution, as the slope of the plot flattened considerably after the fifth factor. Appendix C presents the factor loadings for each item from the initial model, outlining the sub-dimensional nature of the scale. Following this analysis, items 21, 24, and 29 were removed. The remaining 31 items formed a five-factor structure, which was then subjected to CFA using data from the second study group. This subsequent analysis resulted in the final 22-item scale.

Confirmatory Factor Analysis (CFA)

After the initial EFA model was confirmed, we collected new data based on the latest EFA model. The new data was used for CFA. Sun (2005) indicates three main purposes for conducting a CFA: Construct validity evaluation, response pattern comparison and competing model comparison. We used it for evaluating construct validity and model comparison. The CFA, conducted with a separate sample (n = 424), was used to test the five-factor structure identified in the EFA. The initial model was refined based on standardized loadings and modification indices. Items with low loadings or those creating significant cross-factor covariance were iteratively removed to improve model fit while maintaining theoretical coherence. This process resulted in the final 22-item model. Table 2 summarizes the main fit indices from the CFA analyses.

Table 4
CFA results

As Table 4 indicates, the final model demonstrated a good fit to the data, as assessed against conventional fit criteria (Hu & Bentler, 1999). The fit indices were: χ²/df (CMIN/DF) = 2.82, Comparative Fit Index (CFI) = .97, Tucker-Lewis Index (TLI) = .95, Root Mean Square Error of Approximation (RMSEA) = .04 (90% CI [.035, .045]), and Standardized Root Mean Square Residual (SRMR) = .04. As the CFI and TLI were above .95 and the RMSEA and SRMR were below .05, the five-factor model was confirmed as a valid representation of the data.

Confirmatory Factor Analysis (CFA) Model

The figure below illustrates the CFA model and the coefficients between variables and items.


Figure 2
Confirmatory Factor Analysis of AI-SRL Scale with standardized path coefficients

The figure confirms the final model that consists of 5 factors and 22 items. The five factors confirmed through CFA represent distinct dimensions of AI-enhanced self-regulated learning with clear pedagogical implications. AI Competence (Factor 1, 6 items) captures learners' self-efficacy and technical proficiency in utilizing AI tools for educational purposes, reflecting their confidence in adapting to various AI interfaces, understanding AI-generated instructions, and effectively interpreting AI-produced information for learning. Learning Awareness (Factor 2, 4 items) encompasses metacognitive monitoring processes where learners consciously track their understanding, progress, and learning effectiveness while interacting with AI systems, including critical reflection on their dependency on these tools. Learning Strategies (Factor 3, 4 items) involves the selection, adaptation, and implementation of AI-supported cognitive strategies, including how learners use AI for finding effective study methods, improving problem-solving skills, and enhancing critical thinking through multiple perspective analysis. Engagement and Efficiency (Factor 4, 4 items) measures learners' behavioral and motivational regulation in maintaining participation, achieving permanent learning outcomes, and sustaining engagement when using AI tools adapted to their learning style. Ethical Collaboration (Factor 5, 4 items) represents a novel dimension addressing learners' awareness of responsible AI use, including considerations of personal responsibility, privacy concerns, potential biases, and information security in AI-assisted learning contexts. These factors collectively operationalize how contemporary learners navigate the complex landscape of AI-enhanced education while maintaining their self-regulatory capabilities. The complete validated 22-item AI-SRL Scale with all items organized by these five factors is provided in Appendix A. In the upcoming sections, it has been elaborated on the reliability and validity of the AI-SRL scale.

Reliability analysis

We performed several tests to ensure the AI-SRL scale is a reliable scale. These tests are explained below.

Internal consistency

Cronbach's alpha was calculated to assess the internal consistency of the overall scale and each factor, which is used to indicate the overall degree of homogeneity of test items. Cronbach's alpha values above .90 are considered to reflect excellent consistency, while values between .80 and .90 suggest good consistency. Values between .70 and .80 indicate acceptable internal consistency, values between .60 - .70 are questionable, .50 to .60 are poor, and values between .50 are unacceptable (George & Mallery, 2003). The overall scale demonstrated excellent internal consistency (α = .94, 95% CI [.93, .95]). The individual sub-factors also showed good to excellent reliability: AI Competence (α = .91), Learning Awareness (α = .88), Learning Strategies (α = .84), Engagement and Efficiency (α = .86), and Ethical Collaboration (α = .87). Detailed statistics, including item-factor correlations, are presented in Appendix D.

Split-half reliability

Split-half reliability analysis was performed on the AI-SRL scale to further assess the consistency of the test items. Table 5 shows the results of this analysis.

Table 5
Split-half Reliability Analysis Results

Notes: a The items are: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21.

b The items are: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22.

As Table 5 indicates, the Spearman-Brown coefficient was .89 (95% CI [.87, .91]), and the Guttman Split-Half coefficient was .89 (95% CI [.87, .91]), indicating excellent split-half reliability. The correlation between forms was .80 (95% CI [.77, .83]).

Validity

According to Cheung et al. (2024), convergent validity requires CR ≥ 0.70, standardized loadings ≥ 0.50, and AVE ≥ 0.50-all of which are met in our model. Table 6 shows the scores of AI-SRL scales.

Table 6
CR, AVE, MSV, and Factor Correlations

Note: 95% confidence intervals for correlations are available in supplementary materials. All correlations significant at p < .001. CR = Composite Reliability; AVE = Average Variance Extracted; MSV = Maximum Shared Variance.

Discriminant validity was tested using multiple criteria: (a) no item cross-loads across constructs, (b) AVE for each construct exceeded squared inter-construct correlations, and (c) inter-factor correlations fell below the recommended values -none exceeded 0.85- (Cheung et al., 2024; Hair et al., 2010). While some high correlations were found among Learning Awareness, Engagement and Efficiency, and Learning Strategies, these constructs maintained theoretical distinctiveness within the AI‑SRL framework. Thus, their retention was considered both psychometrically justifiable and conceptually meaningful.

Statistical analyses were performed using IBM SPSS Statistics 22 and AMOS 22. All confidence intervals were calculated at the 95% level using bias-corrected accelerated (BCa) bootstrap with 5000 resamples. Parallel analysis employed O'Connor's (2000) SPSS syntax, generating 1000 random correlation matrices for eigenvalue comparison. Significance levels were set at α = .05 for all tests, with Bonferroni corrections applied for multiple comparisons where appropriate. The validated scale offers a reliable and valid measure of SRL in the context of AI, with potential applications in educational research and practice. Future research could explore the scale's use in diverse populations and settings.

DISCUSSION

This study successfully developed and validated the AI-SRL Scale, providing the psychometrically robust instrument specifically designed to measure SRL capabilities in AI-supported educational environments. For the preliminary analyses, a tool of 34 items was developed based on the literature and expert views. Factor analysis utilizing KMO revealed an excellent suitability (.96), and Bartlett’s Test of Sphericity proved the multivariate normal distribution of the dataset. After EFA and PCA to establish meaningful factors and reveal correlations among these factors, 12 items were removed from the scale to preserve the scale's theoretical coherence and psychometric integrity. Based on the responses from 928 participants, EFA and CFA analyses were performed. The content validity, construct validity, internal consistency and split-half reliability results of the study proved that AI-SRL is a psychometrically sound scale in assessing the SRL in AI in educational environments. The finalized version of this scale involved 22 items under 5 factors, which are AI Competence, Learning Awareness, Learning Strategies, Engagement and Efficiency, and Ethical Collaboration.

This five-factor structure shows that AI use in educational environments deepens and contributes to the traditional construct of self-regulation by elaborating on competence, learning awareness, learning strategies, engagement and efficiency, and ethical collaboration. This new five-factor structure complies with the recent literature by highlighting how AI integration into educational settings advances the conventional SRL theories proposed by Zimmerman (1989) and Pintrich (1999). Furthermore, the AI Competence factor in the current study contributes to the previous literature. For example, Bećirović et al. (2025) explored a significant correlation between academic performance and AI self-efficacy. Also, AI Competence factor can be associated to cognitive regulation strategies of Pintrich (1999) since the current study highlighted that AI tools can assist elaboration strategies by providing individualized explanations and suggestions. Moreover, the Learning Awareness factor echoes Zimmerman’s self-observation process of performance phase (Panadero, 2017; Zimmerman, 1989) by indicating AI’s potentials in enhancing metacognitive monitoring and learning awareness. Gkintoni et al. (2025) revealed that feedback from AI tools, enabling simultaneous performance measuring, promote learners’ metacognitive awareness. This awareness supports students in following their learning processes more consciously. In addition, the factor of Ethical Collaboration contributes to the previous literature. For example, the issues such as data privacy, surveillance, algorithmic bias, and academic integrity were highlighted as ethical concerns that may directly affect SRL in educational contexts in terms of AI applications (Gkintoni et al., 2025; Walter, 2024). Thus, this factor underpins students’ ethical responsibilities in using AI tools and highlights the challenges that may emerge in AI-based educational environments. The Ethical Collaboration factor represents a paradigmatic extension beyond classical SRL frameworks, introducing dimensions that neither Zimmerman nor Pintrich anticipated. This factor operationalizes what Pintrich (1999) termed "context regulation" but extends it to encompass algorithmic transparency, data sovereignty, and collaborative ethics in human-AI interactions. The emergence of this factor as statistically and conceptually distinct suggests that ethical considerations in AI-mediated learning constitute not merely an add-on to existing frameworks but a fundamental regulatory domain requiring dedicated theoretical attention. This finding aligns with recent calls for "algorithmic literacy" as a core competency in digital education (Walter, 2024), suggesting that self-regulated learners must now navigate not only cognitive and motivational challenges but also ethical complexities inherent in AI-assisted learning ecosystems. Regarding the factor of Engagement and Efficiency, AI technologies can present invaluable potentials to support cognitive and metacognitive dimensions of SRL while they may fall short in motivational regulation (Lan & Zhou, 2025; Seo et al., 2021; Wang & Lin, 2023). Considering Zimmerman’s (1989) and Pintrich (1999) theories, motivational regulation is closely associated with trust, affinity, and individualized affective support which are provided by AI supported educational tools. Thus, Engagement and Efficiency as a factor in the current study emerges as a critical area for new research to elaborately investigate the motivational impacts of AI implementations. The Learning Strategies factor aligns with both Pintrich's (1999) cognitive control phase and Zimmerman's (1989) performance phase, capturing how learners select, adapt, and implement AI-supported cognitive strategies for learning. This factor reflects the literature's findings that AI tools effectively support strategy adaptation and selection, with students using AI to find effective study methods, improve problem-solving skills, and enhance critical thinking through multiple perspective analysis (Liu et al., 2024; Seo et al., 2021). The inclusion of this factor acknowledges that contemporary learners actively integrate AI recommendations into their strategic repertoire, representing a shift from traditional strategy use to AI-augmented strategic learning approaches that require both technological fluency and metacognitive judgment about when and how to deploy AI-suggested strategies.

CONCLUSION AND FUTURE DIRECTIONS

Rapid advancements in AI tools have made it necessary for educational environments to adapt them into educational processes appropriately. Along with other psychological constructs, it has become evident that developing a measurement tool regarding SRL in using AI in educational environments to employ in research studies for various purposes seems essential. For this purpose, the current study aimed to conduct the validity and reliability study of AI-SRL scale to address this need in the literature. The current study proposed the AI-SRL scale, and it can be used as a psychometrically sound scale to measure students’ SRL skills in AI use in educational environments. The AI-SRL scale can make an original contribution to the literature, and it has potential as an effective instrument for implementations in AI-mediated educational contexts. For future research, cross-cultural validation studies of this instrument can be conducted to analyze its applicability in different cultural settings. Also, the data of the current study were collected in higher education contexts, and the future data can be collected in secondary education contexts or professional contexts to see whether it yields different results.

The AI-SRL scale represents a pioneering contribution to educational measurement by being among the first validated instruments specifically designed to capture the multidimensional nature of SRL in AI-enhanced environments. Unlike generic SRL measures or technology acceptance scales, this instrument uniquely bridges theoretical foundations from educational psychology with contemporary realities of AI integration, offering researchers and practitioners a psychometrically robust tool for understanding how learners navigate the complexities of AI-mediated education. The scale's potential extends beyond individual assessment to inform institutional policy and pedagogical innovation. Educational institutions implementing AI-powered learning systems can utilize the scale to benchmark student readiness, identify support needs, and evaluate the effectiveness of AI integration initiatives. The five-factor structure provides actionable insights for curriculum designers, suggesting that AI literacy programs should address not only technical competence but also metacognitive awareness, strategic adaptation, motivational engagement, and ethical reasoning.

For future research, cross-cultural validation studies of this instrument can be conducted to analyze its applicability in different cultural settings. Also, the data of the current study were collected in higher education contexts, and the future data can be collected in secondary education contexts or professional contexts to see whether it yields different results. Future validation studies should prioritize cross-cultural investigations, particularly in non-Western educational contexts where collectivist learning orientations might interact differently with AI tools. The scale's applicability in K-12 settings warrants investigation, as younger learners may exhibit distinct patterns of AI-enhanced self-regulation. Professional training contexts, including corporate learning environments and continuing education programs, represent another frontier where the AI-SRL scale could illuminate how working professionals integrate AI tools into their lifelong learning practices. Longitudinal studies tracking changes in AI-SRL profiles over time would provide valuable insights into developmental trajectories and the impact of educational interventions.

PRACTICAL IMPLICATIONS AND LIMITATIONS

The AI-SRL scale can be implemented as an effective reliable tool from planning to assessment in educational settings. It can be applied to measure students’ AI-SRL levels, learning analytics can be designed or observed based on the reports from the scale, and it can provide insights for instructional designs. Also, the scale can be employed in face to face, online, and hybrid education. The AI-SRL scale offers concrete applications for enhancing teaching practices across diverse educational modalities. In diagnostic assessment, educators can administer the scale at course onset to identify students who may struggle with AI-enhanced learning components, enabling targeted support interventions. For learning analytics applications, scale scores can be integrated with digital trace data to create comprehensive learner profiles, revealing patterns between self-reported AI-SRL capabilities and actual tool usage behaviors. This integration becomes particularly valuable in massive open online courses (MOOCs) where personalized support at scale remains challenging. In instructional design, the five-factor structure guides the development of scaffolding strategies tailored to different aspects of AI-enhanced learning. For instance, students scoring low on AI Competence might benefit from technical orientation modules, while those with weak Ethical Collaboration scores could engage with case studies on responsible AI use. In hybrid learning environments, the scale can inform decisions about when to deploy AI support versus human instruction, optimizing the blend based on students' self-regulatory profiles. Distance education programs can use the scale to assess whether students possess the requisite AI-SRL skills for success in autonomous learning environments, potentially incorporating scale-based prerequisites or preparatory courses.

However, the current study involved several limitations. First, most of the participants were chosen from foreign language teaching and English language literature departments of universities in Turkey. Involving most of the participants from two departments may limit generalizability of the findings to other higher education contexts. Second, convenience and purposeful sampling methods were used to reach the participants in this study. Non-random sampling may hinder the generalizability of the findings to other populations. Third, the items of the scale can be interpreted differently by individuals from different linguistic backgrounds. Last, disciplinary biases may influence the effective implementation of the instrument negatively. However, several limitations warrant consideration. The scale's development within Turkish higher education contexts, with participants predominantly from language-related disciplines, may have limited cross-cultural and cross-disciplinary generalizability. The English-language items may have introduced response biases among non-native speakers, potentially affecting measurement invariance across linguistic groups. Additionally, the rapidly evolving nature of AI technologies means the scale will require periodic updates to maintain relevance.

Ethical approval

This research received ethical approval from Social Sciences University of Ankara, as per decision number 98038.

Ethical use of AI tools

In accordance with transparency principles, we disclose the use of AI tools in manuscript preparation. We utilized Claude Opus primarily for linguistic refinement—specifically to improve grammatical structures, enhance sentence flow, and ensure consistency in academic terminology across sections. This tool served exclusively as editorial assistants for language polishing, not for generating research content, data analysis, or theoretical contributions. All substantive content, including research design, data collection, statistical analyses, and interpretation of findings, represents original work by the authors. We maintained full editorial control throughout the writing process, critically reviewing and modifying all AI-suggested revisions. The authors assume complete responsibility for the accuracy, originality, and scholarly integrity of the final manuscript.

Data availability

The data and materials used in this study are available upon request from the corresponding author.

Funding

The authors report no funding.

REFERENCES

Adams, D., Chuah, K.-M., Devadason, E., & Azzis, M. S. A. (2024). From novice to navigator: Students’ academic help-seeking behaviour, readiness, and perceived usefulness of ChatGPT in learning. Education and Information Technologies, 29(11), 13617-13634. https://doi.org/10.1007/s10639-023-12427-8

Anggoro, K. J., & Pratiwi, D. I. (2023). Fostering self-assessment in English learning with a generative AI platform: A case of Quizizz AI. Studies in Self-Access Learning Journal, 14(4), 489-501. https://doi.org/10.37237/140406

Baker, R. S. (2023). AI and self-regulated learning theory: What could be on the horizon? Computers in Human Behavior, 147, 107849. https://doi.org/10.1016/j.chb.2023.107849

Bauer, E., Greiff, S., Graesser, A. C., Scheiter, K., & Sailer, M. (2025). Looking beyond the hype: Understanding the effects of AI on learning. Educational Psychology Review, 37(2). https://doi.org/10.1007/s10648-025-10020-8

Bećirović, S., Polz, E., & Tinkel, I. (2025). Exploring students’ AI literacy and its effects on their AI output quality, self-efficacy, and academic performance. Smart Learning Environments, 12, 29. https://doi.org/10.1186/s40561-025-00384-3

Biju, N., Abdelrasheed, N. S. G., Bakiyeva, K., Prasad, K. D. V., & Jember, B. (2024). Which one? AI-assisted language assessment or paper format: An exploration of the impacts on foreign language anxiety, learning attitudes, motivation, and writing performance. Language Testing in Asia, 14, 45. https://doi.org/10.1186/s40468-024-00322-z

Boateng, G. O., Neilands, T. B., Frongillo, E. A., Melgar-Quiñonez, H. R., & Young, S. L. (2018). Best practices for developing and validating scales for health, social, and behavioral research: A primer. Frontiers in Public Health, 6. https://doi.org/10.3389/fpubh.2018.00149

Boguslawski, S., Deer, R., & Dawson, M. G. (2025). Programming education and learner motivation in the age of generative AI: Student and educator perspectives. Information and Learning Sciences, 126(1/2), 91–109. https://doi.org/10.1108/ILS-10-2023-0163

Büyüköztürk, Ş. (2018). Sosyal bilimler için veri analizi el kitabı. Pegem Akademi Yayıncılık. https://doi.org/10.14527/9789756802748

Chang, D. H., Lin, M. P.-C., Hajian, S., & Wang, Q. Q. (2023). Educational design principles of using AI chatbot that supports self-regulated learning in education: Goal setting, feedback, and personalization. Sustainability, 15(17), 12921. https://doi.org/10.3390/su151712921

Cheung, G. W., Cooper-Thomas, H. D., Lau, R. S., & Wang, L. C. (2024). Reporting reliability, convergent and discriminant validity with structural equation modeling: A review and best-practice recommendations. Asia Pacific Journal of Management, 41(2), 745-783. https://doi.org/10.1007/s10490-023-09871-y

Chiu, T. K. F. (2024). A classification tool to foster self-regulated learning with generative artificial intelligence by applying self-determination theory: A case of ChatGPT. Educational Technology Research and Development, 72(4), 2401-2416. https://doi.org/10.1007/s11423-024-10366-w

Chiu, T. K. F., Moorhouse, B. L., Chai, C. S., & Ismailov, M. (2023). Teacher support and student motivation to learn with artificial intelligence (AI) based chatbot. Interactive Learning Environments, 1-17. https://doi.org/10.1080/10494820.2023.2172044

Creswell, J. W. (2014). Research design: Qualitative, quantitative and mixed methods approaches (4th ed.). SAGE.

Dahri, N. A., Yahaya, N., Al-Rahmi, W. M., Aldraiweesh, A., Alturki, U., Almutairy, S., Shutaleva, A., & Soomro, R. B. (2024). Extended TAM based acceptance of AI-powered ChatGPT for supporting metacognitive self-regulated learning in education: A mixed-methods study. Heliyon, 10(8), e29317. https://doi.org/10.1016/j.heliyon.2024.e29317

DeVellis, R. F. (1991). Scale development: Theory and applications. SAGE.

Eisinga, R., te Grotenhuis, M., & Pelzer, B. (2013). The reliability of a two-item scale: Pearson, Cronbach, or Spearman-Brown? International Journal of Public Health, 58(4), 637-642. https://doi.org/10.1007/s00038-012-0416-3

Fabrigar, L. R., & Wegener, D. T. (2012). Exploratory factor analysis. Oxford University Press. https://doi.org/10.1093/acprof:osobl/9780199734177.001.0001

Field, A. P. (2018). Discovering statistics using IBM SPSS statistics (5th ed.). SAGE.

Floyd, F. J., & Widaman, K. F. (1995). Factor analysis in the development and refinement of clinical assessment instruments. Psychological Assessment, 7(3), 286-299. https://doi.org/10.1037/1040-3590.7.3.286

George, D., & Mallery, P. (2003). SPSS for Windows step by step: A simple guide and reference, 11.0 update (4th ed.). Allyn & Bacon.

Gerlich, M. (2025). AI tools in society: Impacts on cognitive offloading and the future of critical thinking. Societies, 15(1), 6. https://doi.org/10.3390/soc15010006

Gkintoni, E., Antonopoulou, H., Sortwell, A., & Halkiopoulos, C. (2025). Challenging cognitive load theory: The role of educational neuroscience and artificial intelligence in redefining learning efficacy. Brain Sciences, 15(2), 203. https://doi.org/10.3390/brainsci15020203

Glick, D., Miedijensky, S., & Zhang, H. (2024). Examining the effect of AI-powered virtual-human training on STEM majors’ self-regulated learning behavior. Frontiers in Education, 9, 1465207. https://doi.org/10.3389/feduc.2024.1465207

Grüneke, T., Guggenberger, T., Hofmeister, S., & Stoetzer, J. C. (2024). AI-enabled self-regulated learning: A multi-layer taxonomy development. In Proceedings of the 32nd European Conference on Information Systems (ECIS 2024), Paphos, Cyprus. (ISBN 978-1-958200-10-0). https://aisel.aisnet.org/ecis2024/track13_learning_teach/track13_learning_teach/4

Hair, J. F., Black, W. C., Babin, B. J., & Anderson, R. E. (2010). Multivariate data analysis (7th ed.). Pearson.

Hertzog, M. A. (2008). Considerations in determining sample size for pilot studies. Research in Nursing & Health, 31, 180–191. https://doi.org/10.1002/nur.20247

Hill, R. (1998). What sample size is ‘enough’ in Internet survey research? Interpersonal Computing and Technology: An Electronic Journal for the 21st Century, 6, 1–10.

Hilpert, J. C., Greene, J. A., & Bernacki, M. (2023). Leveraging complexity frameworks to refine theories of engagement: Advancing self-regulated learning in the age of artificial intelligence. British Journal of Educational Technology, 54(5), 1204-1221. https://doi.org/10.1111/bjet.13340

Hu, L. T., & Bentler, P. M. (1999). Cutoff criteria for fit indexes in covariance structure analysis: Conventional criteria versus new alternatives. Structural Equation Modeling: A Multidisciplinary Journal, 6(1), 1-55. https://doi.org/10.1080/10705519909540118

Huang, A. Y. Q., Lu, O. H. T., & Yang, S. J. H. (2023). Effects of artificial intelligence-enabled personalized recommendations on learners’ learning engagement, motivation, and outcomes in a flipped classroom. Computers & Education, 194, 104684. https://doi.org/10.1016/j.compedu.2022.104684

Isaac, S., & Michael, W. B. (1995). Handbook in research and evaluation: A collection of principles, methods, and strategies useful in the planning, design, and evaluation of studies in education and the behavioral sciences (3rd ed.). EdITS Publishers.

Jin, S.-H., Im, K., Yoo, M., Roll, I., & Seo, K. (2023). Supporting students’ self-regulated learning in online learning using artificial intelligence applications. International Journal of Educational Technology in Higher Education, 20, 37. https://doi.org/10.1186/s41239-023-00406-5

Johanson, G. A., & Brooks, G. P. (2009). Initial scale development: Sample size for pilot studies. Educational and Psychological Measurement, 70(3), 394-400. https://doi.org/10.1177/0013164409355692

Jolliffe, I. (2014). Principal component analysis. In N. Balakrishnan, T. Colton, B. Everitt, & W. W. Piegorsch (Eds.), Wiley StatsRef: Statistics Reference Online. John Wiley & Sons. https://doi.org/10.1002/9781118445112.stat06472

Kaiser, H. F., & Rice, J. (1974). Little jiffy, mark IV. Educational and Psychological Measurement, 34(1), 111-117. https://doi.org/10.1177/001316447403400115

Kline, P. (2014). An easy guide to factor analysis. Routledge. https://doi.org/10.4324/9781315788135

Kong, S. C., & Yang, Y. (2024). A human-centered learning and teaching framework using generative artificial intelligence for self-regulated learning development through domain knowledge learning in K-12 settings. IEEE Transactions on Learning Technologies, 17, 1562-1573. https://doi.org/10.1109/TLT.2024.3392830

Lai, J. W. (2024). Adapting self-regulated learning in an age of generative artificial intelligence chatbots. Future Internet, 16(6), 218. https://doi.org/10.3390/fi16060218

Lan, M., & Zhou, X. (2025). A qualitative systematic review on AI empowered self-regulated learning in higher education. npj Science of Learning, 10(1). https://doi.org/10.1038/s41539-025-00319-0

Li, Y., Sadiq, G., Qambar, G., & Zheng, P. (2025). The impact of students’ use of ChatGPT on their research skills: The mediating effects of autonomous motivation, engagement, and self-directed learning. Education and Information Technologies, 30(4), 4185-4216. https://doi.org/10.1007/s10639-024-12981-9

Liang, J., Wang, L., Luo, J., Yan, Y., & Fan, C. (2023). The relationship between student interaction with generative artificial intelligence and learning achievement: Serial mediating roles of self-efficacy and cognitive engagement. Frontiers in Psychology, 14, 1285392. https://doi.org/10.3389/fpsyg.2023.1285392

Liu, Z. M., Hwang, G. J., Chen, C. Q., Chen, X. D., & Ye, X. D. (2024). Integrating large language models into EFL writing instruction: Effects on performance, self-regulated learning strategies, and motivation. Computer Assisted Language Learning, 1-25. https://doi.org/10.1080/09588221.2024.2389923

Mohamed, A. M., Shaaban, T. S., Bakry, S. H., Guillén-Gámez, F. D., & Strzelecki, A. (2025). Empowering the faculty of education students: Applying AI’s potential for motivating and enhancing learning. Innovative Higher Education, 50(2), 587–609. https://doi.org/10.1007/s10755-024-09747-z

Molenaar, I., Mooij, S. de, Azevedo, R., Bannert, M., Järvelä, S., & Gašević, D. (2023). Measuring self-regulated learning and the role of AI: Five years of research using multimodal multichannel data. Computers in Human Behavior, 139, 107540. https://doi.org/10.1016/j.chb.2022.107540

Mooney, C. Z., & Duval, R. D. (1993). Bootstrapping: A nonparametric approach to statistical inference. SAGE. https://doi.org/10.4135/9781412983532

Morgado, F. F. R., Meireles, J. F. F., Neves, C. M., Amaral, A. C. S., & Ferreira, M. E. C. (2017). Scale development: Ten main limitations and recommendations to improve future research practices. Psicologia: Reflexão e Crítica, 30(1). https://doi.org/10.1186/s41155-016-0057-1

Muthén, L. K., & Muthén, B. O. (2017). Mplus: Statistical analysis with latent variables: User’s guide (Version 8). Muthén & Muthén.

Ng, D. T. K., Tan, C. W., & Leung, J. K. L. (2024). Empowering student self-regulated learning and science education through ChatGPT: A pioneering pilot study. British Journal of Educational Technology, 55(4), 1328-1353. https://doi.org/10.1111/bjet.13454

O’Connor, B. P. (2000). SPSS and SAS programs for determining the number of components using parallel analysis and Velicer’s MAP test. Behavior Research Methods, Instruments, & Computers, 32(3), 396-402. https://doi.org/10.3758/BF03200807

Panadero, E. (2017). A review of self-regulated learning: Six models and four directions for research. Frontiers in Psychology, 8, 422. https://doi.org/10.3389/fpsyg.2017.00422

Pintrich, P. R. (1999). The role of motivation in promoting and sustaining self-regulated learning. International Journal of Educational Research, 31(6), 459-470. https://doi.org/10.1016/S0883-0355(99)00015-4

Pintrich, P. R. (2000). The role of goal orientation in self-regulated learning. In M. Boekaerts, P. R. Pintrich, & M. Zeidner (Eds.), Handbook of self-regulation (pp. 451-502). Academic Press. https://doi.org/10.1016/B978-012109890-2/50043-3

Schunk, D. H., & Zimmerman, B. J. (2012). Self-regulation and learning. In I. B. Weiner, G. E. Miller, & W. M. Reynolds (Eds.), Handbook of psychology: Vol. 7. Educational psychology (2nd ed., pp. 59-78). John Wiley & Sons. https://doi.org/10.1002/9781118133880.hop207003

Seo, K., Tang, J., Roll, I., Fels, S., & Yoon, D. (2021). The impact of artificial intelligence on learner–instructor interaction in online learning. International Journal of Educational Technology in Higher Education, 18(1), 54. https://doi.org/10.1186/s41239-021-00292-9

Sun, J. (2005). Assessing goodness of fit in confirmatory factor analysis. Measurement and Evaluation in Counseling and Development, 37(4), 240-256. https://doi.org/10.1080/07481756.2005.11909764

Teeluckdharry, N., Teeroovengadum, V., & Seebaluck, A. (2021). Scale development in marketing research. In R. Nunkoo, V. Teeroovengadum, & C. M. Ringle (Eds.), Handbook of research methods for marketing management (pp. 17-50). Edward Elgar. https://doi.org/10.4337/9781788976954.00007

Tinajero, C., Mayo, M. E., Villar, E., & Martínez-López, Z. (2024). Classic and modern models of self-regulated learning: Integrative and componential analysis. Frontiers in Psychology, 15, 1307574. https://doi.org/10.3389/fpsyg.2024.1307574

Walter, Y. (2024). Embracing the future of artificial intelligence in the classroom: The relevance of AI literacy, prompt engineering, and critical thinking in modern education. International Journal of Educational Technology in Higher Education, 21(1), 15. https://doi.org/10.1186/s41239-024-00448-3

Wang, C. Y., & Lin, J. J. H. (2023). Utilizing artificial intelligence to support analyzing self-regulated learning: A preliminary mixed-methods evaluation from a human-centered perspective. Computers in Human Behavior, 144, 107721. https://doi.org/10.1016/j.chb.2023.107721

Wu, X. Y., & Chiu, T. K. F. (2025). Integrating learner characteristics and generative AI affordances to enhance self-regulated learning: A configurational analysis. Journal of New Approaches in Educational Research, 14(1). https://doi.org/10.1007/s44322-025-00028-x

Xia, Q., Chiu, T. K. F., Chai, C. S., & Xie, K. (2023). The mediating effects of needs satisfaction on the relationships between prior knowledge and self-regulated learning through artificial intelligence chatbot. British Journal of Educational Technology, 54(4), 967–986. https://doi.org/10.1111/bjet.13305

Yot-Domínguez, C., & Marcelo, C. (2017). University students’ self-regulated learning using digital technologies. International Journal of Educational Technology in Higher Education, 14, 38. https://doi.org/10.1186/s41239-017-0076-8

Zimmerman, B. J. (1989). Models of self-regulated learning and academic achievement. In B. J. Zimmerman & D. H. Schunk (Eds.), Self-regulated learning and academic achievement: Theory, research, and practice (pp. 1-25). Springer. https://doi.org/10.1007/978-1-4612-3618-4_1

APPENDICES

Appendix A
Items from the AI-Enhanced Self-Regulated Learning (AI-SRL) Scale




Appendix B
Items removed from final model with empirical and conceptual rationale

Appendix C
Factor loadings for the initial EFA model

Note: All loadings significant at p < .001. Standard errors ranged from .028 to .045.

Appendix D
Cronbach's Alpha for each factor and item-factor correlations

Note: 95% CIs calculated using 5000 bootstrap samples. All item-total correlations significant at p < .001.

Información adicional

How to cite: Ayaz, M. A., Karataş, F., & Yüce, E. (2026). Development and validation of the AI-Enhanced Self-Regulated Learning (AI-SRL) scale [Desarrollo y validación de la Escala de Aprendizaje Autorregulado con IA (AI-SRL)]. RIED-Revista Iberoamericana de Educación a Distancia, 29(1), 185-216. https://doi.org/10.5944/ried.45452

Información adicional

redalyc-journal-id: 3314



Buscar:
Ir a la Página
IR
Visor de artículos científicos generados a partir de XML-JATS por