Location: Kab. Sleman, Daerah Istimewa Yogyakarta, Indonesia
REiD (Research and Evaluation in Education)
EISSN: 2460-6995
Core Subject: Education
Articles: 173 documents
Item analysis of reading comprehension questions for English proficiency test using Rasch model
Dewi, Henda Harmantia; Damio, Siti Maftuhah; Sukarno, Sukarno
REID (Research and Evaluation in Education) Vol. 9 No. 1 (2023)
Publisher : Graduate School of Universitas Negeri Yogyakarta & Himpunan Evaluasi Pendidikan Indonesia (HEPI)

DOI: 10.21831/reid.v9i1.53514

Abstract

The need to take an English as a foreign language proficiency test (known as TOEFL [Test of English as a Foreign Language]) has been growing in Indonesia. The increasing demand for such a test and its high cost have prompted many institutions to develop TOEFL instruments and administer the test internally. However, constructing a test instrument is a complex process, which makes item analysis more challenging. Meanwhile, item analysis is crucial for assessing the items' quality. Therefore, this study reports the results of a statistical analysis of 20 TOEFL reading comprehension questions in terms of test reliability, item and person fit, and item difficulty level. Thirty-eight members of the English Department Students' Association of a state university in West Java participated in this study by taking the reading test. The data were analyzed with the Rasch model using the Quest program. The results showed that four items (36.8%) did not fulfill the ideal criteria of a valid test because they were either too easy or too difficult for the target test takers; thus, they needed to be discarded. Meanwhile, 16 items (63.2%) were of good quality and can be used immediately in the proficiency test, especially to measure reading comprehension skills, because they fulfilled the standard requirements for a valid test. The findings provide insight into the importance of item analysis in validating test instruments to improve test quality for future administrations.
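As a rough illustration of the model named in this abstract, the dichotomous Rasch model gives the probability of a correct answer as a logistic function of the gap between a person's ability and an item's difficulty. The sketch below is not the Quest program's implementation; the function name and the example values are hypothetical.

```python
import math


def rasch_probability(theta: float, b: float) -> float:
    """Probability of a correct response under the dichotomous Rasch model.

    theta: person ability in logits (hypothetical example value)
    b:     item difficulty in logits (hypothetical example value)
    """
    return 1.0 / (1.0 + math.exp(-(theta - b)))


# A very easy item (b far below theta) is answered correctly by almost
# everyone, and a very hard item by almost no one. Both extremes carry
# little information about the target test takers.
p_easy = rasch_probability(theta=0.0, b=-3.0)
p_matched = rasch_probability(theta=0.0, b=0.0)
p_hard = rasch_probability(theta=0.0, b=3.0)
```

When ability equals difficulty the probability is exactly 0.5, which is where an item discriminates best among test takers of that ability.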
Analysis of critical thinking skills, cognitive learning outcomes, and student activities in learning the human excretory system using an interactive flipbook
Aswanti, Novita Hawsari; Isnaeni, Wiwi
REID (Research and Evaluation in Education) Vol. 9 No. 1 (2023)
Publisher : Graduate School of Universitas Negeri Yogyakarta & Himpunan Evaluasi Pendidikan Indonesia (HEPI)

DOI: 10.21831/reid.v9i1.53126

Abstract

This study aims to: (1) analyze the achievement of critical thinking skills, (2) analyze the achievement of cognitive learning outcomes, (3) determine the learning activities carried out, and (4) determine student responses to the use of interactive flipbook media. This study is quasi-experimental research with a posttest-only control design employing a purposive sampling technique. The data in this study are (1) data on the achievement of critical thinking ability indicators, (2) data on the achievement of cognitive levels (C4, C5, and C6), (3) data on the student learning activities carried out, and (4) data on student responses to the use of interactive flipbook media. The instruments used are (1) critical thinking tests in the form of essay questions, (2) cognitive learning outcomes tests in the form of multiple-choice questions, (3) student activity observation sheets, and (4) student response questionnaires. The results obtained in learning the human excretory system using interactive flipbooks are: (1) critical thinking skills are achieved well on each indicator, (2) cognitive learning outcomes (C4, C5, and C6) are achieved well in each category, (3) the learning activities that appear in the learning process include visual activities, oral activities, and listening activities, and (4) the use of interactive flipbook media receives a positive response from students. Most respondents agree with using interactive flipbook media to learn the human excretory system. Students also judge interactive flipbook media to be interesting, flexible, practical, meaningful, and not dull.
Dominant factors that determine college students completing studies in mathematics education study programs
Dewanti, Sintha Sih; Pramono, Aji Joko Budi
REID (Research and Evaluation in Education) Vol. 9 No. 1 (2023)
Publisher : Graduate School of Universitas Negeri Yogyakarta & Himpunan Evaluasi Pendidikan Indonesia (HEPI)

DOI: 10.21831/reid.v9i1.51081

Abstract

Failure to graduate on time is a problem for both the university and the students themselves. Besides the competencies possessed by students, many other factors affect the completion of student studies. This study aims to reduce the set of variables to those that really affect the completion of student studies. The research was a survey of students of the Mathematics Education Study Program who were in at least semester 7 (currently taking a final project course). There are 17 factors that determine the completion of student studies, namely achievement motivation, discipline, interest, intelligence, study habits, health, part-time work activities, organizational activities, curriculum, mentoring methods, student relations with lecturers, availability of books, internet facilities, family economic conditions, relationships with parents and family members, friends, and social environment. Data analysis was conducted using the Principal Component Analysis (PCA) method to obtain the dominant factors. Based on the results of the study, four main factors that affect the completion of student studies are formed, namely: (1) motivation and academic ability; (2) activities and social environment; (3) facilities and family; and (4) thesis guidance. Together, the four factors explain 86.54% of the variance in student study completion: motivation and academic ability account for 37.57%, activities and social environment for 26.34%, facilities and family for 15.21%, and thesis guidance for 7.42%.
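The cumulative figure in this abstract can be checked by summing the four reported shares, which is how cumulative explained variance is obtained from per-component shares in PCA. The sketch below only uses the percentages quoted in the abstract; the variable names are illustrative.

```python
# Per-factor shares of explained variance, as reported in the abstract (%).
explained = {
    "motivation and academic ability": 37.57,
    "activities and social environment": 26.34,
    "facilities and family": 15.21,
    "thesis guidance": 7.42,
}

# Cumulative explained variance is the sum of the per-factor shares.
cumulative = round(sum(explained.values()), 2)

# The factor with the largest share is the most dominant one.
dominant = max(explained, key=explained.get)
```

The sum reproduces the reported 86.54%, and the first factor alone accounts for a little over two fifths of that total.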
Evaluation of English language improvement program for Information System graduates using a comparative analysis method
Purwasih, Ratih; Rahimullaily, Rahimullaily; Zikri, Zikri
REID (Research and Evaluation in Education) Vol. 9 No. 1 (2023)
Publisher : Graduate School of Universitas Negeri Yogyakarta & Himpunan Evaluasi Pendidikan Indonesia (HEPI)

DOI: 10.21831/reid.v9i1.54025

Abstract

STMIK Indonesia Padang seeks to improve the quality of its graduates by providing several academic training programs in information systems science and soft skills. One of them is training to improve graduates' English skills, including English I and II, English for Career, and TOEFL training. The purpose of the study was to evaluate the success of the English language improvement program given to graduates of STMIK Indonesia Padang by comparing the scores for all English programs. The research method is explanatory. Data processing and analysis used descriptive statistics and comparative mean analysis techniques, with the Friedman test on four data groups and the Wilcoxon test on two data groups. The study used a saturated sample of 170 people, and the samples are dependent. The test statistics showed a significance value (sig.) of less than 0.05. This shows that there is a significant difference between the average scores of English I, English II, English for Career, and TOEFL, whether tested simultaneously or not. Based on descriptive statistics, it was found that the difference did not indicate an increase in the average score on English language skills. Several recommendations can be made for improving English, including (1) implementing continuous training for students, not only at the beginning and end of the semester, and (2) increasing practice in communicating in English, such as participating in debate competitions, storytelling, and speeches.
Pre-service teachers' agentive projections toward innovation in online English Language Teaching (ELT) classes
Farmasari, Santi; Wardana, Lalu Ali; Baharuddin, Baharuddin; Amrullah, Amrullah; Isnaeni, Mh; Lail, Husnul
REID (Research and Evaluation in Education) Vol. 9 No. 1 (2023)
Publisher : Graduate School of Universitas Negeri Yogyakarta & Himpunan Evaluasi Pendidikan Indonesia (HEPI)

DOI: 10.21831/reid.v9i1.51393

Abstract

This study examines pre-service teachers' agentive projections toward innovation in online English Language Teaching (ELT) classes. Employing teacher agency theory, this instrumental case study views projections as agentive when they are informed by the students' ecological aspects (past and present) and oriented toward solving potential learning problems and improving learning outcomes. The study involved 84 pre-service teachers who were asked to voluntarily fill in a questionnaire, submit a lesson plan, and be interviewed. NVivo Pro was used to organize themes. The study indicates that the pre-service teachers (M = 3.81, SD = .590) perceive that innovation in online ELT classes is closely related to the integration of information and technology. As a result, the students' agentive projections were also oriented toward solving technology- and internet-based obstacles, supplemented with innovative learning methods. The research findings may offer important insights for the development of English teaching and learning, providing more capital for pre-service teachers to create ELT innovation in the future.
Developing a religiosity scale for Indonesian Muslim youth
Abdullah, Shodiq; Warsiyah, Warsiyah; Ju'subaidi, Ju'subaidi
REID (Research and Evaluation in Education) Vol. 9 No. 1 (2023)
Publisher : Graduate School of Universitas Negeri Yogyakarta & Himpunan Evaluasi Pendidikan Indonesia (HEPI)

DOI: 10.21831/reid.v9i1.61201

Abstract

This study aims to construct and test the validity of the Islamic youth religiosity scale. The population in this study is Muslim students of senior high schools in Surakarta, Central Java, with a sample of 258 selected using the random sampling technique. The data analysis used the Linear Structural Model. The results show that the RMSEA (≤ 0.08) and GFI (≥ 0.90) values of the four dimensions (belief, ritual, social, commitment) meet the standard values of fit, with the respective values being: belief, RMSEA = 0.055 and GFI = 0.94; ritual, RMSEA = 0.026 and GFI = 0.99; social, RMSEA = 0.059 and GFI = 0.91; commitment, RMSEA = 0.032 and GFI = 0.97. This means that these dimensions (belief, ritual, social, commitment) reflect the religiosity variable positively and fit the empirical data. The most dominant dimension reflecting religiosity is the social dimension, with an average factor loading value >0.05, and the weakest one is confidence, because many of its items have a factor loading <0.05.
Smartphone application for assessing teacher performance
Marlianawati, Hesty; Suranto, Suranto; Kartowagiran, Badrun
REID (Research and Evaluation in Education) Vol. 9 No. 2 (2023)
Publisher : Graduate School of Universitas Negeri Yogyakarta & Himpunan Evaluasi Pendidikan Indonesia (HEPI)

DOI: 10.21831/reid.v9i2.52384

Abstract

This work seeks to design a smartphone application for assessing teacher performance using a 4D development model modified with the Mardapi instrument development model: (1) define (defining); (2) design (deciding instrument requirements, determining application media specifications, creating flowcharts, and creating storyboards); (3) develop (assembling instruments, coding, executing tests, evaluating test results, and interpreting measurement findings); and (4) disseminate (spreading). Proof of the application's viability was established through product testing with 26 principals acting as application users and through evaluations by material and media specialists. Content validity was evaluated using the Aiken method, and construct validity was evaluated using PLS (Partial Least Squares) analysis, which does not require a large sample and therefore suits the modest sample size of this study. Reliability was evaluated using Cronbach's alpha. The results demonstrated that the built application was feasible, as shown by the validity and reliability of the instruments in the application. The content validity test using the Aiken method yielded good validity results, with a mean score of 0.80 for material experts and 0.75 for media experts. The construct validity test using PLS yielded positive results for validity. The estimation of reliability using Cronbach's alpha also yielded positive findings. Therefore, it can be concluded that the developed application for teacher performance evaluation possesses the features of a valid and reliable assessment instrument.
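For readers unfamiliar with the Aiken method mentioned in this abstract, the sketch below computes Aiken's V for a single item as V = Σ(r − lo) / (n(c − 1)), where r is each expert's rating, lo is the lowest rating category, c is the number of categories, and n is the number of experts. The ratings and the 1 to 5 scale are hypothetical and are not taken from the study.

```python
def aiken_v(ratings: list[int], lowest: int, highest: int) -> float:
    """Aiken's V content validity coefficient for one item.

    ratings: one rating per expert (hypothetical data)
    lowest:  lowest possible rating category
    highest: highest possible rating category
    """
    n = len(ratings)
    # Sum of each rating's distance from the lowest category.
    s = sum(r - lowest for r in ratings)
    # highest - lowest equals c - 1, where c is the number of categories.
    return s / (n * (highest - lowest))


# Five hypothetical experts rate one item on a 1-5 relevance scale.
v = aiken_v([4, 5, 4, 5, 4], lowest=1, highest=5)
```

V ranges from 0 (all experts chose the lowest category) to 1 (all chose the highest), so mean values such as those reported in the abstract sit toward the upper end of the scale.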
Stability of estimation item parameter in IRT dichotomy considering the number of participants
Ibrahim, Zulfa Safina; Retnawati, Heri; Irambona, Alfred; Pérez, Beatriz Eugenia Orantes
REID (Research and Evaluation in Education) Vol. 10 No. 1 (2024)
Publisher : Graduate School of Universitas Negeri Yogyakarta & Himpunan Evaluasi Pendidikan Indonesia (HEPI)

DOI: 10.21831/reid.v10i1.73055

Abstract

This research concerns item response theory (IRT), which is needed to measure the quality of a test set, while item parameter estimation is needed to determine the technical properties of a test item. The stability of item parameter estimation is examined to determine the minimum sample that can be used to obtain good item parameter estimates. The purpose of this study is to describe the effect of the number of test takers on the stability of item parameter estimation with the Bayes method (expected a posteriori, EAP) on dichotomous data. This is exploratory descriptive research with a bootstrap approach using the EAP method. The EAP method modifies the likelihood function to include prior information about the participant's θ score. Bootstrap samples were drawn from the original data at ten different sample sizes (100, 150, 250, 300, 500, 700, 1,000, 1,500, 2,000, and 2,500); each was replicated ten times, and item parameter estimation was performed. For each sample size with its ten replications, the Root Mean Squared Difference (RMSD) value was calculated. The results showed that the 2PL model was chosen as the best model. The RMSD values obtained show that the number of test participants affects the stability of item parameter estimation on dichotomous data with the 2PL model. The minimum sample to ensure the stability of item parameter estimates with the 2PL model is 1,000 test participants.
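The stability measure named in this abstract can be sketched as follows, assuming RMSD is taken between the item parameter estimates from one bootstrap replication and a set of reference estimates (for example, those from the full data). The abstract does not give the exact formula used, so this definition, the function name, and the numbers are assumptions for illustration only.

```python
import math


def rmsd(estimates: list[float], reference: list[float]) -> float:
    """Root mean squared difference between two sets of item parameters.

    estimates: parameter estimates from one bootstrap sample (hypothetical)
    reference: reference estimates, e.g. from the full data (hypothetical)
    """
    if len(estimates) != len(reference):
        raise ValueError("both lists must cover the same items")
    squared = [(e - r) ** 2 for e, r in zip(estimates, reference)]
    return math.sqrt(sum(squared) / len(squared))


# Hypothetical difficulty estimates for four items.
reference_b = [-1.0, -0.2, 0.4, 1.1]
small_sample_b = [-1.4, 0.1, 0.9, 0.6]   # noisier, e.g. a small sample
large_sample_b = [-1.05, -0.18, 0.42, 1.08]  # closer, e.g. a large sample

rmsd_small = rmsd(small_sample_b, reference_b)
rmsd_large = rmsd(large_sample_b, reference_b)
```

Under this definition a smaller RMSD means the estimates moved less from the reference, which is the sense in which larger samples are described as more stable.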
Development and validation of a self-assessment-based instrument to measure elementary school students' attitudes in online learning
Setiawan, Ari; Cendana, Wiputra; Ayres, Mark; Yuldashev, Azim Abdurakhmanovich; Setyawati, Sri Panca
REID (Research and Evaluation in Education) Vol. 9 No. 2 (2023)
Publisher : Graduate School of Universitas Negeri Yogyakarta & Himpunan Evaluasi Pendidikan Indonesia (HEPI)

DOI: 10.21831/reid.v9i2.52083

Abstract

Online learning during the COVID-19 pandemic made it difficult for teachers to assess student learning attitudes. The limited availability of instruments for measuring students' attitudes while they are engaged in online learning makes it hard for teachers to assess those attitudes appropriately. The current study, therefore, was mainly intended to produce a self-assessment-based instrument that is feasible for measuring students' attitudes in online learning. To produce such an instrument, we used a developmental research method, following the instrument design steps proposed by McCoach, Gable, and Madura. Furthermore, to establish the feasibility of our instrument, we provided evidence of content validity through experts' judgment data, evidence of construct validity through confirmatory factor analysis (CFA), and reliability estimates with Cronbach's α, based on a limited trial and an expanded trial using response data from sixth graders of elementary school engaged in online learning. Our study has produced a self-assessment-based instrument that uses a summated rating scale and is composed of six components (i.e., honest, disciplined, responsible, polite, caring, and self-confident) and 24 items that have demonstrated evidence of content validity, a stable factor structure, and high reliability estimates.
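The reliability coefficient named in this abstract, Cronbach's α, can be computed from a respondents-by-items score table as α = k/(k − 1) · (1 − Σ item variances / variance of total scores), where k is the number of items. The sketch below uses sample variances and a small set of hypothetical ratings; it is not the authors' analysis.

```python
from statistics import variance


def cronbach_alpha(scores: list[list[float]]) -> float:
    """Cronbach's alpha for a table of scores.

    scores: one row per respondent, one column per item (hypothetical data)
    """
    k = len(scores[0])
    # Variance of each item across respondents.
    item_vars = [variance([row[j] for row in scores]) for j in range(k)]
    # Variance of each respondent's summed (total) score.
    total_var = variance([sum(row) for row in scores])
    return (k / (k - 1)) * (1 - sum(item_vars) / total_var)


# Four hypothetical respondents answering three items on a 1-4 scale.
ratings = [
    [4, 4, 3],
    [3, 3, 3],
    [2, 1, 2],
    [1, 2, 1],
]
alpha = cronbach_alpha(ratings)
```

Alpha approaches 1 when respondents' answers move together across items, which is what a summated rating scale relies on when item scores are added into one total.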
Construction of an instrument for evaluating the teaching process in higher education: Content and construct validity
Setiawan, Risky; Wagiran, Wagiran; Alsamiri, Yasir
REID (Research and Evaluation in Education) Vol. 10 No. 1 (2024)
Publisher : Graduate School of Universitas Negeri Yogyakarta & Himpunan Evaluasi Pendidikan Indonesia (HEPI)

DOI: 10.21831/reid.v10i1.63483

Abstract

This study aims to reveal the content validity, construct validity, and reliability of the instrument for evaluating the teaching process in higher education. This research is development research applying the ADDIE model from Molenda. The indicators evaluated consist of context, inputs, processes, and products. The sample consisted of 1200 students from eight faculties, each represented by three study programs. Data analysis comprised three stages: content validity analysis using the V-Aiken method with six panellists or experts; construct validity testing using Confirmatory Factor Analysis (CFA); and quantitative descriptive and interpretive qualitative analysis using the Miles and Huberman method. The results showed that the developed evaluation instrument had good evidence of content validity, with an average V-Aiken score of 0.752, which was in the high category. The evaluation instrument developed for Universitas Negeri Yogyakarta also meets the criterion for construct validity, with good factor loading values (> 0.3). It has a composite reliability score above 0.7 and Cronbach's alpha above 0.6. The analysis results show that, on all empirical test criteria, the data fit the developed model.