cover
Contact Name
Natris Idriyani
Contact Email
natrisidriyani@uinjkt.ac.id
Phone
-
Journal Mail Official
natrisidriyani@uinjkt.ac.id
Editorial Address
-
Location
Kota tangerang selatan,
Banten
INDONESIA
Jurnal Pengukuran Psikologi dan Pendidikan Indonesia (JP3I)
ISSN : 20896247     EISSN : 26545713     DOI : -
Core Subject : Education, Social,
Jurnal Pengukuran Psikologi dan Pendidikan Indonesia (JP3I) adalah jurnal ilmiah yang diterbitkan oleh Fakultas Psikologi, Universitas Islam Negeri Syarif Hidayatullah Jakarta. Jurnal ini bertujuan untuk memfasilitasi interaksi, diskusi, dan gagasan di antara para ilmuwan psikologi Indonesia. Jurnal ini difokuskan pada Psikologi Pengukuran.
Arjuna Subject : -
Articles 260 Documents
The Development of The Situational Judgement Test (SJT) for Workforce Agility Helmi, Avin Fadilla; Marvianto, Ramadhan Dwi; Al Fariz, Arif Budiman; Anggoro, Indahinsani Purnasari; Anggoro, Wahyu Jati
JP3I (Jurnal Pengukuran Psikologi dan Pendidikan Indonesia) Vol. 14 No. 1 (2025): JP3I
Publisher : FAKULTAS PSIKOLOGI UIN SYARIF HIDAYATULLAH JAKARTA

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.15408/jp3i.v14i1.39333

Abstract

This research aimed to develop workforce agility measuring tool with the situational judgment test (SJT) format and evaluate its psychometric properties. This research included 886 respondents with higher education. The data were analyzed using confirmatory factor analysis (CFA) and multigroup confirmatory factor analysis (MGCFA). The CFA results showed that 18 items have satisfactory psychometric properties, thus forming a fit factor structure and providing valid evidence based on internal structure. Besides, the MGCFA results showed that this measuring instrument has measurement invariance at the strict invariance level to be used as validity evidence based on test consequences. The Omega reliability of this instrument was estimated, and the required criteria were met. The implication is that this measuring instrument has at least three of the five sources of validity to allow solid interpretations of the measuring results.
A New Unidimensional Family Resilience Scale in an Indonesian Context: Development and Psychometric Properties Iqbal, Muhammad; Roebianto, Adiyo; Wardani, Laila Meiliyandrie Indah
JP3I (Jurnal Pengukuran Psikologi dan Pendidikan Indonesia) Vol. 14 No. 2 (2025): JP3I
Publisher : FAKULTAS PSIKOLOGI UIN SYARIF HIDAYATULLAH JAKARTA

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.15408/jp3i.v14i2.39475

Abstract

This study aims to develop a psychological instrument of family resilience using data derived from Indonesian samples based on a five-dimensional resilience scale: physical resilience, psychological resilience, economic resilience, social resilience, and religiosity resilience. Each dimension was measured with five items (25 items in total) on a 4-point Likert scale.  A total of 841 subjects (n = 841) were gathered through purposive sampling and were required to complete the questionnaire. Data were analysed using confirmatory factor analysis (CFA) and Rasch analysis, aimed at generating each item’s validity based on the formulated constructs. Results indicated that all item t-values exceeded 1.96 (t > 1.96), and the model’s statistical indices satisfied the criteria for unidimensionality and goodness of fit. According to the Rasch analysis, all items demonstrated good fit and a high Cronbach’s alpha reliability index. A newly developed family resilience measure for Indonesian populations demonstrates strong construct validity across all dimensions and items, including a unique dimension that recognises the role of religion in family coping. While promising for clinical and research use, further validation and cross-cultural studies are needed to address limitations related to purposive sampling and cultural specificity. These studies should employ longitudinal and cross-cultural designs to enhance generalizability. This will ensure that the instrument provides a robust measure of family resilience for diverse applications. Meanwhile, the limitation of the study is the need to conduct a standardised norming of the instrument.
Measuring the Teacher's Sense of Efficacy Scale for Teachers of Children with Special Needs in Indonesia Rachmahana, Ratna Syifa'a; Muhammad Chirzin; Khoiruddin Bashori; Halim Purnomo
JP3I (Jurnal Pengukuran Psikologi dan Pendidikan Indonesia) Vol. 14 No. 1 (2025): JP3I
Publisher : FAKULTAS PSIKOLOGI UIN SYARIF HIDAYATULLAH JAKARTA

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.15408/jp3i.v14i1.40028

Abstract

This research aims to test the validity and analysis of the Teacher's Sense of Efficacy Scale (short form) measurement tool for teachers of children with special needs in Indonesia. In this study, researchers tested three dimensions developed by Tschannen-Moran and Hoy (2001) for measuring teaching efficacy: Efficacy in Classroom Management, Efficacy in Student Engagement, and Efficacy in Instructional Strategies, with 12 items. The research subjects were 301 teachers of children with special needs. Among the subjects are 225 female and 76 male teachers, with a median age of 26-35. The sampling method utilised in this study is purposive sampling. The data analysis method used in this research is confirmatory factor analysis (CFA). The analysis results show a GFI value of 0.9 97; CFI 0.930; TLI 0.907; RNI 0.930; SRMR 0.044; and RMSEA 0.082. Further, Rasch analysis found that all 12 items of the scale were valid and used in the study. This model allows stakeholders to gain more insights into teachers’ self-efficacy, especially regarding children with special needs.
Mplus and the R mirt Package: A Comparison of Model Parameter Estimation for Generalized Partial Credit Model (GPCM) Al Fariz, Arif Budiman
JP3I (Jurnal Pengukuran Psikologi dan Pendidikan Indonesia) Vol. 13 No. 2 (2024): JP3I
Publisher : FAKULTAS PSIKOLOGI UIN SYARIF HIDAYATULLAH JAKARTA

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.15408/jp3i.v13i2.40344

Abstract

This article aims to carry out an empirical demonstration to calibrate data using the generalized partial credit model (GPCM) and compare the results of GPCM analysis on paid software, namely Mplus, and open-source software, namely R Package Mirt. The data used in this study used secondary data in the form of item scores from the future orientation scale or Skala orientasi masa depan (S-OMD) with a total of 326 participants using a Likert scale with 4 response options. The results of this study show that GPCM is fit for OMD scale data. Comparison of analysis results using Mplus and R Package mirt shows the same output, such as discrimination parameters and item difficulty levels. The resulting factor score correlation also has a perfect correlation or one. In conclusion, open-source software is capable of having the same computing performance as paid software, and even has several additional features that are not found in paid software.
Development and Validation of the Online Victimization Scale: Confirmatory Factor Analysis and Composite Reliability Al Afghani, Abdullah Azzam; Muzayyin, Muhammad Dzuhri; Maula, Mela Minkhatul
JP3I (Jurnal Pengukuran Psikologi dan Pendidikan Indonesia) Vol. 14 No. 1 (2025): JP3I
Publisher : FAKULTAS PSIKOLOGI UIN SYARIF HIDAYATULLAH JAKARTA

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.15408/jp3i.v14i1.40554

Abstract

Online victimization is harmful actions directed at individuals or institutions through digital technology. This study aims to develop, validate, and examine the psychometric properties of the Online Victimization Scale (OVS). The researchers constructed and adapted several items from the Online Victimization Scale (OVS) by Tynes et al. (2014) and the Perceived Online Racism Scale (PORS) by Keum (2021), resulting in 54 items that were translated into Indonesian. A quantitative research method was employed using an accidental sampling technique. Data were collected from 204 individuals aged 14-23 years who were active social media users. Data analysis was conducted using Confirmatory Factor Analysis (CFA)to evaluate the factor structure. Score reliability was assessed by calculating composite omega and stratified alpha values using the lavaan package in R. The results indicated high inter-factor correlations, prompting a second-order factor analysis. The first-order and second-order models demonstrated good model fit indices with no significant differences, resulting in a final scale of 16 items. Measurement invariance testing using multi-group CFA confirmed that the scale met the criteria for scalar invariance, as indicated by minimal changes in ΔCFI and ΔTLI (<0.01). These findings validate the equivalence of the factor structure, factor loadings, and item intercepts between male and female participants, ensuring that score differences reflect actual differences in the online victimization construct rather than measurement bias.
Construct Validity and Reliability: Perception of Teacher Competency Scale Rakhmania, Raisa; Riyanti, Benedicta Prihatin Dwi; Purwanti, Margaretha
JP3I (Jurnal Pengukuran Psikologi dan Pendidikan Indonesia) Vol. 14 No. 2 (2025): JP3I
Publisher : FAKULTAS PSIKOLOGI UIN SYARIF HIDAYATULLAH JAKARTA

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.15408/jp3i.v14i2.41460

Abstract

Early childhood teacher competency ensures high-quality programs and fosters student development. This competency comprises four key aspects: pedagogical, personality, social, and professional skills. However, Indonesia still lacks a psychometrically validated, perception-based competency scale that fully aligns with both national policy standards and internationally recognised frameworks. To address this gap, this research aimed to develop a self-inventory model of teacher competency. A 60-item questionnaire was sent to 249 early childhood teachers. Construct validity was examined using Exploratory and Confirmatory Factor Analyses. Convergent validity was confirmed by significant factor loadings, Composite Reliability, and Average Variance Extracted values. Criterion-related validity was supported by positive correlations with established measures of creative adaptability, perceived social support, teacher efficacy, and work engagement scale. The final analysis identified seven teacher competency dimensions. This scale also correlates with the other four measuring instruments. Analysis of Cronbach's Alpha Coefficient, Composit Reliability Value, and Average Variance Extract assessed reliability. The scale was refined to 45 valid items. This was followed by Measurement Invariance with Multigroup Confirmatory Factor Analysis, to evaluate potential differences in perceptions among groups categorised by educational background and teaching experience in their comprehension of the item questionnaire. The results revealed no significant differences in perceptions across either group type regarding the measuring instrument. This new scale is expected to enhance future research, educator training, and the quality of early childhood education.
The Indonesian version of the Short Grit Scale (Indonesian Grit-S): Psychometric Properties Based on the Multidimensional Rasch Model Nursalam, Nursalam; Bintang, Rahmat S.; Zainuddin, Kurniati; Mutiah, Diana
JP3I (Jurnal Pengukuran Psikologi dan Pendidikan Indonesia) Vol. 14 No. 2 (2025): JP3I
Publisher : FAKULTAS PSIKOLOGI UIN SYARIF HIDAYATULLAH JAKARTA

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.15408/jp3i.v14i2.41829

Abstract

“Grit” refers to persistence in trying and consistent interest in achieving long-term goals. The Short Grit Scale (Grit-S) is one of the measures used to assess grit. Given that previous research on the Grit-S scale was reported with a CFA approach, this study aims to examine the psychometric properties of the Indonesian Grit-S using Rasch model analysis. Rasch model shows a more detailed analysis than CFA by explaining item difficulty, item fit statistics, reliability, and wright map. Participants in the study were 480 students (male: 86 and female: 394) with an age range of 17-25 years (mean age: 20.05, SD age: 1.32). The sampling method used was non-probability sampling with a convenience sampling technique. The research analysis results show that the Indonesian Grit-S has a good reliability. In addition, all Indonesian Grit-S scale items fit using the Rasch model. Overall, the results showed that the Indonesian Grit-S scale has valid and reliable psychometric properties and can be used to measure grit.
Comparing IRT Models: Summated Scaling Effects on Critical Thinking in Vocational Students Manggaberani, Andi Abdurrahman; Samsul Hadi; Nur Hidayanto Pancoro Setyo Putro; Fajri, Abrar Syahrul; Retnawati, Heri
JP3I (Jurnal Pengukuran Psikologi dan Pendidikan Indonesia) Vol. 14 No. 2 (2025): JP3I
Publisher : FAKULTAS PSIKOLOGI UIN SYARIF HIDAYATULLAH JAKARTA

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.15408/jp3i.v14i2.42886

Abstract

This study investigates the comparative efficacy of Summated Rating Scales (SRS) and traditionalordinal scales (raw Likert-type responses) in measuring critical thinking skills among vocationalstudents, employing Item Response Theory (IRT) to evaluate their psychometric properties.Addressing the limitations of ordinal scales notably inconsistent intervals between responsecategories the research adopts a descriptive quantitative methodology involving 269 students fromstate vocational high schools in Yogyakarta, Indonesia. Data were collected using a five-pointLikert scale instrument, validated for content (Aiken’s V = 0.94), and analyzed through two IRTframeworks: Polytomous IRT for unscaled ordinal data and Continuous Response Model (CRM)IRT for SRS-transformed interval data. Key findings reveal that SRS enhances measurementprecision by normalizing response distributions into proportional intervals (e.g., recalibrated scores:0.00, 0.73, 1.46, 2.07, 2.84), thereby resolving issues of unequal category spacing inherent toordinal scales. Polytomous IRT demonstrated robust item fit (e.g., Partial Credit Model fit for 5/6items) and strong difficulty parameter invariance (r = 0.84), yet exhibited instability in abilityestimates (r = 0.37) due to extreme response patterns. Conversely, CRM IRT applied to scaleddata produced stable ability estimates (r = 0.46) and eliminated infinite values in MaximumLikelihood Estimation, underscoring its superiority in handling continuous metrics. However, ordinalscales retained higher consistency in difficulty calibration across subgroups. The study concludesthat integrating SRS with CRM IRT offers a refined approach for critical thinking assessments,balancing precision and fairness, while ordinal scales remain pragmatic for contexts prioritizingsimplicity. These insights advocate for the adoption of advanced scaling techniques in vocationaleducation to improve the validity of competency evaluations, with recommendations for futureresearch to explore hybrid models and longitudinal applications.
Validating the Indonesian Version of UWES-17: A Rasch Model Analysis of Work Engagement Shaleh, Abdul Rahman; Hayat, Bahrul; Samudera, Arung
JP3I (Jurnal Pengukuran Psikologi dan Pendidikan Indonesia) Vol. 14 No. 1 (2025): JP3I
Publisher : FAKULTAS PSIKOLOGI UIN SYARIF HIDAYATULLAH JAKARTA

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.15408/jp3i.v14i1.46003

Abstract

Work engagement is a positive psychology construct widely sought after and developed in various countries. Psychometric validation requires statistical analysis to measure and differentiate individuals across different cultures. This study aims to validate the Utrecht Work Engagement Scale-17 (UWES-17) instrument on hospitality workers in state-owned companies in Indonesia. The study population comprised 3185 individuals, representing various positions and types of hospitality service jobs within the land transportation sector of a state-owned enterprise. A non-probability sampling approach, specifically convenience sampling, was employed, selecting participants based on their accessibility and willingness to participate. The participants, aged between 18 and 62, came from diverse fields and work areas. Analysis of item responses in the test was the Rasch Model with a study focus on item fit and person fit from the UWES-17 Scale. In addition, assumption tests of unidimensionality and local independence were performed to ensure the validity of the measurement model. The scale’s reliability was also assessed to evaluate the instrument’s consistency. The results showed that the items were overall fit, but the criteria for person fit were unmet. This is due to the tendency for uniformity and too many diverse positions and cultural tendencies of state-owned companies. The implication is that further item development and confirmation are needed on the constituent aspects, especially the absorption constituent items. Additionally, developing a more consistent shorter version or discovering other dimensions of work engagement that are specific to collectivist societies may be possible.
PHQ-9: Validation and Investigation Response Format using Nominal Response Model Marvianto, Ramadhan Dwi; Kusrohmaniah, Sri
JP3I (Jurnal Pengukuran Psikologi dan Pendidikan Indonesia) Vol. 14 No. 2 (2025): JP3I
Publisher : FAKULTAS PSIKOLOGI UIN SYARIF HIDAYATULLAH JAKARTA

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.15408/jp3i.v14i2.46525

Abstract

The Patient Health Questionnaire 9 (PHQ-9) is a widely used tool for depression screening, but its internal structure varies across different contexts and potentially leads to misinterpretations of the depression construct it measures. This study aims to investigate the internal structure and response format effectiveness of the Indonesian version of the PHQ-9. Data were collected from 1,310 participants who completed the PHQ-9 questionnaire, sourced from the Faculty of Psychology of the Universitas Gadjah Mada (UGM)’s database. Data analysis included confirmatory factor analysis (CFA), item factor analysis (IFA), and item response theory (IRT) using a nominal response model (NRM). Results indicated that a two-factor model demonstrated a better fit than a single-factor model, which was categorised as a marginal fit. Furthermore, nearly all items functioned effectively in their response format, except for items 5 (poor appetite) and 9 (suicidal thoughts), which showed suboptimal functioning in the highest categories. These findings support the practical use of the PHQ-9 and underscore the value of collapsing categories when extreme responses are rarely endorsed to improve measurement precision.