Investigating Raters' Expertise in Assessing Oral Proficiency: Using the Multifaceted Rasch Model

Publication year: 1397 (Solar Hijri)
Document type: Journal article
Language: English

Length: 22 pages (PDF)

National scientific document ID:

JR_LGHOR-2-2_005

Indexing date: 17 Mehr 1398

Abstract:

Because oral language proficiency is scored by raters, raters are an essential part of performance assessment. One important rater characteristic that has attracted considerable attention is teaching and rating experience. In most previous studies on rater training, extremely severe or lenient raters benefited more from training programs, and their rating behavior showed a significant reduction in severity or leniency after training. However, those studies mostly applied FACETS to only one or two facets, and few used a pre- and post-training design. Moreover, empirical studies have reported contrasting outcomes and do not clearly show which group of raters rates more reliably. In this study, 20 experienced and inexperienced raters rated the oral performances of 200 test-takers before and after a training program. The results indicated that training leads to higher measures of interrater consistency and reduces measures of bias in the use of rating scale categories. Moreover, since it is almost impossible to eliminate rater variability completely even with training, rater training is better regarded as a procedure for making raters more self-consistent (intrarater reliability) than for making them consistent with each other (interrater reliability). The findings indicated that the rating quality of both inexperienced and experienced raters improved after training; however, inexperienced raters showed higher consistency and less bias. Hence, there is no evidence that inexperienced raters should be excluded from rating solely because they lack experience. Inexperienced raters are also less costly to employ than experienced ones. Therefore, instead of spending a large budget on experienced raters, decision-makers would do better to use it to establish better training programs.

Keywords:

Bias, Interrater consistency, Intrarater consistency, Multifaceted Rasch measurement (MFRM), Rater expertise

Authors

Houman Bijani

Zanjan Branch, Islamic Azad University, Zanjan, Iran
