Evaluation of the Claude AI Assistant's Performance on the Iranian Master's Entrance Exam in Medical Physics

Publish Year: 1402
Document type: Conference paper
Language: English

The full text of this paper has not been provided and is not available.

National scientific document ID: RSACONG03_045

Indexing date: 20 Azar 1402

Abstract:

Aim: This study aimed to assess the performance of the Claude AI assistant [2] on a multiple-choice exam covering key topics in medical physics and to identify areas needing improvement. Methods: Claude was given a 160-question multiple-choice exam from the Iranian Master's Entrance Exam in Medical Physics directly in PDF form [1], without the use of any OCR tools. Claude provided its best reasoned answers, which were compared with the answer key to calculate the percentage correct overall and by topic. Results: Overall, Claude achieved 61% accuracy against the answer key. Performance was strongest in general English (68%), physiology and anatomy (67%), and radiation physics, general physics, and math (60% each). Weaker areas were nuclear/atomic physics (55% correct), radiobiology (58%), and biology (60%). Conclusion: The Claude AI assistant demonstrated a foundational command of key physics topics, with room for improvement in specialized medical applications. Additional training focused on nuclear physics, radiobiology, and the biological sciences would further enhance Claude's performance on medical physics exams and on tasks requiring cross-disciplinary knowledge. Even so, Claude shows promise in integrating physics and medical concepts.
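
The scoring step described in the Methods (comparing answers with the answer key to obtain the percentage correct overall and by topic) could be sketched roughly as follows. This is only an illustrative sketch, not the authors' procedure: the function name score_by_topic, the dictionary layout, and the four toy questions are all hypothetical and are not data from the study.

```python
from collections import defaultdict


def score_by_topic(responses, answer_key, topics):
    """Return (overall_percent, {topic: percent}) for a multiple-choice exam.

    responses, answer_key: dicts mapping question id -> chosen/correct option.
    topics: dict mapping question id -> topic label.
    Assumes answer_key is non-empty and every keyed question has a topic.
    """
    correct = defaultdict(int)
    total = defaultdict(int)
    for qid, key in answer_key.items():
        topic = topics[qid]
        total[topic] += 1
        # A missing response counts as incorrect.
        if responses.get(qid) == key:
            correct[topic] += 1
    overall = 100.0 * sum(correct.values()) / sum(total.values())
    per_topic = {t: 100.0 * correct[t] / total[t] for t in total}
    return overall, per_topic


# Hypothetical toy data (NOT taken from the exam or the paper).
answer_key = {1: "a", 2: "c", 3: "b", 4: "d"}
topics = {
    1: "radiobiology",
    2: "radiobiology",
    3: "general physics",
    4: "general physics",
}
responses = {1: "a", 2: "b", 3: "b", 4: "d"}

overall, per_topic = score_by_topic(responses, answer_key, topics)
print(overall, per_topic)
```

Note that the overall figure here is computed over all questions, so topics with more questions weigh more heavily than they would in a simple average of the per-topic percentages.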

Authors

Saeed Dabirifar

Department of Radiology, Mashhad University of Medical Sciences, Mashhad, Iran
