Will ChatGPT pass the Polish specialty exam in radiology and diagnostic imaging? : insights into strengths and limitations

2023
journal article
article
13
dc.abstract.enPurpose: Rapid development of artificial intelligence has aroused curiosity regarding its potential applications in medical field. The purpose of this article was to present the performance of ChatGPT, a state-of-the-art language model in relation to pass rate of national specialty examination (PES) in radiology and imaging diagnostics within Polish education system. Additionally, the study aimed to identify the strengths and limitations of the model through a detailed analysis of issues raised by exam questions. Material and methods: The present study utilized a PES exam consisting of 120 questions, provided by Medical Examinations Center in Lodz. Questions were administered using openai.com platform that grants free access to GPT-3.5 model. All questions were categorized according to Bloom’s taxonomy to assess their complexity and difficulty. Following the answer to each exam question, ChatGPT was asked to rate its confidence on a scale of 1 to 5 to evaluate the accuracy of its response. Results: ChatGPT did not reach the pass rate threshold of PES exam (52%); however, it was close in certain question categories. No significant differences were observed in the percentage of correct answers across question types and sub-types. Conclusions: The performance of the ChatGPT model in the pass rate of PES exam in radiology and imaging diagnostics in Poland is yet to be determined, which requires further research on improved versions of ChatGPT.
dc.contributor.authorKufel, Jakub
dc.contributor.authorPaszkiewicz, Iga
dc.contributor.authorBielówka, Michał
dc.contributor.authorBartnikowska, Wiktoria
dc.contributor.authorJanik, Michał
dc.contributor.authorStencel, Magdalena
dc.contributor.authorStencel, Łukasz
dc.contributor.authorStencel, Katarzyna
dc.contributor.authorMielcarska, Sylwia
dc.date.accessioned2024-07-23T10:10:11Z
dc.date.available2024-07-23T10:10:11Z
dc.date.issued2023
dc.date.openaccess0
dc.description.accesstimew momencie opublikowania
dc.description.additionalBibliogr. s. e434
dc.description.physicale430-e434
dc.description.versionostateczna wersja wydawcy
dc.description.volume88
dc.identifier.doi10.5114/pjr.2023.131215
dc.identifier.issn1733-134X
dc.identifier.urihttps://ruj.uj.edu.pl/handle/item/389193
dc.languageeng
dc.language.containereng
dc.rightsUdzielam licencji. Uznanie autorstwa - Użycie niekomercyjne - Bez utworów zależnych 4.0 Międzynarodowa
dc.rights.licenceCC-BY-NC-ND
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/4.0/legalcode.pl
dc.share.typeotwarte czasopismo
dc.subject.enChatGPT
dc.subject.endeep learning
dc.subject.enlarge language model
dc.subject.enartificial intelligence
dc.subtypeArticle
dc.titleWill ChatGPT pass the Polish specialty exam in radiology and diagnostic imaging? : insights into strengths and limitations
dc.title.journalPolish Journal of Radiology
dc.typeJournalArticle
dspace.entity.typePublicationen
dc.abstract.en
Purpose: Rapid development of artificial intelligence has aroused curiosity regarding its potential applications in medical field. The purpose of this article was to present the performance of ChatGPT, a state-of-the-art language model in relation to pass rate of national specialty examination (PES) in radiology and imaging diagnostics within Polish education system. Additionally, the study aimed to identify the strengths and limitations of the model through a detailed analysis of issues raised by exam questions. Material and methods: The present study utilized a PES exam consisting of 120 questions, provided by Medical Examinations Center in Lodz. Questions were administered using openai.com platform that grants free access to GPT-3.5 model. All questions were categorized according to Bloom’s taxonomy to assess their complexity and difficulty. Following the answer to each exam question, ChatGPT was asked to rate its confidence on a scale of 1 to 5 to evaluate the accuracy of its response. Results: ChatGPT did not reach the pass rate threshold of PES exam (52%); however, it was close in certain question categories. No significant differences were observed in the percentage of correct answers across question types and sub-types. Conclusions: The performance of the ChatGPT model in the pass rate of PES exam in radiology and imaging diagnostics in Poland is yet to be determined, which requires further research on improved versions of ChatGPT.
dc.contributor.author
Kufel, Jakub
dc.contributor.author
Paszkiewicz, Iga
dc.contributor.author
Bielówka, Michał
dc.contributor.author
Bartnikowska, Wiktoria
dc.contributor.author
Janik, Michał
dc.contributor.author
Stencel, Magdalena
dc.contributor.author
Stencel, Łukasz
dc.contributor.author
Stencel, Katarzyna
dc.contributor.author
Mielcarska, Sylwia
dc.date.accessioned
2024-07-23T10:10:11Z
dc.date.available
2024-07-23T10:10:11Z
dc.date.issued
2023
dc.date.openaccess
0
dc.description.accesstime
w momencie opublikowania
dc.description.additional
Bibliogr. s. e434
dc.description.physical
e430-e434
dc.description.version
ostateczna wersja wydawcy
dc.description.volume
88
dc.identifier.doi
10.5114/pjr.2023.131215
dc.identifier.issn
1733-134X
dc.identifier.uri
https://ruj.uj.edu.pl/handle/item/389193
dc.language
eng
dc.language.container
eng
dc.rights
Udzielam licencji. Uznanie autorstwa - Użycie niekomercyjne - Bez utworów zależnych 4.0 Międzynarodowa
dc.rights.licence
CC-BY-NC-ND
dc.rights.uri
http://creativecommons.org/licenses/by-nc-nd/4.0/legalcode.pl
dc.share.type
otwarte czasopismo
dc.subject.en
ChatGPT
dc.subject.en
deep learning
dc.subject.en
large language model
dc.subject.en
artificial intelligence
dc.subtype
Article
dc.title
Will ChatGPT pass the Polish specialty exam in radiology and diagnostic imaging? : insights into strengths and limitations
dc.title.journal
Polish Journal of Radiology
dc.type
JournalArticle
dspace.entity.typeen
Publication
Affiliations

* The migration of download and view statistics prior to the date of April 8, 2024 is in progress.

Views
12
Views per month
Views per city
Krakow
7
Gdansk
3
Knurów
1
Downloads
kufel_et-al_will_chatgpt_pass_the_polish_specialty_exam_in_radiology_2023.pdf
16