Copyright © The Author(s) 2025.
World J Transplant. Sep 18, 2025; 15(3): 103536
Published online Sep 18, 2025. doi: 10.5500/wjt.v15.i3.103536
Table 2 Overall GPT-4 performance in assigning context labels across 294 virtual cases, showing agreement with predefined labels, n (%)
Actual label | Assigned label, GI | Assigned label, diagnosis | Assigned label, DD | Assigned label, treatment | Assigned label, prognosis | Assigned label, total |
GI | 42 (14.29) | 14 (4.76) | 1 (0.34) | 20 (6.8) | 1 (0.34) | 78 (26.53) |
Diagnosis | 2 (0.68) | 44 (14.97) | 1 (0.34) | 1 (0.34) | 0 (0) | 48 (16.33) |
DD | 0 (0) | 15 (5.1) | 5 (1.7) | 2 (0.68) | 0 (0) | 22 (7.48) |
Treatment | 5 (1.7) | 7 (2.38) | 1 (0.34) | 73 (24.83) | 0 (0) | 86 (29.25) |
Prognosis | 10 (3.4) | 11 (3.74) | 5 (1.7) | 7 (2.38) | 27 (9.18) | 60 (20.41) |
Total | 59 (20.07) | 91 (30.95) | 13 (4.42) | 103 (35.03) | 28 (9.52) | 294 (100) |
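The table can be read as a confusion matrix: rows are the predefined (actual) labels, columns are the labels GPT-4 assigned, and the diagonal cells are the cases where the two agree. The following is a minimal sketch of how the totals and agreement rates could be recomputed from the counts above. The variable names (`counts`, `per_label_agreement`, and so on) are illustrative assumptions, and the derived rates are plain arithmetic on the table, not figures quoted from the article.

```python
# Counts transcribed from Table 2.
# Rows: actual (predefined) label; columns: label assigned by GPT-4.
labels = ["GI", "Diagnosis", "DD", "Treatment", "Prognosis"]
counts = [
    [42, 14, 1, 20, 1],   # actual GI
    [2, 44, 1, 1, 0],     # actual Diagnosis
    [0, 15, 5, 2, 0],     # actual DD
    [5, 7, 1, 73, 0],     # actual Treatment
    [10, 11, 5, 7, 27],   # actual Prognosis
]

n = len(labels)
total_cases = sum(sum(row) for row in counts)

# Row totals: how many cases carry each predefined label.
row_totals = [sum(row) for row in counts]

# Column totals: how many cases GPT-4 assigned to each label.
column_totals = [sum(row[j] for row in counts) for j in range(n)]

# Diagonal cells: assigned label matches the predefined label.
agreements = sum(counts[i][i] for i in range(n))
overall_agreement_pct = 100 * agreements / total_cases

# Share of each predefined label that GPT-4 reproduced (row-wise agreement).
per_label_agreement = {
    labels[i]: 100 * counts[i][i] / row_totals[i] for i in range(n)
}

print(f"Total cases: {total_cases}")
print(f"Overall agreement: {agreements}/{total_cases} ({overall_agreement_pct:.2f}%)")
for j, name in enumerate(labels):
    share = 100 * column_totals[j] / total_cases
    print(f"Assigned {name}: {column_totals[j]} ({share:.2f}%)")
for name, pct in per_label_agreement.items():
    print(f"Agreement for actual {name}: {pct:.2f}%")
```

Under these counts the diagonal sums to 191 of 294 cases, and the column totals reproduce the Total row of the table, which is a quick way to check the transcription.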
- Citation: Christou CD, Sitsiani O, Boutos P, Katsanos G, Papadakis G, Tefas A, Papalois V, Tsoulfas G. Comparison of ChatGPT-3.5 and GPT-4 as potential tools in artificial intelligence-assisted clinical practice in renal and liver transplantation. World J Transplant 2025; 15(3): 103536
- URL: https://www.wjgnet.com/2220-3230/full/v15/i3/103536.htm
- DOI: https://dx.doi.org/10.5500/wjt.v15.i3.103536