Observational Study
Copyright ©The Author(s) 2025.
World J Transplant. Sep 18, 2025; 15(3): 103536
Published online Sep 18, 2025. doi: 10.5500/wjt.v15.i3.103536
Table 2 Overall GPT-4 performance in assigning context labels across 294 virtual cases, highlighting agreement with predefined labels, n (%)
| Actual label | Assigned label: GI | Assigned label: Diagnosis | Assigned label: DD | Assigned label: Treatment | Assigned label: Prognosis | Total |
| --- | --- | --- | --- | --- | --- | --- |
| GI | 42 (14.29) | 14 (4.76) | 1 (0.34) | 20 (6.8) | 1 (0.34) | 78 (26.53) |
| Diagnosis | 2 (0.68) | 44 (14.97) | 1 (0.34) | 1 (0.34) | 0 (0) | 48 (16.33) |
| DD | 0 (0) | 15 (5.1) | 5 (1.7) | 2 (0.68) | 0 (0) | 22 (7.48) |
| Treatment | 5 (1.7) | 7 (2.38) | 1 (0.34) | 73 (24.83) | 0 (0) | 86 (29.25) |
| Prognosis | 10 (3.4) | 11 (3.74) | 5 (1.7) | 7 (2.38) | 27 (9.18) | 60 (20.41) |
| Total | 59 (20.07) | 91 (30.95) | 13 (4.42) | 103 (35.03) | 28 (9.52) | 294 (100) |
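
The percentages in Table 2 are each count divided by the 294 cases, and the diagonal cells are the cases where the assigned label matched the predefined one. The sketch below recomputes the marginal totals and an overall agreement figure from the counts in the table. It is illustrative only: the variable and function names (`COUNTS`, `pct`, `agreement`) are assumptions introduced here, and the agreement value is derived from the table rather than quoted from the article.

```python
# Counts copied from Table 2.
# Rows are the actual (predefined) labels; columns are the labels GPT-4 assigned.
LABELS = ["GI", "Diagnosis", "DD", "Treatment", "Prognosis"]
COUNTS = [
    [42, 14, 1, 20, 1],   # actual: GI
    [2, 44, 1, 1, 0],     # actual: Diagnosis
    [0, 15, 5, 2, 0],     # actual: DD
    [5, 7, 1, 73, 0],     # actual: Treatment
    [10, 11, 5, 7, 27],   # actual: Prognosis
]


def pct(n, total):
    """Return n as a percentage of total, rounded to two decimals."""
    return round(100 * n / total, 2)


total = sum(sum(row) for row in COUNTS)
row_totals = [sum(row) for row in COUNTS]          # cases per actual label
col_totals = [sum(col) for col in zip(*COUNTS)]    # cases per assigned label

# Diagonal cells: assigned label equals the actual label.
agreement = sum(COUNTS[i][i] for i in range(len(LABELS)))

for label, n in zip(LABELS, col_totals):
    print(f"Assigned {label}: {n} ({pct(n, total)})")
print(f"Overall agreement: {agreement}/{total} ({pct(agreement, total)}%)")
```

Run against these counts, the column totals for GI and Treatment come out as 59 (20.07) and 103 (35.03), consistent with the other entries in the Total row.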