Observational Study
Copyright ©The Author(s) 2025.
World J Transplant. Sep 18, 2025; 15(3): 103536
Published online Sep 18, 2025. doi: 10.5500/wjt.v15.i3.103536
Table 1 Overall ChatGPT performance in assigning context labels across 294 virtual cases, highlighting agreement with predefined labels, n (%)
Actual label
Assigned label, GI
Assigned label, diagnosis
Assigned label, DD
Assigned label, treatment
Assigned label, prognosis
Assigned label, total
GI33 (11.22)20 (6.8)6 (24)16 (5.44)3 (12)78 (26.53)
Diagnosis8 (2.72)33 (11.22)5 (1.7)0 (0)2 (0.68)48 (16.33)
DD0 (0)11 (3.74)11 (3.74)0 (0)0 (0)22 (7.48)
Treatment9 (36)10 (3.4)2 (0.68)64 (21.77)1 (0.34)86 (29.25)
Prognosis12 (48)12 (48)3 (12)3 (12)30 (10.2)60 (20.41)
Total62 (219)86 (29.25)27 (9.18)83 (28.23)36 (12.24)294 (100)