Minireviews
Copyright © The Author(s) 2021.
World J Gastroenterol. Oct 14, 2021; 27(38): 6399-6414
Published online Oct 14, 2021. doi: 10.3748/wjg.v27.i38.6399
Table 2 Most common evaluation metrics found in the state of the art for detection, segmentation and classification tasks
Term | Symbol | Description
Positive | P | Number of real positive cases in the data
Negative | N | Number of real negative cases in the data
True positive | TP | Number of positive cases correctly classified/detected
True negative | TN | Number of negative cases correctly classified/detected
False positive | FP | Number of instances incorrectly classified/detected as positive
False negative | FN | Number of instances incorrectly classified/detected as negative
Area under curve | AUC | Area under the ROC curve
Term | Task | Formula
Accuracy | C, D, S | (TP + TN)/(TP + TN + FN + FP)
Precision/PPV | C, D, S | TP/(TP + FP)
Sensitivity/Recall/TPR | C, D, S | TP/(TP + FN)
Specificity/TNR | C, D, S | TN/(TN + FP)
FPR | C, D, S | FP/(TN + FP)
FNR | C, D, S | FN/(TP + FN)
F1-score/Dice index | C, D, S | 2 ∙ (precision ∙ recall)/(precision + recall)
F2-score | C, D, S | 5 ∙ (precision ∙ recall)/(4 ∙ precision + recall)
IoU/Jaccard index | D, S | (target ∩ prediction)/(target ∪ prediction)
AAC | D, S | (detected area ∩ real area)/(real area)
C: Classification; D: Detection; S: Segmentation.
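
The formulas in Table 2 can be sketched in a few lines of Python. This is only an illustrative sketch, not code from the reviewed works. The names `ConfusionCounts`, `safe_div`, `f_beta`, `classification_metrics`, `iou` and `aac` are hypothetical, and so are the example counts and masks. Two assumptions are made: a ratio with a zero denominator is reported as 0.0, and regions are represented as sets of pixel coordinates. The F1- and F2-scores are computed from the general F-beta form, (1 + β²) ∙ (precision ∙ recall)/(β² ∙ precision + recall), with β = 1 and β = 2.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ConfusionCounts:
    """Hypothetical container for the four counts defined in Table 2."""
    tp: int
    tn: int
    fp: int
    fn: int


def safe_div(num: float, den: float) -> float:
    # Assumption: an undefined ratio (zero denominator) is reported as 0.0.
    return num / den if den else 0.0


def f_beta(precision: float, recall: float, beta: float) -> float:
    # General F-beta score; beta=1 gives F1 (Dice), beta=2 gives F2.
    b2 = beta * beta
    return safe_div((1 + b2) * precision * recall, b2 * precision + recall)


def classification_metrics(c: ConfusionCounts) -> dict:
    """Count-based metrics from Table 2 (tasks C, D, S)."""
    precision = safe_div(c.tp, c.tp + c.fp)
    recall = safe_div(c.tp, c.tp + c.fn)
    return {
        "accuracy": safe_div(c.tp + c.tn, c.tp + c.tn + c.fn + c.fp),
        "precision": precision,
        "recall": recall,
        "specificity": safe_div(c.tn, c.tn + c.fp),
        "fpr": safe_div(c.fp, c.tn + c.fp),
        "fnr": safe_div(c.fn, c.tp + c.fn),
        "f1": f_beta(precision, recall, 1.0),
        "f2": f_beta(precision, recall, 2.0),
    }


def iou(target: set, prediction: set) -> float:
    # Intersection over union (Jaccard index) of two pixel sets.
    return safe_div(len(target & prediction), len(target | prediction))


def aac(detected: set, real: set) -> float:
    # Share of the real (annotated) area that the detection covers.
    return safe_div(len(detected & real), len(real))


# Illustrative values only (not taken from any study).
counts = ConfusionCounts(tp=80, tn=90, fp=10, fn=20)
for name, value in classification_metrics(counts).items():
    print(f"{name}: {value:.4f}")

target_mask = {(0, 0), (0, 1), (1, 0), (1, 1)}
predicted_mask = {(0, 1), (1, 1), (0, 2), (1, 2)}
print(f"iou: {iou(target_mask, predicted_mask):.4f}")
print(f"aac: {aac(predicted_mask, target_mask):.4f}")
```

IoU penalises both missed and spurious pixels, whereas AAC only measures how much of the real area is covered, so a prediction that is much larger than the target can still reach an AAC of 1.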