The information is subject to change during the first days after launch, as we continue updating and refining the details.
Tasks Overview¶
The UNICORN challenge includes 20 tasks designed to evaluate model performance across both radiology and pathology. Each task has a defined time limit, which sets the maximum allowed runtime for the algorithm Docker container. For vision and vision-language tasks, each case is processed individually, so the time limit applies per case. For language tasks, the algorithm processes all reports in a single run, meaning the time limit applies to the entire dataset.¶
Note: Additional tasks may be introduced in the future, also as additional (hidden) test data.
ID | Task Name | Task Type | Modality | Domain | Metric | Time limit |
---|---|---|---|---|---|---|
T1 | Classifying HE prostate biopsies into ISUP scores | Classification | Vision | Pathology | quadratic weighted kappa | 600s |
T2 | Classifying lung nodule malignancy in CT | Classification | Vision | Radiology | AUC | 300s |
T3 | Predicting the time to biochemical recurrence in HE prostatectomies | Regression | Vision | Pathology | censored c-index | 1500s |
T4 | Predicting slide-level tumor proportion score in NSCLC IHC-stained WSI | Classification | Vision | Pathology | quadratic weighted kappa | 600s |
T5 | Detecting signet ring cells in HE-stained WSI of gastric cancer | Detection | Vision | Pathology | F1 score | 300s |
T6 | Detecting clinically significant cancer in prostate MRI exams | Detection | Vision | Radiology | average of AUROC and AP | 600s |
T7 | Detecting lung nodules in thoracic CT | Detection | Vision | Radiology | sensitivity | 300s |
T8 | Detecting mitotic figures in breast cancer HE-stained WSIs | Detection | Vision | Pathology | F1 score | 300s |
T9 | Segmenting ROIs in breast cancer HE-stained WSIs | Segmentation | Vision | Pathology | DICE | 300s |
T10 | Segmenting lesions within ROIs in CT | Segmentation | Vision | Radiology | DICE, long- and short-axis errors | 600s |
T11 | Segmenting three anatomical structures in lumbar spine MRI | Segmentation | Vision | Radiology | DICE | 600s |
T12 | Predicting histopathology sample origin | Classification | Language | Pathology | unweighted kappa | 7200s |
T13 | Classifying pulmonary nodule presence | Classification | Language | Radiology | AUC | 7200s |
T14 | Classifying kidney abnormality | Classification | Language | Radiology | AUC | 7200s |
T15 | Predicting Hip Kellgren-Lawrence score | Classification | Language | Radiology | unweighted kappa | 7200s |
T16 | Classifying colon histopathology diagnosis | Classification | Language | Pathology | macro AUC | 7200s |
T17 | Predicting lesion size measurements | Regression | Language | Radiology | RSMAPE | 7200s |
T18 | Predicting prostate volume, PSA, and PSA density | Regression | Language | Radiology | RSMAPE | 7200s |
T19 | Anonymizing report | Named Entity Recognition | Language | Radiology + Pathology | weighted F1 | 7200s |
T20 | Generating caption from WSI | Generation | Vision-Language | Pathology | BLEU-4, ROUGE-L, METEOR, CIDER | 1500 s |