T20: Generating caption from WSI¶


Objective:
The objective of this task is to generate a caption that encapsulates the relevant clinical information such as tissue type and diagnosis from whole-slide images (WSI). The WSI corresponds to a tissue sample obtained from the colon, rectum or cervix.

Patient Population:
The WSI-report pairs are obtained from the ExaMode project, which provides a large dataset ~10k comprising colon and cervix biopsies and polyps and their corresponding Dutch pathology reports.

Image-Report Data:

  • A subsample of the initial dataset was selected to curate the challenge dataset for this task.
  • Given that most of the biopsies are benign (more than half), a balanced selection of cases across each diagnostic category was made to ensure diversity within the dataset.
  • The reports have been preprocessed to retain only the conclusion section.

Test Data:
The test data is the WSI without the pathology report. Participants must generate a caption as a string in Dutch that encapsulates the relevant clinical information such as tissue type and diagnosis from WSI.

Reference Standard:
The reference standard consists of the conclusion section from the original Dutch pathology report associated with each WSI, as it captures the pathologist’s final diagnostic interpretation.

Evaluation Metrics:
The generated caption is evaluated by comparing its similarity to the original pathology report using the following metrics:

  • BLEU-4
  • ROUGE-L
  • CIDEr
  • METEOR
  • BERTScore F1

The final score is the average of all five metrics.