Best Performing Participants 🎉

The PUMA challenge finished on the 15th of March 2025. Starting in April, the challenge will reopen as a rolling challenge.

Below are the final rankings for each track:

Final Rankings – Track 1

Rank	Team	Summed Macro F1	Macro F1	Micro Dice (Tissue)	Mean Summed Nuclei F1 & Micro Dice	Mean Averaged Nuclei F1 & Micro Dice	Mean Position Leaderboard	Rank (Summed Nuclei F1 & Micro Dice)	Rank (Averaged Nuclei F1 & Micro Dice)
🥇 #1	wildsquirrel (TIAKong)	0.7439	0.6466	0.7823	0.7631	0.7145	2.0	1.0	1.0
🥈 #2	NiTo (LSM)	0.7443	0.6501	0.7237	0.7340	0.6869	2.5	2.0	2.0
🥉 #3	rictoo	0.7578	0.6585	0.6326	0.6952	0.6456	2.5	3.0	3.0
#8	Baseline	0.6940	0.5980	0.5548	0.6244	0.5764	8.0	8.0	8.0

Final Rankings – Track 2

Rank	Team	Summed Macro F1	Macro F1	Micro Dice (Tissue)	Mean Summed Nuclei F1 & Micro Dice	Mean Averaged Nuclei F1 & Micro Dice	Mean Position Leaderboard	Rank (Summed Nuclei F1 & Micro Dice)	Rank (Averaged Nuclei F1 & Micro Dice)
🥇 #1	NiTo (LSM)	0.4897	0.2707	0.7798	0.6348	0.5253	1.5	1.0	1.0
🥈 #2	wildsquirrel (TIAKong)	0.4669	0.2656	0.7823	0.6246	0.5240	1.5	2.0	2.0
🥉 #3	agaldran	0.4778	0.2617	0.6204	0.5491	0.4411	4.5	3.0	3.0
#11	Baseline	0.2977	0.2040	0.5548	0.4263	0.3794	10.5	10.0	11.0

Our original evaluation code calculated the nuclei F1 score as the average of the per-image F1 scores, whereas our intention was to compute it from the true positives, false positives, and false negatives summed over all images. We therefore decided to report both metrics. Under both the summed and the averaged F1 score, the top-ranked teams within each track remain the same. When teams have the same mean ranking, the mean of the F1 score and the Tissue Micro Dice is used to determine the higher-performing team.
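
To illustrate the difference between the two F1 variants and the tie-break value, here is a minimal Python sketch. It is not the challenge's evaluation code. The `ImageCounts` container, the function names, and the choice to score an empty image as 0.0 are all assumptions made for illustration, and the sketch covers a single nucleus class, whereas the leaderboard reports a macro F1 over classes.

```python
from dataclasses import dataclass


@dataclass
class ImageCounts:
    """Detection counts for one image (hypothetical container)."""
    tp: int  # true positives
    fp: int  # false positives
    fn: int  # false negatives


def f1(tp: int, fp: int, fn: int) -> float:
    """F1 from raw counts: 2*TP / (2*TP + FP + FN).

    Returning 0.0 for an all-zero denominator is an assumption of this sketch.
    """
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0


def averaged_f1(images: list[ImageCounts]) -> float:
    """Average of the per-image F1 scores."""
    return sum(f1(c.tp, c.fp, c.fn) for c in images) / len(images)


def summed_f1(images: list[ImageCounts]) -> float:
    """F1 computed once from the counts summed over all images."""
    return f1(
        sum(c.tp for c in images),
        sum(c.fp for c in images),
        sum(c.fn for c in images),
    )


def tie_break_score(nuclei_f1: float, tissue_micro_dice: float) -> float:
    """Mean of the nuclei F1 score and the Tissue Micro Dice."""
    return (nuclei_f1 + tissue_micro_dice) / 2


# Made-up counts: one image with many nuclei, one with very few.
images = [ImageCounts(tp=90, fp=10, fn=10), ImageCounts(tp=1, fp=3, fn=3)]
print(averaged_f1(images))  # per-image scores 0.9 and 0.25 weigh equally
print(summed_f1(images))    # the image with more nuclei dominates

# Track 1, first row: Summed Macro F1 0.7439 and Micro Dice 0.7823.
print(round(tie_break_score(0.7439, 0.7823), 4))
```

With these made-up counts, the averaged variant gives both images equal weight, while the summed variant weights each image by the number of nuclei it contains, which is why the two can differ noticeably when nucleus counts vary between images. The last line reproduces the "Mean Summed Nuclei F1 & Micro Dice" value in the first row of the Track 1 table.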