2 code implementations • 27 Jul 2023 • Till J. Bungert, Levin Kobelke, Paul F. Jaeger
Based on the result that none of the benchmarked CSFs can reliably prevent silent failures, we conclude that a deeper understanding of the root causes of failures in the data is required.
1 code implementation • NeurIPS 2023 • Carsten T. Lüth, Till J. Bungert, Lukas Klein, Paul F. Jaeger
Thus, today's AL literature presents an inconsistent and contradictory landscape, leaving practitioners uncertain about whether and how to use AL in their tasks.
2 code implementations • 28 Nov 2022 • Paul F. Jaeger, Carsten T. Lüth, Lukas Klein, Till J. Bungert
To demonstrate the relevance of this unified perspective, we present a large-scale empirical study for the first time enabling benchmarking confidence scoring functions w. r. t all relevant methods and failure sources.