TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Learning with noisy labels	ANIMAL	SURE	Accuracy	89.0	# 1
Learning with noisy labels	ANIMAL	SURE	Network	Vgg19-BN	# 1
Learning with noisy labels	ANIMAL	SURE	ImageNet Pretrained	NO	# 1
Long-tail Learning	CIFAR-100-LT (ρ=10)	SURE(ResNet-32)	Error Rate	26.76	# 7
Long-tail Learning	CIFAR-100-LT (ρ=100)	SURE(ResNet-32)	Error Rate	43.66	# 9
Long-tail Learning	CIFAR-100-LT (ρ=50)	SURE(ResNet-32)	Error Rate	36.87	# 7
Long-tail Learning	CIFAR-10-LT (ρ=10)	SURE(ResNet-32)	Error Rate	5.04	# 2
Long-tail Learning	CIFAR-10-LT (ρ=100)	SURE(ResNet-32)	Error Rate	13.07	# 6
Long-tail Learning	CIFAR-10-LT (ρ=50)	SURE(ResNet-32)	Error Rate	9.78	# 2
Image Classification	Food-101N	SURE(ResNet-50)	Accuracy	88.0	# 3

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/sure-survey-recipes-for-building-reliable-and/learning-with-noisy-labels-on-animal)](https://paperswithcode.com/sota/learning-with-noisy-labels-on-animal?p=sure-survey-recipes-for-building-reliable-and)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/sure-survey-recipes-for-building-reliable-and/long-tail-learning-on-cifar-10-lt-r-10)](https://paperswithcode.com/sota/long-tail-learning-on-cifar-10-lt-r-10?p=sure-survey-recipes-for-building-reliable-and)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/sure-survey-recipes-for-building-reliable-and/long-tail-learning-on-cifar-10-lt-r-50)](https://paperswithcode.com/sota/long-tail-learning-on-cifar-10-lt-r-50?p=sure-survey-recipes-for-building-reliable-and)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/sure-survey-recipes-for-building-reliable-and/image-classification-on-food-101n-1)](https://paperswithcode.com/sota/image-classification-on-food-101n-1?p=sure-survey-recipes-for-building-reliable-and)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/sure-survey-recipes-for-building-reliable-and/long-tail-learning-on-cifar-10-lt-r-100)](https://paperswithcode.com/sota/long-tail-learning-on-cifar-10-lt-r-100?p=sure-survey-recipes-for-building-reliable-and)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/sure-survey-recipes-for-building-reliable-and/long-tail-learning-on-cifar-100-lt-r-10)](https://paperswithcode.com/sota/long-tail-learning-on-cifar-100-lt-r-10?p=sure-survey-recipes-for-building-reliable-and)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/sure-survey-recipes-for-building-reliable-and/long-tail-learning-on-cifar-100-lt-r-50)](https://paperswithcode.com/sota/long-tail-learning-on-cifar-100-lt-r-50?p=sure-survey-recipes-for-building-reliable-and)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/sure-survey-recipes-for-building-reliable-and/long-tail-learning-on-cifar-100-lt-r-100)](https://paperswithcode.com/sota/long-tail-learning-on-cifar-100-lt-r-100?p=sure-survey-recipes-for-building-reliable-and)`

SURE: SUrvey REcipes for building reliable and robust deep networks

1 Mar 2024 · Yuting Li, Yingyi Chen, Xuanlong Yu, Dexiong Chen, Xi Shen ·

In this paper, we revisit techniques for uncertainty estimation within deep neural networks and consolidate a suite of techniques to enhance their reliability. Our investigation reveals that an integrated application of diverse techniques--spanning model regularization, classifier and optimization--substantially improves the accuracy of uncertainty predictions in image classification tasks. The synergistic effect of these techniques culminates in our novel SURE approach. We rigorously evaluate SURE against the benchmark of failure prediction, a critical testbed for uncertainty estimation efficacy. Our results showcase a consistently better performance than models that individually deploy each technique, across various datasets and model architectures. When applied to real-world challenges, such as data corruption, label noise, and long-tailed class distribution, SURE exhibits remarkable robustness, delivering results that are superior or on par with current state-of-the-art specialized methods. Particularly on Animal-10N and Food-101N for learning with noisy labels, SURE achieves state-of-the-art performance without any task-specific adjustments. This work not only sets a new benchmark for robust uncertainty estimation but also paves the way for its application in diverse, real-world scenarios where reliability is paramount. Our code is available at \url{https://yutingli0606.github.io/SURE/}.

PDF Abstract

Code

Add Remove Mark official

YutingLi0606/SURE official

Tasks

Add Remove

Image Classification

Learning with noisy labels

Long-tail Learning

Datasets

CIFAR-10

ImageNet

CIFAR-100

Tiny ImageNet CIFAR-10C CIFAR100-LT

ANIMAL Food-101N

Results from the Paper

Add Remove

Ranked #1 on Learning with noisy labels on ANIMAL

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Learning with noisy labels	ANIMAL	SURE	Accuracy	89.0	# 1	Compare
			Network	Vgg19-BN	# 1	Compare
			ImageNet Pretrained	NO	# 1	Compare
Long-tail Learning	CIFAR-100-LT (ρ=10)	SURE(ResNet-32)	Error Rate	26.76	# 7	Compare
Long-tail Learning	CIFAR-100-LT (ρ=100)	SURE(ResNet-32)	Error Rate	43.66	# 9	Compare
Long-tail Learning	CIFAR-100-LT (ρ=50)	SURE(ResNet-32)	Error Rate	36.87	# 7	Compare
Long-tail Learning	CIFAR-10-LT (ρ=10)	SURE(ResNet-32)	Error Rate	5.04	# 2	Compare
Long-tail Learning	CIFAR-10-LT (ρ=100)	SURE(ResNet-32)	Error Rate	13.07	# 6	Compare
Long-tail Learning	CIFAR-10-LT (ρ=50)	SURE(ResNet-32)	Error Rate	9.78	# 2	Compare
Image Classification	Food-101N	SURE(ResNet-50)	Accuracy	88.0	# 3	Compare

Methods

Add Remove

No methods listed for this paper. Add relevant methods here

Edit Social Preview

SURE: SUrvey REcipes for building reliable and robust deep networks

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit Add Remove

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Add Remove

Methods

Add Remove