GitHub - wish44165/ntd: CVPR 2024 (Seattle, USA)

NTD - Official Pytorch Implementation

The code has been successfully tested on Ubuntu 22.04.

Data Stream Sampling with Fuzzy Task Boundaries and Noisy Labels

1. Environmental Setup

Hardware Information

CPU: Intel® Core™ i7-12650H
GPU: NVIDIA GeForce RTX 4050 Laptop GPU (6G)

Create Conda Environment

$ conda create -n ntd python=3.10 -y
$ conda activate ntd
$ git clone https://github.com/wish44165/ntd.git
$ cd ntd/
$ pip install -r requirements.txt

Datasets Preparation

Datasets	CIFAR10	CIFAR100	WebVision	Food-101N
# train	50000	50000	65944	52867
# test	10000	10000	2500	4741
# class	10	100	50	101
# tasks	5	5	10	5
Memory size	500	2000	1000	2000
Models	ResNet18	ResNet32	ResNet34	ResNet34
Batch size	16	16	16	16
Epochs	256	256	128	128

2. Reproducing Details

Execute Commands

$ python run_experiment.py --dataset_path <dataset path> \
                           --mem_manage <memory construction type> \
                           --robust_type <memory usage type> \
                           --exp_name <noisy level and type>

Reproduce Commands

# CIFAR10
$ python run_experiment.py --dataset_path ../../../../datasets/cifar10_png --mem_manage NTD --robust_type none --exp_name blurry10_symN20
$ python run_experiment.py --dataset_path ../../../../datasets/cifar10_png --mem_manage PuriDivER --robust_type PuriDivER --exp_name blurry10_symN20

$ python run_experiment.py --dataset_path ../../../../datasets/cifar10_png --mem_manage NTD --robust_type none --exp_name blurry10_symN40
$ python run_experiment.py --dataset_path ../../../../datasets/cifar10_png --mem_manage PuriDivER --robust_type PuriDivER --exp_name blurry10_symN40

$ python run_experiment.py --dataset_path ../../../../datasets/cifar10_png --mem_manage NTD --robust_type none --exp_name blurry10_symN60
$ python run_experiment.py --dataset_path ../../../../datasets/cifar10_png --mem_manage PuriDivER --robust_type PuriDivER --exp_name blurry10_symN60

$ python run_experiment.py --dataset_path ../../../../datasets/cifar10_png --mem_manage NTD --robust_type none --exp_name blurry10_asymN20
$ python run_experiment.py --dataset_path ../../../../datasets/cifar10_png --mem_manage PuriDivER --robust_type PuriDivER --exp_name blurry10_asymN20

$ python run_experiment.py --dataset_path ../../../../datasets/cifar10_png --mem_manage NTD --robust_type none --exp_name blurry10_asymN40
$ python run_experiment.py --dataset_path ../../../../datasets/cifar10_png --mem_manage PuriDivER --robust_type PuriDivER --exp_name blurry10_asymN40

# CIFAR100
$ python run_experiment.py --dataset cifar100 --dataset_path ../../../../datasets/cifar100_png --mem_manage NTD --robust_type none --exp_name blurry10_symN20
$ python run_experiment.py --dataset cifar100 --dataset_path ../../../../datasets/cifar100_png --mem_manage PuriDivER --robust_type PuriDivER --exp_name blurry10_symN20

$ python run_experiment.py --dataset cifar100 --dataset_path ../../../../datasets/cifar100_png --mem_manage NTD --robust_type none --exp_name blurry10_symN40
$ python run_experiment.py --dataset cifar100 --dataset_path ../../../../datasets/cifar100_png --mem_manage PuriDivER --robust_type PuriDivER --exp_name blurry10_symN40

$ python run_experiment.py --dataset cifar100 --dataset_path ../../../../datasets/cifar100_png --mem_manage NTD --robust_type none --exp_name blurry10_symN60
$ python run_experiment.py --dataset cifar100 --dataset_path ../../../../datasets/cifar100_png --mem_manage PuriDivER --robust_type PuriDivER --exp_name blurry10_symN60

$ python run_experiment.py --dataset cifar100 --dataset_path ../../../../datasets/cifar100_png --mem_manage NTD --robust_type none --exp_name blurry10_asymN20
$ python run_experiment.py --dataset cifar100 --dataset_path ../../../../datasets/cifar100_png --mem_manage PuriDivER --robust_type PuriDivER --exp_name blurry10_asymN20

$ python run_experiment.py --dataset cifar100 --dataset_path ../../../../datasets/cifar100_png --mem_manage NTD --robust_type none --exp_name blurry10_asymN40
$ python run_experiment.py --dataset cifar100 --dataset_path ../../../../datasets/cifar100_png --mem_manage PuriDivER --robust_type PuriDivER --exp_name blurry10_asymN40

# WebVision
$ python run_experiment.py --dataset WebVision-V1-2 --dataset_path ../../../../datasets/WebVision-V1-2 --mem_manage NTD --robust_type none --exp_name blurry10
$ python run_experiment.py --dataset WebVision-V1-2 --dataset_path ../../../../datasets/WebVision-V1-2 --mem_manage PuriDivER --robust_type PuriDivER --exp_name blurry10

# Food-101N
$ python run_experiment.py --dataset Food-101N --dataset_path ../../../../datasets/Food-101N/images --mem_manage NTD --robust_type none --exp_name blurry10
$ python run_experiment.py --dataset Food-101N --dataset_path ../../../../datasets/Food-101N/images --mem_manage PuriDivER --robust_type PuriDivER --exp_name blurry10

3. Experimental Results

Comparison Tables

logs

Last test accuracy evaluated on CIFAR10 and CIFAR100 datasets with noisy types Sym.-{20%, 40%, 60%} and Asym.-{20%, 40%}.

Methods	CIFAR10			CIFAR100
	Sym.			Sym.
	20	40	60	20	40
PuriDivER [1]	60.6 ± 1.8	57.8 ± 2.2	52.0 ± 2.8	61.2 ± 2.9	49.4 ± 5.7	36.3 ± 0.3
NTD (ours)	59.8 ± 0.6	59.7 ± 1.5	50.9 ± 0.3	60.1 ± 0.3	53.7 ± 3.9	38.3 ± 1.0

Last memory clean ratio on CIFAR10 and CIFAR100 datasets with noisy types Sym.-{20%, 40%, 60%} and Asym.-{20%, 40%}.

Methods	CIFAR10			CIFAR100
	Sym.			Sym.
	20	40	60	20	40
PuriDivER [1]	98.6 ± 0.7	96.1 ± 0.6	86.6 ± 4.0	98.7 ± 0.4	79.7 ± 7.7	99.2 ± 0.1
NTD (ours)	99.2 ± 0.5	97.1 ± 0.9	86.8 ± 0.7	98.7 ± 1.0	87.9 ± 4.6	99.0 ± 0.3

Last test accuracy evaluated on WebVision and Food-101N.

Methods	WebVision	Food-101N
PuriDivER [1]	25.1 ± 0.8	13.8 ± 0.6
NTD (ours)	26.1 ± 1.6	17.0 ± 0.9

Last memory clean ratio on WebVision and Food-101N.

Methods	WebVision	Food-101N
PuriDivER [1]	100 ± 0	100 ± 0
NTD (ours)	100 ± 0	100 ± 0

The average training time on the CIFAR10 dataset with noisy type Sym.-40% across three distinct random seeds for the online learning stage, the episodic memory usage stage, and the overall process (measured in hours).

Methods	Online learning	Episodic memory usage	Overall
PuriDivER [1]	0.28	3.09	3.37
NTD (ours)	0.19	1.25	1.44

GPU memory usage for the CIFAR10 dataset with noisy type Sym.-40% during the online learning and episodic memory usage stages (measured in MiB).

Methods	Online learning	Episodic memory usage
PuriDivER [1]	828	4528
NTD (ours)	828	834

References:

[1] Online Continual Learning on a Contaminated Data Stream with Blurry Task Boundaries (GitHub)

Acknowledgements

We extend our gratitude to the authors in PuriDivER for furnishing the organized code base, facilitating the reproducibility of results, and enabling performance comparison with multiple approaches.

Citation

@misc{chen2024data,
      title={Data Stream Sampling with Fuzzy Task Boundaries and Noisy Labels}, 
      author={Yu-Hsi Chen},
      year={2024},
      eprint={2404.04871},
      archivePrefix={arXiv},
      primaryClass={cs.LG}
}

Name		Name	Last commit message	Last commit date
Latest commit History 52 Commits
assets		assets
configuration		configuration
logs		logs
methods		methods
models		models
tasks		tasks
utils		utils
README.md		README.md
main.py		main.py
requirements.txt		requirements.txt
run_experiment.py		run_experiment.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

NTD - Official Pytorch Implementation

1. Environmental Setup

2. Reproducing Details

3. Experimental Results

Acknowledgements

Citation

About

Releases

Packages

Languages

wish44165/ntd

Folders and files

Latest commit

History

Repository files navigation

NTD - Official Pytorch Implementation

1. Environmental Setup

2. Reproducing Details

3. Experimental Results

Acknowledgements

Citation

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages