automix

AutoMix

AutoMix: Unveiling the Power of Mixup for Stronger Classifiers

Abstract

Data mixing augmentation have proved to be effective in improving the generalization ability of deep neural networks. While early methods mix samples by hand-crafted policies (e.g., linear interpolation), recent methods utilize saliency information to match the mixed samples and labels via complex offline optimization. However, there arises a trade-off between precise mixing policies and optimization complexity. To address this challenge, we propose a novel automatic mixup (AutoMix) framework, where the mixup policy is parameterized and serves the ultimate classification goal directly. Specifically, AutoMix reformulates the mixup classification into two sub-tasks (i.e., mixed sample generation and mixup classification) with corresponding sub-networks and solves them in a bi-level optimization framework. For the generation, a learnable lightweight mixup generator, Mix Block, is designed to generate mixed samples by modeling patch-wise relationships under the direct supervision of the corresponding mixed labels. To prevent the degradation and instability of bi-level optimization, we further introduce a momentum pipeline to train AutoMix in an end-to-end manner. Extensive experiments on nine image benchmarks prove the superiority of AutoMix compared with state-of-the-arts in various classification scenarios and downstream tasks.

Results and models

ImageNet-1k

Model	Mixup	resolution	Params(M)	Epochs	Top-1 (%)	Config	Download
ResNet-18	AutoMix	224x224	11.17	100	70.50	config	model / log
ResNet-18	AutoMix	224x224	11.17	300	72.05	config	model / log
ResNet-34	AutoMix	224x224	21.28	100	74.52	config	model / log
ResNet-34	AutoMix	224x224	21.28	300	76.10	config	model / log
ResNet-50	AutoMix	224x224	23.52	100	77.91	config	model / log
ResNet-50	AutoMix	224x224	23.52	300	79.25	config	model / log
ResNet-101	AutoMix	224x224	42.51	100	79.87	config	model / log
ResNet-101	AutoMix	224x224	42.51	300	80.98	config	model / log
ResNeXt-101	AutoMix	224x224	44.18	100	80.89	config	model / log
DeiT-S	AutoMix	224x224	22.05	300	80.78	config	model / log
PVT-T	AutoMix	224x224	13.2	300	76.37	config	model / log
Swin-T	AutoMix	224x224	28.29	300	81.80	config	model / log
ConvNeXt-T	AutoMix	224x224	28.59	300	82.28	config	model / log

We will update configs and models (ResNets, ViTs, Swin-T, and ConvNeXt-T) for AutoMix soon (please contact us if you want the models right now). Please refer to Model Zoo for image classification results.

Citation

@InProceedings{liu2022automix,
      title={AutoMix: Unveiling the Power of Mixup for Stronger Classifiers},
      author={Zicheng Liu and Siyuan Li and Di Wu and Zhiyuan Chen and Lirong Wu and Jianzhu Guo and Stan Z. Li},
      booktitle={European Conference on Computer Vision (ECCV)},
      pages={441--458},
      year={2022},
}

Name		Name	Last commit message	Last commit date
parent directory ..
basic		basic
convnext		convnext
deit		deit
pvt		pvt
swin		swin
README.md		README.md
auto_train_in_automix.py		auto_train_in_automix.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

automix

automix

README.md

AutoMix

Abstract

Results and models

ImageNet-1k

Citation

Files

automix

Directory actions

More options

Directory actions

More options

Latest commit

History

automix

Folders and files

parent directory

README.md

AutoMix

Abstract

Results and models

ImageNet-1k

Citation