Paper: Training data-efficient image transformers & distillation through attention
Code: https://github.com/facebookresearch/deit