Abstract: Model compression using self-knowledge distillation methods has achieved remarkable performance in tasks such as image classification and object detection. However, most current ...