-
Notifications
You must be signed in to change notification settings - Fork 130
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Reproducing results of "Understanding the Role of the Projector in Knowledge Distillation" #446
Conversation
@@ -0,0 +1,200 @@ | |||
datasets: | |||
&imagenet_train ilsvrc2012/train: !import_call | |||
_name: &dataset_name 'imagenet2012' |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Just for consistency with existing files, could you replace 'imagenet2012' in this line with 'ilsvrc2012'?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Hi @roymiles
Beside the above request, I have a few more minor requests:
- Add
srd-resnet18_from_resnet34.log
toconfigs/official/ilsvrc2012/roymiles/aaai2024/
- Add your paper info to this file as the third item following the format
- Add your result to https://github.com/yoshitomo-matsubara/torchdistill/blob/main/docs/source/_static/benchmarks/imagenet-resnet18_kd.tsv
- Most important, add
README.md
like this file toconfigs/official/ilsvrc2012/roymiles/aaai2024/
. You can use the links to checkpoint and log file (those links become effective when I release the next version) in your README.md file.
Thanks!
(Just for a record, roymiles/Simple-Recipe-Distillation#1)
Thanks @roymiles for the updates! Did you use 3 GPUs for distributed training? If nos, I will remove the section from README later, when I make minor changes. |
Ah oops, I must have overlooked that. Yea I only used 1 GPU. |
No problem, I will merge this PR and make some modifications. It's a great job! Thanks for your contribution! |
af96172
into
yoshitomo-matsubara:main
I have reproduced the results in the original paper. The original paper reports an accuracy of 71.63%, while this config leads to 71.65%.
The log and checkpoint for this run can be found here: https://drive.google.com/drive/folders/18xl0CDZ6CioP4Sbjdpj1Pndp4biSLpnV?usp=sharing
Trained on a single GPU.