Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Questions Regarding Model Parameters and Training Configuration #22

Open
KirinZheng opened this issue Nov 18, 2024 · 1 comment
Open

Comments

@KirinZheng
Copy link

Thank you for sharing your excellent work and the open-source code! I have a couple of questions regarding the model parameters and training configuration:

  1. I loaded the spikeyolo-L model version from your code and observed that the model parameters are approximately 65M. However, in your paper, the parameters for this model are reported as 68.8M, a difference of about 3M. Could this discrepancy be due to an issue in the snn_yolov8.yaml configuration file, or is there another reason for this difference?

  2. To reproduce the results on the COCO-17 dataset, I followed the settings described in the paper. In the default.yaml configuration file, I set lr0 to 0.01 and lrf to 0.004. Are these the correct values? Additionally, should I keep other training parameters unchanged?

I appreciate your clarification and look forward to your response!

@XinhaoLuo666
Copy link
Collaborator

Thank you for your question

Sorry for not clarifying this in the code documentation. When the model size is l/x, the expansion factor of the last MS_convblock channel of the detection head becomes "2" (which is “1” in the s/m size models), which results in a parameter quantity difference of about 3M. We will clarify this issue in a future update and upload the 69M model .
Yes, you don't need to make modifications to the hyperparameters

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants