Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

How to fine-tune NLLB-200 model? #969

Open
epiniguin opened this issue Jan 15, 2025 · 3 comments
Open

How to fine-tune NLLB-200 model? #969

epiniguin opened this issue Jan 15, 2025 · 3 comments
Labels
question Further information is requested

Comments

@epiniguin
Copy link

epiniguin commented Jan 15, 2025

Hi!

How to train NLLB model using an existing NLLB-200 model(for example 3.3B) as a checkpoint?

@epiniguin epiniguin added the question Further information is requested label Jan 15, 2025
@cbalioglu
Copy link
Contributor

Hi @epiniguin, we are about to land a refactoring to our recipes that will reduce the amount of code one needs to write. Do you use a publicly available dataset? We plan to write READMEs for each recipe and we might use your use case as an example.

@epiniguin
Copy link
Author

@cbalioglu I use publicly available datasets.

@epiniguin
Copy link
Author

Hi @cbalioglu !

Are there any news about recipe refactoring and Readme?
I'm also interested in MT evaluation recipe.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
question Further information is requested
Projects
None yet
Development

No branches or pull requests

2 participants