Gemini Rate Limit Issue #786
Comments
Hey there, @gurugecl! I'm here to help you with any bugs, questions, or contributions you may have. The rate limit issue you're encountering with the Gemini 2.0 Flash model in the
By addressing these points, you should be able to mitigate the rate limit issues you're experiencing. If the problem persists, further investigation into the specific interactions between |
Resolved by modifying the Docs setup even though it seemed to work without an issue initially |
I think your rate limit is wrong: this is not RPM (requests per minute) but TPM (tokens per minute), so 4 million. |
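To illustrate the TPM point above, here is a minimal sketch of what a tokens-per-minute rate limit entry might look like. It assumes that the `rate_limit` entry in PaperQA's `llm_config` takes a string of the form "<N> per 1 minute" counted in tokens, and that the model is addressed by the LiteLLM-style name `gemini/gemini-2.0-flash`. The helper names `tokens_per_minute` and `build_llm_config` are hypothetical and only build plain dictionaries; this is not the original poster's code.

```python
# Sketch only: builds a plain dict shaped like an llm_config with a
# tokens-per-minute rate limit. The dict layout is an assumption about
# PaperQA's settings; nothing here imports or calls PaperQA or LiteLLM.

MODEL = "gemini/gemini-2.0-flash"  # assumed LiteLLM-style model name


def tokens_per_minute(limit: int) -> str:
    """Format a token budget as a '<N> per 1 minute' rate string (hypothetical helper)."""
    if limit <= 0:
        raise ValueError("limit must be a positive number of tokens")
    return f"{limit} per 1 minute"


def build_llm_config(model: str, tpm: int) -> dict:
    """Return a config dict that routes to `model` and caps it at `tpm` tokens per minute."""
    return {
        "model_list": [
            {"model_name": model, "litellm_params": {"model": model}},
        ],
        "rate_limit": {model: tokens_per_minute(tpm)},
    }


# 4 million is the TPM figure mentioned in the comment above.
config = build_llm_config(MODEL, 4_000_000)
print(config["rate_limit"])
```

If the assumption about the format holds, a dict like this would be passed as `llm_config` (and likely `summary_llm_config`) in the settings, so the limiter counts tokens rather than requests.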
Hi @gurugecl , could you please share your working code? I'm having trouble setting this up with Gemini and you seem to have made it work. I tried your code in here but it's missing a lot of stuff. |
I am running into a rate limit issue when using Gemini 2.0 Flash. According to this thread in LiteLLM (7188), I believe the model is supported. The rate limit error comes on the very first query to the docs object, and no amount of additional waiting or exponential backoff seems to resolve it, so I suspect it is caused by something happening behind the scenes in either PaperQA or LiteLLM. However, they do not believe this to be an issue on their end (7358).
Here is how I am using paperqa. Can you please confirm whether this is set up correctly to avoid rate limits or, if not, how I should set it up? Thank you!
Here are some of the logs/errors: