Gemini 1.5 Pro 002 overloaded? - Google Cloud Community
I switched my application from Gemini 1.0 Pro to Gemini 1.5 Pro 002 last week, but today we're getting this response:
"Gemini API operation failed: operation=models/gemini-1.5-pro-002:generateContent, status_code=503
The model is overloaded. Please try again later."
Is this an issue at the Google end? When I test the model in the control panel it seems to work without any problems. Any help or insight greatly appreciated.
Same here. I'm currently developing my application, and this issue has been occurring for a few weeks. Over the last few days it has gotten worse. Before, about half of the requests seemed to work; today I get this error every time I use the model.
It seems to be happening less today, so I'm guessing it's at the Google end rather than ours, as nothing has changed on our side.
Hi @Wingee, yes, it seems like an issue on Google's side. Could you please create a support ticket in your Google Cloud account? Regards, Mokit.
I've found gemini-1.5-pro-001 to be more stable (or I guess "consistent"?) in terms of performance and handling requests, so I would give that a try. It's still Gemini 1.5 🙂
Thank you! I’ll give that a go!
Hi @Wingee, welcome to Google Cloud Community! A 503 usually indicates that the service is temporarily overloaded or down on Google's end. Please wait a bit and retry your request. Alternatively, you can implement an exponential backoff retry mechanism in your application to handle these temporary failures, or fall back to a different model. Monitoring your API usage can also help identify patterns or recurring issues. If you're encountering this behavior consistently, you can contact Google Cloud Support to investigate further and check the requests under your project. Was this helpful? If so, please accept this answer as “Solution”. If you need additional assistance, reply here within 2 business days and I’ll be happy to help.
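For anyone wondering what the exponential backoff suggestion might look like in practice, here is a minimal Python sketch. It is not tied to any particular Gemini client library: `ServiceUnavailableError` and `call_with_backoff` are hypothetical names made up for illustration, and you would need to swap in whatever exception your own client actually raises on a 503. The delay values are arbitrary starting points, not recommendations from Google.

```python
import random
import time


class ServiceUnavailableError(Exception):
    """Hypothetical stand-in for whatever your client raises on HTTP 503."""


def call_with_backoff(request_fn, max_attempts=5, base_delay=1.0,
                      max_delay=30.0, sleep=time.sleep):
    """Call request_fn, retrying on ServiceUnavailableError with backoff.

    request_fn is any zero-argument callable that performs the API call.
    sleep is injectable so the function can be tested without real waiting.
    """
    for attempt in range(max_attempts):
        try:
            return request_fn()
        except ServiceUnavailableError:
            if attempt == max_attempts - 1:
                raise  # out of attempts: surface the error to the caller
            # Upper bound doubles each attempt (base, 2*base, 4*base, ...),
            # capped at max_delay.
            cap = min(max_delay, base_delay * (2 ** attempt))
            # Jitter: wait a random time in [0, cap] so that many clients
            # do not all retry at the same moment.
            sleep(random.uniform(0, cap))
```

You would wrap the real call in a function or lambda, for example `call_with_backoff(lambda: send_request(prompt))`, where `send_request` is your own (here hypothetical) function that calls the model. Note that this only smooths over short outages; if every request fails for days, as some replies here describe, retries alone won't help.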
Same here, and this makes my service almost unusable.
I've been getting 503 with "The model is overloaded. Please try again later." and status "UNAVAILABLE" for gemini-1.5-flash-8b for days now. That's a shame, as my service relies on it almost fully.