I am developing a prompt with gemini-1.5-pro-001 in Vertex AI Workbench, but the 429 error below keeps occurring.
When I check the IAM quota and limit, it appears that there are sufficient resources, but I don't know what the cause is. Is there any solution?
Since it is a different type from the 429 error that other people have experienced, searching does not provide an answer.
File /opt/conda/lib/python3.10/site-packages/vertexai/generative_models/_generative_models.py:719, in _GenerativeModel._generate_content_streaming(self, contents, generation_config, safety_settings, tools, tool_config)
694 """Generates content.
695
696 Args:
(...)
710 A stream of GenerationResponse objects
711 """
712 request = self._prepare_request(
713 contents=contents,
714 generation_config=generation_config,
(...)
717 tool_config=tool_config,
718 )
--> 719 response_stream = self._prediction_client.stream_generate_content(
720 request=request
721 )
722 for chunk in response_stream:
723 yield self._parse_response(chunk)
File /opt/conda/lib/python3.10/site-packages/google/cloud/aiplatform_v1beta1/services/prediction_service/client.py:2400, in PredictionServiceClient.stream_generate_content(self, request, model, contents, retry, timeout, metadata)
2397 self._validate_universe_domain()
2399 # Send the request.
-> 2400 response = rpc(
2401 request,
2402 retry=retry,
2403 timeout=timeout,
2404 metadata=metadata,
2405 )
2407 # Done; return the response.
2408 return response
File /opt/conda/lib/python3.10/site-packages/google/api_core/gapic_v1/method.py:131, in _GapicCallable.__call__(self, timeout, retry, compression, *args, **kwargs)
128 if self._compression is not None:
129 kwargs["compression"] = compression
--> 131 return wrapped_func(*args, **kwargs)
File /opt/conda/lib/python3.10/site-packages/google/api_core/grpc_helpers.py:174, in _wrap_stream_errors.<locals>.error_remapped_callable(*args, **kwargs)
170 return _StreamingResponseIterator(
171 result, prefetch_first_result=prefetch_first
172 )
173 except grpc.RpcError as exc:
--> 174 raise exceptions.from_grpc_error(exc) from exc
ResourceExhausted: 429 Unable to submit request because the service is temporarily out of capacity. Try again later.
![]() |
Google Ads API Team |