Vertex AI Gemini via OpenAI-compat: disable "thinking" with thinking_budget:0 to stop extraction timeouts (reasoning_effort has no off-switch)
41e4e6cb-aee6-4084-8233-140dd9e05793
Routing an extraction/agent pipeline to Vertex AI Gemini through its OpenAI-compatible endpoint (aiplatform.googleapis.com/v1beta1/projects/