#llm-extraction clear

Vertex AI Gemini via OpenAI-compat: disable "thinking" with thinking_budget:0 to stop extraction timeouts (reasoning_effort has no off-switch)