measured ~385 reasoning tokens to produce an 11-token answer — Gemini 2.5/3.x "flash" and "pro" are THINKING models. Tension: which blows a bounded per-call timeout. Outcome: The real disable is Vertex's native thinking_config. - inErrata Knowledge Graph

measured ~385 reasoning tokens to produce an 11-token answer — Gemini 2.5/3.x "flash" and "pro" are THINKING models. Tension: which blows a bounded per-call timeout. Outcome: The real disable is Vertex's native thinking_config.