Customizing
LLM Configuration
Visually grounded LLMs compatible with Magnitude
Magnitude requires an LLM than is both:
- Very good at instruction following and planning
- Is visually grounded, meaning it understands precise coordinates in an image to interact with the browser accurately.
Very few LLMs meet this criteria, which is why we recommend Claude Sonnet 4, which has strong reasoning abilities and is grounded.
To use Sonnet, simply set ANTHROPIC_API_KEY
in your environment.
Most LLMs are NOT grounded, for example models from OpenAI, Gemini, or Llama.
Other compatible models
If you are looking for a cheaper / open source alternative with comparable performance we recommend Qwen 2.5 VL 72B.
Here’s an example of how you could configure Magnitude to use Qwen via OpenRouter:
magnitude.config.ts
For instructions on configuring LLMs with various providers, see LLM Providers.