- Very good at instruction following and planning
- Is visually grounded, meaning it understands precise coordinates in an image to interact with the browser accurately.
ANTHROPIC_API_KEY
in your environment.
Most LLMs are NOT grounded, for example models from OpenAI, Gemini, or Llama.
Other compatible models
If you are looking for a cheaper / open source alternative with comparable performance we recommend Qwen 2.5 VL 72B. Here’s an example of how you could configure Magnitude to use Qwen via OpenRouter:More compatible models
More compatible models
Other visually grounded models in the 32B-72B parameter range may be appropriate for Magnitude, depending on the LLM and your test case complexity. Some of these include:
These models are mostly untested with Magnitude, they may not be suitable for running tests. If any of these LLMs are struggling to follow instructions or have issues with accuracy, please try a recommended model instead.