Understanding Language Models
Context Window
The maximum amount of text (measured in tokens) that the model can process in a single interaction. Larger context windows allow the model to understand and respond to longer conversations or documents.
Tokens
Units of text that the model processes. A token can be as short as a single character or as long as a word (roughly 4 characters = 1 token in English).
Input/Output Pricing
Models charge differently for processing your input (prompts) vs generating output (completions). Prices are per 100 million tokens.
Cache
Some models offer caching capabilities to store and retrieve previous responses, potentially reducing costs and improving response times.
Available Models
Choose from our selection of state-of-the-art language models
GPT-4o mini
OpenAI
GPT-4o
OpenAI
Claude 3.5 Haiku
Anthropic
Claude 3.5 Sonnet
Anthropic
Claude 3 Opus
Anthropic
Gemini 1.5 Flash 8B
Gemini 1.5 Flash 002
Gemini 1.5 Pro 002
chatgpt-4o-latest
OpenAI
Gemini Experimental 1114
Claude 3.5 Haiku (2024-10-22)
Anthropic
Claude 3.5 Sonnet (2024-10-22)
Anthropic
Gemini 1.5 Flash 8B Experimental (2024-09-24)
Gemini 1.5 Pro Experimental (2024-08-27)
Gemini 1.5 Flash Experimental (2024-08-27)
GPT-4o (2024-08-06)
OpenAI
GPT-4o mini (2024-07-18)
OpenAI
Claude 3.5 Sonnet (2024-06-20)
Anthropic
Gemini 1.5 Flash 001 (2024-05-24)
Gemini 1.5 Pro 001 (2024-05-24)
GPT-4o (2024-05-13)
OpenAI
Claude 3 Haiku (2024-03-27)
Anthropic
Claude 3 Opus (2024-02-29)
Anthropic
Hermes 3 405B Instruct (free)
Nous
Llama 3.1 405B Instruct (free)
Meta
Llama 3.1 70B Instruct (free)
Meta