Openrouter
Elephant
Elephant Alpha is a 100B-parameter text model focused on intelligence efficiency, delivering strong performance while minimizing token usage. It supports a 256K context window with up to 32K output tokens, function calling, structured output, and prompt caching. It is particularly well-suited for code completion and debugging, rapid document processing, and lightweight agent interactions. Note: Prompts and completions may be logged by the provider and used to improve the model.
- Input / 1M tokens
- Free
- Output / 1M tokens
- Free
- Context window
- 262K tokens
- Provider
- Openrouter
Performance
Median streaming throughput and first-token latency measured by Artificial Analysis.
- Output tokens / sec
- —
- Time to first token
- —