NVIDIA
Nemotron Nano 9B V2 (free)
NVIDIA-Nemotron-Nano-9B-v2 is a large language model (LLM) trained from scratch by NVIDIA, and designed as a unified model for both reasoning and non-reasoning tasks. It responds to user queries and...
- Input / 1M tokens
- Free
- Output / 1M tokens
- Free
- Context window
- 128K tokens
- Provider
- NVIDIA
- Knowledge cutoff
- 2025-03-31
Performance
Median streaming throughput and first-token latency measured by Artificial Analysis.
- Output tokens / sec
- —
- Time to first token
- —