All Models

NVIDIA: Llama 3.1 Nemotron 70B Instruct

Tool Calling

NVIDIA's Llama 3.1 Nemotron 70B is a language model designed for generating precise and useful responses. Leveraging [Llama 3.1 70B](/models/meta-llama/llama-3.1-70b-instruct) architecture and Reinforcement Learning from Human Feedback (RLHF), it excels in automatic alignment benchmarks. This model is tailored for applications requiring high accuracy in helpfulness and response generation, suitable for diverse user queries across multiple domains. Usage of this model is subject to [Meta's Acceptable Use Policy](https://www.llama.com/llama3/use-policy/).

Providers 1
Released Oct 12, 2024
Input Modalities text
Output Modalities text
Tarsk Use coding

Available Providers (1)

Provider Model ID Input Cost Output Cost Context Max Output Docs
Kilo Gateway nvidia/llama-3.1-nemotron-70b-instruct $1.20/MTok $1.20/MTok 131.1K 16.4K

Capabilities

Reasoning
Tool Calling
Attachments
Open Weights
Structured Output