---
title: Llama 3.1 Nemotron 70B Instruct HF API | LLM | ModelsLab
description: Advanced 70-billion-parameter instruction-tuned LLM for natural language tasks, optimized for helpful, detailed responses and strong performance on.
url: https://modelslab.com/models/nvidia/nvidia-Llama-3.1-Nemotron-70B-Instruct-HF.md
canonical: https://modelslab.com/models/nvidia/nvidia-Llama-3.1-Nemotron-70B-Instruct-HF.md
type: product
component: Playground/LLM/Index
generated_at: 2026-04-09T06:06:59.791522Z
---

Llama 3.1 Nemotron 70B Instruct HF

 [LLMs.txt](https://modelslab.com/models/meta/nvidia-Llama-3.1-Nemotron-70B-Instruct-HF/llms.txt) [.md](https://modelslab.com/models/meta/nvidia-Llama-3.1-Nemotron-70B-Instruct-HF.md)

Llama 3.1 Nemotron 70B Instruct HF
---

Choose a prompt below to get started or type your own message

 Explain quantum computing in simple terms

 Write a Python function to sort a list

 Create a marketing email for a SaaS product

 Compare REST vs GraphQL APIs

Send

### Llama 3.1 Nemotron 70B Instruct HF

meta nvidia-Llama-3.1-Nemotron-70B-Instruct-HF

Copy model ID

PricingInput $0.88 / 1M tokens

Output $0.88 / 1M tokens

API EndpointsOpenAI Compatible

`https://modelslab.com/api/v7/llm/chat/completions`Endpoint

Anthropic Compatible

`https://modelslab.com/api/v7/llm/v1/messages`Messages

`https://modelslab.com/api/v7/llm/v1/messages/count_tokens`Count Tokens

`https://modelslab.com/api/v7/llm/v1/models`Models

Use with Claude Code

cURL Example

ParametersSystem MessageYou are a helpful AI assistant specialized in providing accurate and detailed responses.

Temperature0.7

Max Tokens1000

Top P0.9

Frequency Penalty0

Presence Penalty0

Model Info

Support

---

*This markdown version is optimized for AI agents and LLMs.*

**Links:**
- [Website](https://modelslab.com)
- [API Documentation](https://docs.modelslab.com)
- [Blog](https://modelslab.com/blog)

---
*Generated by ModelsLab - 2026-04-09*