Explore AI models and APIs from Together AI
Showing 1-56 of 151 models
Qwen 2.5 Coder 32B Instruct
Qwen2.5 7B Instruct
Qwen2.5 7B
Qwen2.5 72B
Qwen2.5 3B Instruct
Qwen2.5 14B
Qwen2.5 1.5B Instruct
Qwen2.5 1.5B
Qwen 2 (7B)
Qwen 2 Instruct (1.5B)
Qwen QwQ-32B
Minimax M1 80K
Minimax M1 40K
Deepcoder 14B Preview
Qwen 2 (72B)
Nvidia Nemotron 3 Super 120B A12b FP8
Llama 3.2 1B
Deepseek Coder 33B Instruct
Deepseek V3.2 Exp
Deepseek V3.1 Base
Deepseek V3
Deepseek V3 Base
DeepSeek R1 Distill Qwen 7B
Gemma 3 4b it
DeepSeek R1 Distill Qwen 14B
DeepSeek R1 Distill Qwen 1.5B
DeepSeek R1 Distill Llama 70B
Deepseek OCR 2
Cogito V1 Preview Qwen 32B
Cogito V1 Preview Qwen 14B
MiniMax M2.5 FP4
Cogito V1 Preview Llama 8B
GLM 5 FP4
Cogito V1 Preview Llama 70B Turbo
Cogito V1 Preview Llama 70B
nim/nvidia/llama-3.1-nemotron-70b-instruct
nim/nv-mistralai/mistral-nemo-12b-instruct
nim/meta/llama-3.1-8b-instruct
nim/meta/llama-3.1-70b-instruct
Gemma 2 9B It
Qwen3 Next 80B A3b Thinking
Qwen2.5 32B
Nous Hermes 2 Mixtral 8X7B Dpo
Qwen3-VL-32B-Instruct
Meta Llama 3.1 70B Instruct Turbo
Qwen3 Coder 30B A3b Instruct
Qwen 2 (1.5B)
Facebook CWM
Holo3 35B A3b
GLM 5.1 FP4
Qwen2.5-VL (72B) Instruct
Qwen3-VL-235B-A22B-Instruct-FP8
Qwen3 Next 80B A3b Instruct FP8
Qwen3 8B Base
Qwen3 8B
Qwen3 4B Base