Deploy Dedicated GPU server to run AI models
Explore our collection of pre-built AI workflows for various use cases
Create custom AI workflows tailored to your specific needs
API details in LLM-friendly format
Command-line interface for API
Skills for coding agents to use APIs
Agents Control Plane for API
SDKs for ModelsLab API
MCP Server for API
Explore AI models and APIs from Together AI
Showing 113-151 of 151 models
Qwen3 30B A3B Instruct 2507 Lora
Deepseek V3.1 Terminus
DeepSeek R1 (Original)
Mistral 7B v0.1
Mistral (7B) Instruct v0.1
Magistral Small 2506
Gemma 2B It
Qwen3.5 397B A17b Fp8
nim/meta/llama-3.2-90b-vision-instruct
meta-llama/Llama-2-7b-chat-hf
Gemma 3 27B Pt
Llama 4 Maverick Instruct (17Bx128E) FP8
Glm 4.5 Air Fp8
Qwen3 Next 80B A3b Instruct
GLM 4.7 Fp8
Cogito v2.1 671B
EssentialAI Rnj-1 Instruct
LFM2-24B-A2B
Arize AI Qwen 2 1.5B Instruct
Gemma 3N E4B Instruct
Mistral Small (24B) Instruct 25.01
Meta Llama 3 8B Instruct Lite
Meta Llama 3.3 70B Instruct Turbo
Qwen2.5 7B Instruct Turbo
Qwen3-VL-8B-Instruct
Qwen3 235B A22B Thinking 2507 FP8
Qwen3 235B A22B Instruct 2507 FP8 Throughput
Qwen3 Coder 480B A35B Instruct Fp8
Qwen3 Coder Next Fp8
Qwen3.5 9B FP8
Deepseek V3.1
DeepSeek R1-0528
OpenAI GPT-OSS 20B
OpenAI GPT-OSS 120B
Kimi K2.5
MiniMax M2.5 FP4
Qwen3.5 397B A17b
GLM 5 Fp4
Mixtral-8x7B Instruct v0.1