Arcee AI: Trinity Large Preview (free)
Frontier MoE Free Now
Scale Efficiently. Infer Fast.
Sparse MoE
400B Total 13B Active
Activates 13B params per token for efficient inference on long contexts up to 512K.
Agentic Workflows
Handles Tools Chains
Navigates complex agent harnesses and multi-step tool actions in production apps.
Creative Chat
Roleplay Storytelling Excels
Delivers strong literacy creativity and character consistency beyond pure reasoning models.
Examples
See what Arcee AI: Trinity Large Preview (free) can create
Copy any prompt below and try it yourself in the playground.
Code Agent
“You are a senior software engineer. Analyze this Python codebase for bugs optimize the sorting algorithm and generate unit tests. Codebase: [insert code]. Output refactored code and tests.”
Long Context Summary
“Summarize this 100K token technical report on sparse MoE architectures. Highlight key innovations in expert routing training efficiency and benchmark results. Extract actionable insights for deployment.”
Creative Story
“Write a 2000-word sci-fi story about an AI agent exploring abandoned megastructures on Mars. Maintain consistent character voices include technical details on sparse MoE systems and build tension across 10 chapters.”
Tool Chain Workflow
“Plan a multi-step workflow to research deploy and monitor a new LLM API. Use tools for web search code generation and logging. Execute steps sequentially with reasoning at each stage.”
For Developers
A few lines of code.
400B MoE. 13B code.
ModelsLab handles the infrastructure: fast inference, auto-scaling, and a developer-friendly API. No GPU management needed.
- Serverless: scales to zero, scales to millions
- Pay per token, no minimums
- Python and JavaScript SDKs, plus REST API
import requestsresponse = requests.post("https://modelslab.com/api/v7/llm/chat/completions",json={"key": "YOUR_API_KEY","prompt": "","model_id": ""})print(response.json())
Ready to create?
Start generating with Arcee AI: Trinity Large Preview (free) on ModelsLab.