Writer: Palmyra X5
Scale Agents Million Tokens
Build Agents Faster Cheaper
1M Context
Process Million-Token Prompts
Handles full 1M-token inputs in 22 seconds for deep document analysis.
Sub-Second Tools
Invoke Multi-Step Functions
Delivers ~300ms tool-calling latency for real-time agent workflows.
Cost Efficient
3-4x Less Than GPT-4
Priced at $0.60/M input, $6/M output tokens for scalable enterprise use.
Examples
See what Writer: Palmyra X5 can create
Copy any prompt below and try it yourself in the playground.
Contract Summary
“Analyze this 500-page contract PDF. Extract key clauses on termination, payment terms, and liabilities. Output as structured JSON with risk levels.”
RFP Response
“Review 200-page RFP document. Generate compliant proposal sections for pricing, timeline, and compliance. Align with brand guidelines.”
Data Pipeline Code
“Write Python code to process customer feedback logs from S3. Summarize themes, call sentiment API, output dashboard-ready CSV.”
Regulatory Report
“Summarize 1000-page regulatory filing. Identify compliance gaps, suggest fixes, invoke database for historical data comparison.”
For Developers
A few lines of code.
Agents Million Tokens Fast
ModelsLab handles the infrastructure: fast inference, auto-scaling, and a developer-friendly API. No GPU management needed.
- Serverless: scales to zero, scales to millions
- Pay per token, no minimums
- Python and JavaScript SDKs, plus REST API
import requestsresponse = requests.post("https://modelslab.com/api/v7/llm/chat/completions",json={"key": "YOUR_API_KEY","prompt": "","model_id": ""})print(response.json())
Ready to create?
Start generating with Writer: Palmyra X5 on ModelsLab.