Happy Horse 1.0 is now on ModelsLab

Try Now
Skip to main content
Available now on ModelsLab · Language Model

Writer: Palmyra X5Scale Agents Million Tokens

Build Agents Faster Cheaper

1M Context

Process Million-Token Prompts

Handles full 1M-token inputs in 22 seconds for deep document analysis.

Sub-Second Tools

Invoke Multi-Step Functions

Delivers ~300ms tool-calling latency for real-time agent workflows.

Cost Efficient

3-4x Less Than GPT-4

Priced at $0.60/M input, $6/M output tokens for scalable enterprise use.

Examples

See what Writer: Palmyra X5 can create

Copy any prompt below and try it yourself in the playground.

Contract Summary

Analyze this 500-page contract PDF. Extract key clauses on termination, payment terms, and liabilities. Output as structured JSON with risk levels.

RFP Response

Review 200-page RFP document. Generate compliant proposal sections for pricing, timeline, and compliance. Align with brand guidelines.

Data Pipeline Code

Write Python code to process customer feedback logs from S3. Summarize themes, call sentiment API, output dashboard-ready CSV.

Regulatory Report

Summarize 1000-page regulatory filing. Identify compliance gaps, suggest fixes, invoke database for historical data comparison.

For Developers

A few lines of code.
Agents Million Tokens Fast

ModelsLab handles the infrastructure: fast inference, auto-scaling, and a developer-friendly API. No GPU management needed.

  • Serverless: scales to zero, scales to millions
  • Pay per token, no minimums
  • Python and JavaScript SDKs, plus REST API
import requests
response = requests.post(
"https://modelslab.com/api/v7/llm/chat/completions",
json={
"key": "YOUR_API_KEY",
"prompt": "",
"model_id": ""
}
)
print(response.json())

FAQ

Common questions about Writer: Palmyra X5

Read the docs

Writer: Palmyra X5 is an enterprise LLM with 1M-token context for agentic workflows. It supports multi-step tool calls and processes long prompts in 22 seconds. Ideal for RAG and large-scale agents.

Scores 48.7 on BigCodeBench, 53% on Longbench v2, 19.1% on MRCR 8-needle. Matches GPT-4.1 retrieval at lower cost. Excels in code generation and reasoning.

Writer: Palmyra X5 offers 1M context vs GPT-4.1's limits, 3-4x cheaper tokens. Sub-second tools enable agent scaling unavailable in GPT-4.1.

Supports over 30 languages with adaptive reasoning. Suitable for global enterprise agents and compliance workflows.

Available via LLM endpoint. Use standard chat completions with long context. No setup for 1M tokens or tool calls.

Powers RAG pipelines, document summarization, code generation, multi-agent systems. Pre-built agents for regulatory analysis and file summaries.

Ready to create?

Start generating with Writer: Palmyra X5 on ModelsLab.