
Quatarly API

A unified, OpenAI-compatible API that gives you access to Claude, Gemini, and GPT models through a single key. Drop-in replacement for any OpenAI SDK, Cursor, Factory AI, Claude Code, OpenCode, and more.

Overview

Quatarly provides a unified REST API that proxies requests to Anthropic (Claude), Google (Gemini), and OpenAI (GPT) through a single endpoint and a single API key. All responses conform to the OpenAI Chat Completions schema, making it a true drop-in replacement anywhere you currently use api.openai.com.

Fully OpenAI-compatible. Point your existing code at https://api.quatarly.cloud/v1, swap the key, and everything works — streaming, tool calling, system prompts, and all.
All models use the OpenAI Chat Completions request format. Quatarly handles translation to the native provider format internally — you never have to change your payload structure.

Authentication

Every request must include your Quatarly API key as a Bearer token in the Authorization header.

HTTP Header
Authorization: Bearer your-api-key-here

Get your free key at api.quatarly.cloud/api-key.

Keep your key secret. Never commit it to source code, expose it in client-side JavaScript, or share it publicly. Treat it like a password.
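
A common way to keep the key out of source code is to load it from an environment variable at startup. A minimal Python sketch (the QUATARLY_API_KEY variable name is just a convention for illustration, not something the API requires):

```python
import os

def build_auth_headers() -> dict:
    # Read the key from the environment so it never appears in source control.
    # QUATARLY_API_KEY is an illustrative name; any variable name works.
    api_key = os.environ.get("QUATARLY_API_KEY")
    if not api_key:
        raise RuntimeError("QUATARLY_API_KEY is not set")
    return {
        "Authorization": f"Bearer {api_key}",
        "Content-Type": "application/json",
    }
```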

Base URL

Base URL https://api.quatarly.cloud
OpenAI-compatible (v1) https://api.quatarly.cloud/v1

All endpoints live under /v1. The difference is just what base URL you give each client — the client appends the rest automatically.

How clients use the base URL:

OpenAI SDK / curl / OpenCode — you set base_url = "https://api.quatarly.cloud/v1" and the SDK appends /chat/completions. Works for all models (Claude, Gemini, GPT).

Claude Code / Anthropic SDK — you set ANTHROPIC_BASE_URL = "https://api.quatarly.cloud/" (root) and the client appends /v1/messages itself, which uses the Anthropic Messages API format. Claude models only.
| Endpoint | Method | Format | Models |
|---|---|---|---|
| /v1/chat/completions | POST | OpenAI | All (Claude, Gemini, GPT) |
| /v1/messages | POST | Anthropic | Claude only — if /v1/chat/completions doesn't work for you with a Claude model, use this |
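
Concretely, the client-side difference is just how the final URL is assembled. A quick sketch of the full URLs each kind of client ends up calling:

```python
# OpenAI-style clients take the /v1 base and append the chat endpoint.
openai_base = "https://api.quatarly.cloud/v1"
chat_url = openai_base + "/chat/completions"

# Claude Code takes the root base and appends the full /v1/messages path itself.
anthropic_base = "https://api.quatarly.cloud/"
messages_url = anthropic_base.rstrip("/") + "/v1/messages"

print(chat_url)      # https://api.quatarly.cloud/v1/chat/completions
print(messages_url)  # https://api.quatarly.cloud/v1/messages
```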

Rate Limits

Rate limits are enforced per API key. Limits vary by key tier and model. A 429 Too Many Requests response is returned when exceeded.

| Limit Type | Trial Key | Full Key |
|---|---|---|
| Requests / minute (RPM) | 70 | Custom (per plan) |
| Monthly credits | Limited | Plan-based |
| Concurrent requests | 5 | Unlimited |
Credits system: Each request deducts credits based on tokens used. Different models have different credit weights — Claude Opus costs more than Haiku, for example. Check your usage in the management portal.
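
A standard way to handle 429 responses is exponential backoff. A minimal sketch, assuming only that your HTTP client exposes the status code on the response object:

```python
import time

def backoff_delays(max_retries: int = 5, base: float = 1.0) -> list:
    # Exponential backoff schedule: 1s, 2s, 4s, 8s, 16s with the defaults.
    return [base * (2 ** attempt) for attempt in range(max_retries)]

def call_with_retry(send_request, max_retries: int = 5, base: float = 1.0):
    # send_request() is any zero-argument callable returning an object
    # with a .status_code attribute (e.g. a requests.Response).
    for delay in backoff_delays(max_retries, base):
        response = send_request()
        if response.status_code != 429:
            return response
        time.sleep(delay)  # back off before retrying
    raise RuntimeError("rate limited after all retries")
```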

All Models

All models below are accessible via the standard OpenAI Chat Completions API format using your Quatarly key.

Claude (Anthropic)

  • claude-sonnet-4-6-thinking (Anthropic · Sonnet, provider: anthropic)
  • claude-opus-4-6-thinking (Anthropic · Opus, provider: anthropic)
  • claude-haiku-4-5-20251001 (Anthropic · Haiku, provider: anthropic)

Gemini (Google)

  • gemini-3.1-pro (Google · Pro, provider: openai compat)
  • gemini-3-flash (Google · Flash, provider: openai compat)

GPT (OpenAI)

  • gpt-5.1 (OpenAI, provider: openai)
  • gpt-5.1-codex (OpenAI · Codex, provider: openai)
  • gpt-5.1-codex-max (OpenAI · Codex Max, provider: openai)
  • gpt-5.2 (OpenAI, provider: openai)
  • gpt-5.2-codex (OpenAI · Codex, provider: openai)
  • gpt-5.3-codex (OpenAI · Codex, provider: openai)
  • gpt-5.4 (OpenAI, provider: openai)

Factory AI Droid

Connect Factory AI Droid to Quatarly to access all models with a single API key. The setup script patches ~/.factory/settings.json automatically with all model entries.

  • Install Factory AI

    powershell
    irm https://app.factory.ai/cli/windows | iex
    bash
    curl -fsSL https://app.factory.ai/cli | sh
  • Create a Factory Account & Login

    Go to app.factory.ai and create a free account. Then run droid in your terminal and log in to create ~/.factory/settings.json.

  • Run the Setup Script

    You need your Quatarly API key (qua_trail_... or qua_...).

    powershell
    irm https://raw.githubusercontent.com/himanshu91081/Quatarly-setup/main/add-quatarly-models.ps1 -OutFile add-quatarly-models.ps1; .\add-quatarly-models.ps1
    bash
    curl -fsSL https://raw.githubusercontent.com/himanshu91081/Quatarly-setup/main/add-quatarly-models.sh -o add-quatarly-models.sh && bash add-quatarly-models.sh

    The script will prompt for your key and add an entry for every Quatarly model to settings.json. Running it again with a new key safely updates existing entries without creating duplicates.

Verify Setup

powershell
Select-String -Pattern "customModels" -Path "$env:USERPROFILE\.factory\settings.json" -Context 0,5
bash
grep -A 5 "customModels" ~/.factory/settings.json

Expected snippet in settings.json:

json
"customModels": [
  {
    "model":       "claude-sonnet-4-6-thinking",
    "id":          "custom:claude-sonnet-4-6-thinking-0",
    "baseUrl":     "https://api.quatarly.cloud/",
    "apiKey":      "your-api-key",
    "provider":    "anthropic",
    "displayName": "claude-sonnet-4-6-thinking"
  },
  {
    "model":       "gpt-5.1",
    "id":          "custom:gpt-5.1-5",
    "baseUrl":     "https://api.quatarly.cloud/v1",
    "apiKey":      "your-api-key",
    "provider":    "openai",
    "displayName": "gpt-5.1"
  }
]
A backup of your original settings.json is saved as settings.json.backup before any changes. The script requires Python 3.

Claude Code

Use Claude Code as a CLI coding agent routed through Quatarly. No Anthropic account needed — just your Quatarly key.

  • Install Claude Code

    bash
    npm install -g @anthropic-ai/claude-code
  • Set Environment Variables

    Option A — Setup script (recommended, persists across restarts):

    powershell
    irm https://raw.githubusercontent.com/himanshu91081/Quatarly-setup/main/set-claude-env.ps1 -OutFile set-claude-env.ps1; .\set-claude-env.ps1 -ApiKey "your-api-key-here"
    bash
    curl -fsSL https://raw.githubusercontent.com/himanshu91081/Quatarly-setup/main/set-claude-env.sh -o set-claude-env.sh && bash set-claude-env.sh your-api-key-here

    Option B — Set manually for current session only:

    bash
    export ANTHROPIC_BASE_URL="https://api.quatarly.cloud/"
    export ANTHROPIC_AUTH_TOKEN="your-api-key-here"
    export ANTHROPIC_DEFAULT_HAIKU_MODEL="claude-haiku-4-5-20251001"
    export ANTHROPIC_DEFAULT_SONNET_MODEL="claude-sonnet-4-6-thinking"
    export ANTHROPIC_DEFAULT_OPUS_MODEL="claude-opus-4-6-thinking"
    powershell
    $env:ANTHROPIC_BASE_URL             = "https://api.quatarly.cloud/"
    $env:ANTHROPIC_AUTH_TOKEN           = "your-api-key-here"
    $env:ANTHROPIC_DEFAULT_HAIKU_MODEL  = "claude-haiku-4-5-20251001"
    $env:ANTHROPIC_DEFAULT_SONNET_MODEL = "claude-sonnet-4-6-thinking"
    $env:ANTHROPIC_DEFAULT_OPUS_MODEL   = "claude-opus-4-6-thinking"
    | Variable | Value |
    |---|---|
    | ANTHROPIC_BASE_URL | https://api.quatarly.cloud/ |
    | ANTHROPIC_AUTH_TOKEN | Your Quatarly API key |
    | ANTHROPIC_DEFAULT_HAIKU_MODEL | claude-haiku-4-5-20251001 |
    | ANTHROPIC_DEFAULT_SONNET_MODEL | claude-sonnet-4-6-thinking |
    | ANTHROPIC_DEFAULT_OPUS_MODEL | claude-opus-4-6-thinking |
    After using the setup script, run source ~/.zshrc (macOS) or source ~/.bashrc (Linux) to pick up the changes in the current terminal. GUI apps require a full logout/restart.
    Watch: Claude Code Setup Guide
  • Launch Claude Code

    bash
    claude

    Claude Code will route all requests through Quatarly using your key and credit balance.
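
If Claude Code doesn't seem to pick up the configuration, a quick sanity check is to confirm all five variables are visible in the shell that launched it. A small sketch:

```python
import os

REQUIRED_VARS = [
    "ANTHROPIC_BASE_URL",
    "ANTHROPIC_AUTH_TOKEN",
    "ANTHROPIC_DEFAULT_HAIKU_MODEL",
    "ANTHROPIC_DEFAULT_SONNET_MODEL",
    "ANTHROPIC_DEFAULT_OPUS_MODEL",
]

def missing_vars(env=os.environ) -> list:
    # Return the names of any required variables that are unset or empty.
    return [name for name in REQUIRED_VARS if not env.get(name)]
```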

OpenCode

Use OpenCode as a terminal AI coding assistant routed through Quatarly. No OpenAI account needed — just your Quatarly key.

  • Install OpenCode

    bash
    npm install -g opencode-ai
  • Create the Config File

    Edit or create ~/.config/opencode/opencode.json:

    json (~/.config/opencode/opencode.json)
    {
        "$schema": "https://opencode.ai/config.json",
        "provider": {
            "openai": {
                "options": {
                    "baseURL": "https://api.quatarly.cloud/v1",
                    "apiKey":  "your-api-key-here"
                }
            }
        },
        "model": "openai/gpt-5.3-codex"
    }

    Create it automatically from the terminal:

    powershell
    $dir = "$env:USERPROFILE\.config\opencode"
    if (!(Test-Path $dir)) { New-Item -ItemType Directory -Force -Path $dir }
    @'
    {
        "$schema": "https://opencode.ai/config.json",
        "provider": {
            "openai": {
                "options": {
                    "baseURL": "https://api.quatarly.cloud/v1",
                    "apiKey": "your-api-key-here"
                }
            }
        },
        "model": "openai/gpt-5.3-codex"
    }
    '@ | Set-Content "$dir\opencode.json"
    bash
    mkdir -p ~/.config/opencode
    cat > ~/.config/opencode/opencode.json << 'EOF'
    {
        "$schema": "https://opencode.ai/config.json",
        "provider": {
            "openai": {
                "options": {
                    "baseURL": "https://api.quatarly.cloud/v1",
                    "apiKey": "qua_trail_your-key-here"
                }
            }
        },
        "model": "openai/gpt-5.3-codex"
    }
    EOF
  • Launch OpenCode

    bash
    opencode

    OpenCode routes all requests through Quatarly. Switch models with /model inside the session — all Quatarly GPT, Gemini, and Claude models appear under the openai provider.

Chat Completions

POST /v1/chat/completions

Send a chat message to any model. The request and response formats are identical to those of the OpenAI Chat Completions API.

Request Body

| Parameter | Type | Required | Description |
|---|---|---|---|
| model | string | required | Model ID from the models list (e.g. gpt-5.3-codex) |
| messages | array | required | Array of message objects with role and content |
| stream | boolean | optional | Stream tokens via SSE. Default: false |
| max_tokens | integer | optional | Max output tokens. Model default if omitted |
| temperature | number | optional | Sampling temperature 0–2. Default: 1 |
| top_p | number | optional | Nucleus sampling. Default: 1 |
| system | string | optional | System prompt (alternative to a system role in messages) |
| tools | array | optional | Tool definitions for function calling |

Example Request

json
{
  "model": "claude-sonnet-4-6-thinking",
  "messages": [
    { "role": "system", "content": "You are a helpful assistant." },
    { "role": "user",   "content": "Explain quantum entanglement simply." }
  ],
  "stream": false,
  "max_tokens": 1024
}

Response Format

json
{
  "id": "chatcmpl-xyz123",
  "object": "chat.completion",
  "created": 1750000000,
  "model": "claude-sonnet-4-6-thinking",
  "choices": [
    {
      "index": 0,
      "message": {
        "role": "assistant",
        "content": "Quantum entanglement is when two particles..."
      },
      "finish_reason": "stop"
    }
  ],
  "usage": {
    "prompt_tokens": 32,
    "completion_tokens": 118,
    "total_tokens": 150
  }
}
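
Because every model returns this same shape, one extraction helper works for Claude, Gemini, and GPT alike. A minimal sketch operating on the already-parsed JSON body:

```python
def extract_reply(response: dict) -> tuple:
    # Pull the assistant text and total token usage out of a
    # Chat Completions response body (a dict parsed from JSON).
    content = response["choices"][0]["message"]["content"]
    total_tokens = response["usage"]["total_tokens"]
    return content, total_tokens
```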

Parameters Reference

Common parameters across all models. Provider-specific parameters are passed through transparently.

| Parameter | Type | Default | Notes |
|---|---|---|---|
| temperature | float | 1.0 | 0 = deterministic, 2 = very random |
| max_tokens | int | model max | Hard cap on output length |
| stream | bool | false | SSE streaming. Use -N flag with curl |
| top_p | float | 1.0 | Nucleus sampling probability cutoff |
| stop | string[] | none | Stop sequences (up to 4) |
| presence_penalty | float | 0 | GPT models only |
| frequency_penalty | float | 0 | GPT models only |
| tools | array | none | Function calling tool definitions |
| tool_choice | string/object | "auto" | Tool selection mode |
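
As an illustration of the tools and tool_choice parameters, here is a minimal function-calling payload in the OpenAI tool schema. The get_weather function is hypothetical; your application supplies the real definitions:

```python
payload = {
    "model": "claude-sonnet-4-6-thinking",
    "messages": [{"role": "user", "content": "What's the weather in Paris?"}],
    "tools": [
        {
            "type": "function",
            "function": {
                # Hypothetical tool, shown only to illustrate the schema.
                "name": "get_weather",
                "description": "Get current weather for a city",
                "parameters": {
                    "type": "object",
                    "properties": {"city": {"type": "string"}},
                    "required": ["city"],
                },
            },
        }
    ],
    "tool_choice": "auto",
}
```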

cURL Examples

Basic Chat

bash
curl -X POST "https://api.quatarly.cloud/v1/chat/completions" \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer your-api-key-here" \
  -d '{
    "model": "gpt-5.3-codex",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'

Streaming Response

bash
curl -X POST "https://api.quatarly.cloud/v1/chat/completions" \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer your-api-key-here" \
  -N \
  -d '{
    "model": "claude-sonnet-4-6-thinking",
    "stream": true,
    "messages": [{"role": "user", "content": "Write a poem about the ocean."}]
  }'
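
With stream: true the body arrives as Server-Sent Events: each data: line carries a JSON chunk, and the stream ends with a data: [DONE] sentinel. A minimal parser sketch for lines already split from the response body:

```python
import json

def parse_sse_chunks(lines):
    # Yield the text delta from each `data:` line, stopping at [DONE].
    for line in lines:
        if not line.startswith("data: "):
            continue
        body = line[len("data: "):].strip()
        if body == "[DONE]":
            break
        chunk = json.loads(body)
        delta = chunk["choices"][0].get("delta", {}).get("content")
        if delta:
            yield delta
```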

Claude with System Prompt

bash
curl -X POST "https://api.quatarly.cloud/v1/chat/completions" \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer your-api-key-here" \
  -d '{
    "model": "claude-opus-4-6-thinking",
    "messages": [
      {"role": "system", "content": "You are an expert Python developer."},
      {"role": "user",   "content": "Refactor this function for readability."}
    ],
    "max_tokens": 2048
  }'

Gemini with High Temperature

bash
curl -X POST "https://api.quatarly.cloud/v1/chat/completions" \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer your-api-key-here" \
  -d '{
    "model": "gemini-3.1-pro",
    "messages": [{"role": "user", "content": "Brainstorm 10 startup ideas."}],
    "temperature": 1.4,
    "max_tokens": 1024
  }'

Claude via /v1/messages (Anthropic format)

If /v1/chat/completions isn't working with a Claude-only tool, try the Anthropic Messages endpoint instead. Note max_tokens is required and the system prompt is a top-level field.

bash
curl -X POST "https://api.quatarly.cloud/v1/messages" \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer your-api-key-here" \
  -H "anthropic-version: 2023-06-01" \
  -d '{
    "model": "claude-sonnet-4-6-thinking",
    "max_tokens": 1024,
    "system": "You are a helpful assistant.",
    "messages": [
      {"role": "user", "content": "Explain transformers in ML simply."}
    ]
  }'
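
The same call from Python, shown here only as payload construction (send it with any HTTP client). Note the two differences from the OpenAI shape: system is a top-level field and max_tokens is mandatory.

```python
def build_messages_payload(model: str, user_text: str,
                           system: str, max_tokens: int = 1024) -> dict:
    # Anthropic Messages format: the system prompt lives at the top level,
    # never in the messages array, and max_tokens must always be present.
    return {
        "model": model,
        "max_tokens": max_tokens,
        "system": system,
        "messages": [{"role": "user", "content": user_text}],
    }
```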

Python (OpenAI SDK)

Install the official OpenAI library and point it at Quatarly — no other changes needed.

bash
pip install openai

Basic Usage

python
from openai import OpenAI

client = OpenAI(
    api_key="your-api-key-here",
    base_url="https://api.quatarly.cloud/v1",
)

response = client.chat.completions.create(
    model="claude-sonnet-4-6-thinking",
    messages=[
        {"role": "user", "content": "Summarise the last decade of AI progress."}
    ],
    max_tokens=1024,
)

print(response.choices[0].message.content)

Streaming

python
stream = client.chat.completions.create(
    model="gpt-5.3-codex",
    messages=[{"role": "user", "content": "Write a sorting algorithm."}],
    stream=True,
)

for chunk in stream:
    delta = chunk.choices[0].delta.content
    if delta:
        print(delta, end="", flush=True)

Node.js (OpenAI SDK)

bash
npm install openai

Basic Usage

javascript
import OpenAI from "openai";

const client = new OpenAI({
  apiKey: "your-api-key-here",
  baseURL: "https://api.quatarly.cloud/v1",
});

const response = await client.chat.completions.create({
  model: "gemini-3.1-pro",
  messages: [{ role: "user", content: "What is the capital of France?" }],
});

console.log(response.choices[0].message.content);

Streaming

javascript
const stream = await client.chat.completions.create({
  model: "claude-haiku-4-5-20251001",
  messages: [{ role: "user", content: "Tell me a short story." }],
  stream: true,
});

for await (const chunk of stream) {
  process.stdout.write(chunk.choices[0]?.delta?.content ?? "");
}

Models Table

| Model ID | Family | Provider | Base URL |
|---|---|---|---|
| claude-sonnet-4-6-thinking | Claude | anthropic | ...quatarly.cloud/ |
| claude-opus-4-6-thinking | Claude | anthropic | ...quatarly.cloud/ |
| claude-haiku-4-5-20251001 | Claude | anthropic | ...quatarly.cloud/ |
| gemini-3.1-pro | Gemini | openai compat | ...quatarly.cloud/v1 |
| gemini-3-flash | Gemini | openai compat | ...quatarly.cloud/v1 |
| gpt-5.1 | GPT | openai | ...quatarly.cloud/v1 |
| gpt-5.1-codex | GPT | openai | ...quatarly.cloud/v1 |
| gpt-5.1-codex-max | GPT | openai | ...quatarly.cloud/v1 |
| gpt-5.2 | GPT | openai | ...quatarly.cloud/v1 |
| gpt-5.2-codex | GPT | openai | ...quatarly.cloud/v1 |
| gpt-5.3-codex | GPT | openai | ...quatarly.cloud/v1 |
| gpt-5.4 | GPT | openai | ...quatarly.cloud/v1 |

Error Codes

All errors follow the OpenAI error response format with an additional code field.

| HTTP Status | Meaning | Common Cause |
|---|---|---|
| 400 | Bad Request | Missing required field, invalid model name, malformed JSON |
| 401 | Unauthorized | Missing or invalid API key |
| 402 | Payment Required | Monthly credits exhausted for your key |
| 429 | Too Many Requests | Rate limit exceeded — wait and retry |
| 500 | Internal Server Error | Upstream provider error; retry with backoff |
| 503 | Service Unavailable | Provider temporarily unreachable |
json — error response shape
{
  "error": {
    "message": "Rate limit exceeded. Please retry after 60 seconds.",
    "type":    "rate_limit_error",
    "code":    "rate_limit_exceeded"
  }
}
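
Because the error body follows one shape, you can branch on the HTTP status regardless of which upstream provider failed. A minimal sketch, with the retryable statuses taken from the error table (429, 500, 503):

```python
def classify_error(status: int, body: dict) -> str:
    # Decide how to react to an error response body parsed from JSON.
    message = body.get("error", {}).get("message", "unknown error")
    if status in (429, 500, 503):
        return f"retry: {message}"       # transient: back off and retry
    if status == 402:
        return f"fatal (credits exhausted): {message}"
    return f"fatal: {message}"           # 400/401 etc.: fix the request or key
```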

Script Files

| File | Platform | Purpose |
|---|---|---|
| add-quatarly-models.ps1 | Windows (PowerShell) | Add all models to Factory AI Droid |
| add-quatarly-models.sh | macOS / Linux (Bash) | Add all models to Factory AI Droid |
| set-claude-env.ps1 | Windows (PowerShell) | Set Claude Code environment variables globally |
| set-claude-env.sh | macOS / Linux (Bash) | Set Claude Code environment variables globally |
Safe to re-run. All scripts are idempotent — running again with a new key updates existing entries without duplicates. A .backup copy of your original config is saved automatically. Python 3 is required for the Factory scripts.