← Back to model catalog
View full pricing and calculator →
Qwen3 32B
Provider
Brightnode-hosted
Context
131,072 tokens
Pricing
$0.10 / 1M input · $1.20 / 1M output
APAC regions
Singapore
Residency
in-region
Task
Chat
APAC performance
Latency profile
| Singapore | Tokyo | Sydney | |
|---|---|---|---|
| TTFT p50 | 31 | 45 | 39 |
| TTFT p95 | 49 | 72 | 63 |
| E2E latency p50 | 68 | 84 | 77 |
| E2E latency p95 | 104 | 129 | 117 |
Pricing
Input: $0.10 per 1M tokens
Output: $1.20 per 1M tokens
Billing: Per-token, charged against wallet balance
Dedicated endpoint option
- A100 80GB: $4.01/hr (Singapore)
- H100 80GB: $14.29/hr (Singapore)
Capabilities
Reasoning-focused large model tuned for structured analysis, tool use, and long prompts in multilingual production settings.
Best for: Reasoning tasks, Agent tool use, Code generation, Long-context workflows
Quickstart code snippets
Python
from openai import OpenAI
client = OpenAI(
base_url="https://api.brightnode.cloud/v1",
api_key="YOUR_BRIGHTNODE_API_KEY",
)
response = client.chat.completions.create(
model="Qwen/Qwen3-32B",
messages=[{"role": "user", "content": "Hello from APAC"}],
)Node
import OpenAI from "openai";
const client = new OpenAI({
baseURL: "https://api.brightnode.cloud/v1",
apiKey: process.env.BRIGHTNODE_API_KEY,
});
const response = await client.chat.completions.create({
model: "Qwen/Qwen3-32B",
messages: [{ role: "user", content: "Hello from APAC" }],
});Other models on Brightnode
Amazon Nova 2 Lite
Proprietary models · Jakarta, Singapore, Malaysia, Thailand, Tokyo, Seoul, Taiwan
General-purpose inference
Amazon Nova Lite
Proprietary models · Singapore, Sydney, Tokyo
General-purpose inference
Amazon Nova Micro
Proprietary models · Singapore, Sydney, Tokyo
General-purpose inference
Amazon Nova Pro
Proprietary models · Singapore, Sydney, Tokyo
General-purpose inference
