Catalog explainer

APAC model routing snapshot

For the live public model inventory, use /models. This page explains APAC pricing and routing behavior.

ModelProviderInput / 1MOutput / 1MContextAPAC regionsLatency (SG/TYO/SYD)Residency
Amazon Titan Embed Text v2
Titan
Amazon$0.028,192Tokyo, Seoul, Taiwan, Mumbai, Sydney, New Zealand40ms / 58ms / 51msin-region
BGE Base English v1.5
Bge
BAAI$0.02$0.02512Singapore40ms / 58ms / 51msin-region
BGE Large English v1.5
Bge
BAAI$0.02$0.02512Singapore40ms / 58ms / 51msin-region
BGE-M3 (Multilingual)
Bge
BAAI$0.02$0.028,192Singapore40ms / 58ms / 51msin-region
Multilingual E5 Large
E5
Intfloat$0.02$0.02512Singapore40ms / 58ms / 51msin-region
Amazon Nova Micro
Nova
Amazon$0.04$0.14128,000Singapore, Sydney, Tokyo40ms / 58ms / 51msin-region
Gemma 3 4B Instruct
Gemma
Google$0.04$0.08131,072Singapore40ms / 58ms / 51msin-region
Voxtral Mini 3B
Mistral
Mistral$0.04$0.04131,072Singapore40ms / 58ms / 51msin-region
Nemotron Nano 9B v2
Nemotron
NVIDIA$0.04$0.16131,072Singapore40ms / 58ms / 51msin-region
Google Gemma 3 4B
Gemma
Google$0.05$0.10131,072Tokyo, Seoul, Taiwan, Mumbai, Sydney, New Zealand40ms / 58ms / 51msin-region
Nemotron 3 Nano 30B
Nemotron
NVIDIA$0.05$0.20262,144Singapore40ms / 58ms / 51msin-region
Qwen3 Embedding 8B
Qwen
Qwen$0.05$0.0532,768Singapore18ms / 29ms / 24msin-region
Voxtral Mini 3B
Mistral
Mistral$0.05$0.05131,072Tokyo, Seoul, Taiwan, Mumbai, Sydney, New Zealand40ms / 58ms / 51msin-region
Amazon Nova Lite
Nova
Amazon$0.06$0.24300,000Singapore, Sydney, Tokyo40ms / 58ms / 51msin-region
NVIDIA Nemotron Nano 30B
Nemotron
NVIDIA$0.07$0.29131,072Tokyo, Seoul, Taiwan, Mumbai, Sydney, New Zealand40ms / 58ms / 51msin-region
NVIDIA Nemotron Nano 9B
Nemotron
NVIDIA$0.07$0.28131,072Tokyo, Seoul, Taiwan, Mumbai, Sydney, New Zealand40ms / 58ms / 51msin-region
OpenAI GPT-OSS 20B
Gpt
OpenAI$0.07$0.31131,072Jakarta, Singapore, Malaysia, Thailand, Tokyo, Seoul, Taiwan, Mumbai, Sydney, New Zealand40ms / 58ms / 51msin-region
Amazon Titan Embed Image v1
Titan
Amazon$0.08128Mumbai, Sydney, New Zealand40ms / 58ms / 51msin-region
Gemma 3 27B Instruct
Gemma
Google$0.08$0.45131,072Singapore40ms / 58ms / 51msin-region
Gemma 3 27B Pretrained
Gemma
Google$0.08$0.45131,072Singapore40ms / 58ms / 51msin-region

Showing 1-20 of 98 models

Models & regions

Proprietary and Brightnode-hosted models are available in APAC today, with additional regions and model families rolling out.

Sydney

Proprietary models

Sydney

  • Amazon Nova Lite
  • Amazon Nova Micro
  • Amazon Nova Pro
  • Amazon Titan Embed Image v1
  • Amazon Titan Embed Text v2

plus 51 more models in this region

Best for: Production chat, agents, long context

Singapore

Proprietary models

Singapore

  • Amazon Nova 2 Lite
  • Amazon Nova Lite
  • Amazon Nova Micro
  • Amazon Nova Pro
  • Claude 3 Haiku

plus 29 more models in this region

Best for: Southeast Asia latency, data residency

Singapore

Brightnode-hosted

Singapore

  • BGE Base English v1.5
  • BGE Large English v1.5
  • BGE-M3 (Multilingual)
  • Gemma 3 12B Instruct
  • Gemma 3 27B Instruct

plus 32 more models in this region

Best for: Cost-effective inference, full control, same API

Specify model in your request; we route to the right region and provider automatically.

Get started in minutes

OpenAI-compatible API. Swap the base URL and use your existing code.

1. Get your API key

Sign up at the console, create an API key, and add credits. No long-term contract.

Console → API keys
2. Point your client to Brightnode
# Python (OpenAI SDK)
from openai import OpenAI
client = OpenAI(
  base_url="https://api.brightnode.cloud/v1",
  api_key="YOUR_BRIGHTNODE_API_KEY",
)
response = client.chat.completions.create(
  model="meta-llama/Llama-3.3-70B-Instruct",
  messages=[{"role": "user", "content": "Hello from APAC."}],
)

Same for Node, curl, or any OpenAI-compatible client. We support streaming and embeddings.

3. Docs & limits

Full API reference, model list, and rate limits are in our docs. For agent frameworks, just change the base URL.

Documentation