Nghimmo API - Compatible with Cursor / Cline / Claude Code
| Item | Value |
|---|---|
| Base URL | https://api.nghimmo.com/v1 |
| API Key | sk-xxxxxxxxxxxx (provided to you) |
| Token check page | api.nghimmo.com/check |
| Format support | OpenAI & Anthropic compatible |
Most powerful — for hard tasks needing deep reasoning.
nghi/claude-opus-4.8
nghi/claude-opus-4.8-thinking
nghi/claude-opus-4.7
nghi/claude-opus-4.7-thinking
nghi/claude-opus-4.6
nghi/claude-opus-4.6-thinking
nghi/claude-opus-4.5
nghi/claude-opus-4.5-thinking
Balanced quality and cost — recommended for daily use.
nghi/claude-sonnet-4.6
nghi/claude-sonnet-4.6-thinking
nghi/claude-sonnet-4.5
nghi/claude-sonnet-4.5-thinking
nghi/claude-sonnet-4
nghi/claude-sonnet-4-thinking
Fast, cheap on tokens — for light work (typos, renames, quick questions).
nghi/claude-haiku-4.5
nghi/claude-haiku-4.5-thinking
GPT-style names (still run Claude under the hood) and nghi/auto (auto-select).
nghi/gpt-4o
nghi/gpt-4
nghi/auto
Pick the tab matching the tool you use. Every setup shares:
https://api.nghimmo.com/v1sk-... (provided to you)Fastest way: download the file below, copy it into your code folder and double-click. Enter your API Key and you're ready to go (nothing else to install).
⬇️ Download for Windows (.bat) 🍎 Download for Mac (.command)CLAUDE-CODE-CLI.bat above.sk-...) and press Enter.CLAUDE-CODE-CLI.command above.chmod +x CLAUDE-CODE-CLI.command
sk-...) and press Enter. Claude Code opens in that folder..command file automatically points to Nghimmo server for that session, no need to edit settings.json. If you prefer permanent configuration, see below.
Create file ~/.claude/settings.json:
{
"env": {
"ANTHROPIC_BASE_URL": "https://api.nghimmo.com",
"ANTHROPIC_API_KEY": "sk-xxxxxxxxxxxx",
"ANTHROPIC_MODEL": "nghi/claude-opus-4.8",
"ANTHROPIC_SMALL_FAST_MODEL": "nghi/claude-haiku-4.5"
}
}
ANTHROPIC_BASE_URL has no /v1 suffixANTHROPIC_SMALL_FAST_MODEL is required — missing it will break Bash / Auto modenghi/ prefixset ANTHROPIC_BASE_URL=https://api.nghimmo.com
set ANTHROPIC_API_KEY=sk-xxxxxxxxxxxx
set ANTHROPIC_MODEL=nghi/claude-opus-4.8
set ANTHROPIC_SMALL_FAST_MODEL=nghi/claude-haiku-4.5
claude
/compact every 30–50 turns to reduce contextnpm install -g @anthropic-ai/claude-code
For users with the Claude Code extension in VS Code, Cursor or Antigravity. Install the extension once, then download the file matching your tool and double-click it to auto-open.
.bat file above (matching your tool)..bat file, enter your API Key (sk-...) and press Enter..command file above, copy it to your project folder.chmod +x MO-CLAUDE-VSCODE.command
.command file. If macOS blocks it, right-click → Open → Open.sk-...) and press Enter. The tool opens automatically; click the Claude Code icon in the side bar to start.Cursor uses the OpenAI format. Configure it under Settings → Models.
Settings → Models (or press Ctrl + Shift + J).https://api.nghimmo.com/v1
sk-... into the OpenAI API Key field.nghi/claude-sonnet-4.6, then select it to use.Cline is a VS Code extension. It supports both OpenAI and Anthropic formats.
OpenAI Compatible.https://api.nghimmo.com/v1
sk-...nghi/claude-sonnet-4.6Anthropic.https://api.nghimmo.comsk-..., pick a model and use it.Roo Code (a fork of Cline) is also a VS Code extension, configured the same way.
OpenAI Compatible.https://api.nghimmo.com/v1
sk-...nghi/claude-sonnet-4.6 (or another model).
Continue (a VS Code / JetBrains extension) is configured via config.json.
config.json.models array as below:{
"models": [
{
"title": "Nghimmo Sonnet",
"provider": "openai",
"model": "nghi/claude-sonnet-4.6",
"apiBase": "https://api.nghimmo.com/v1",
"apiKey": "sk-xxxxxxxxxxxx"
}
]
}
Any app that supports OpenAI or Anthropic compatible APIs works (ChatBox, LobeChat, NextChat, Open WebUI...).
Base URL : https://api.nghimmo.com/v1
API Key : sk-xxxxxxxxxxxx
Model : nghi/claude-sonnet-4.6
Base URL : https://api.nghimmo.com
API Key : sk-xxxxxxxxxxxx
Model : nghi/claude-sonnet-4.6
/v1/chat/completions (OpenAI) or /v1/messages (Anthropic).
Tokens are mostly consumed by context (history + attached files), not by the answer itself. Every message resends the whole conversation — the longer the session, the more each turn costs. Use the tips below to make your quota last longer.
/clear when switching topics — start fresh, drop unrelated history./compact every 30–50 turns — condense the conversation while keeping key context.-thinking variants cost more tokens — use only when truly needed./clear and /compact.
/compact or /clear and retry. This is a client-side issue (context exceeded the model limit), not a server error.
Visit the page below, paste your API Key and click "Check" to see: