v1.0 Available

Inference API for
Yuuki Models

Access YuuKi-OS language models through a simple, OpenAI-compatible REST API. Generate API keys, manage usage, and integrate in minutes.

terminal
$ curl -X POST https://api.yuuki.dev/api/chat \
  -H "Authorization: Bearer yk-xxxxxxxxxx" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "yuuki-best",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'

{
  "id": "chatcmpl-abc123",
  "model": "yuuki-best",
  "choices": [{
    "message": {
      "role": "assistant",
      "content": "Hello! How can I help you today?"
    }
  }]
}

Built for developers

Everything you need to integrate Yuuki models into your applications.

OpenAI Compatible

Drop-in replacement for OpenAI API. Use existing SDKs and tools with minimal changes.

API Key Management

Create, rotate, and manage API keys from the dashboard. Track usage per key.

3 Model Variants

Choose from Yuuki Best, 3.7, and v0.1 depending on your speed and quality needs.

Low Latency

Powered by Hugging Face Inference API for fast, reliable model serving at scale.

Secure by Default

API keys are hashed and stored securely. Row-level security on all user data.

Full Documentation

Complete API docs with examples in cURL, Python, and JavaScript ready to copy.

Available Models

Three model variants optimized for different use cases.

Flagship

Yuuki Best

Highest quality output with best overall reasoning and generation capabilities.

model: "yuuki-best"
Balanced

Yuuki 3.7

Optimal balance between speed and quality for production workloads.

model: "yuuki-3.7"
Fast

Yuuki v0.1

Lightweight first generation model for quick prototyping and testing.

model: "yuuki-v0.1"

Ready to get started?

Create an account and get your API key in seconds.