Get started
Ready-to-use deploy configs for Rime’s models live in therimelabs/rime-baseten-deploy repository. Each model directory contains a Baseten/Truss config.yaml plus a README with full deploy, secrets, and invocation steps.
| Model | Directory |
|---|---|
| Rime Coda v1 — flagship expressive TTS | rime-coda-v1/ |
| Rime Arcana v3 — multilingual expressive | rime-arcana-v3/ |
| Rime Arcana v2 (English) — expressive | rime-arcana-v2-en/ |
| Rime Mist v3 — low-latency conversational | rime-mist-v3/ |
- Clone the repo and
cdinto the model directory you want. - Set three secrets on your Baseten workspace —
gcp_rime_service_account(pulls Rime’s private engine image),rime_license, andrime_api_key. The per-model README walks through each. Engine images are license-gated; contact help@rime.ai for access. - Push the config with the Truss CLI:
- Set the autoscaling concurrency target (each replica handles ~10 concurrent requests before a new one is added). See the model README for the exact Management API call.
Invoke
Once deployed, send inference requests to your model’s sync endpoint. The Rime API key is baked in via therime_api_key secret, so callers authenticate to Baseten only:
rime-baseten-deploy README for your chosen model.
