Cerebrium

Cerebrium is a serverless infrastructure platform purpose-built for building and scaling AI applications with minimal infrastructure overhead. With ultra-low time-to-first-byte (TTFB) performance, enterprise-grade compliance, and support for global deployments, Cerebrium makes it easy to take applications from prototype to production in minutes. Rime’s text-to-speech (TTS) synthesis model is now available through Cerebrium. With Cerebrium’s Rime integration, you can develop responsive AI voice applications that deliver natural, lifelike interactions.

Get started

Developers can now deploy Rime’s text-to-speech services on Cerebrium:

Create a Rime account and obtain your API key.
Create a secret in Cerebrium named RIME_API_KEY with your Rime key.
Initialize your app with the Cerebrium CLI (v1.39.0 or higher):

cerebrium init rime

Define your deployment using a cerebrium.toml file with the [cerebrium.runtime.rime] configuration. Specify your compute resources (e.g., AMPERE_A10 GPU, memory, CPU), scaling parameters, and region (e.g., us-east-1). See Cerebrium’s documentation for full implementation details.
Deploy with:

cerebrium deploy

Once deployed, you can send real-time TTS inference requests using either:

REST API, with an endpoint like:

https://api.cortex.cerebrium.ai/v4/<project-id>/rime

Include your RIME API key in the Authorization header and specify your desired speaker, model, and text.

WebSocket interface for streaming audio

Rime handles API authentication via the key you provide, and your containers will scale automatically based on the concurrency and replica settings defined in the TOML file. For full setup instructions, see the Cerebrium documentation.

Documentation

Arcana API reference

Mist v2 API reference

API Metadata

Other APIs

Get started

Documentation

Arcana API reference

Mist v2 API reference

API Metadata

Other APIs

​Get started

Get started