Rime logo + Cerebrium logo Cerebrium is a serverless infrastructure platform purpose-built for building and scaling AI applications with minimal infrastructure overhead. With ultra-low time-to-first-byte (TTFB) performance, enterprise-grade compliance, and support for global deployments, Cerebrium makes it easy to take applications from prototype to production in minutes. Rime’s text-to-speech (TTS) synthesis model is now available through Cerebrium. With Cerebrium’s Rime integration, you can develop responsive AI voice applications that deliver natural, lifelike interactions.

Get started

Developers can now deploy Rime’s text-to-speech services on Cerebrium:
  1. Create a Rime account and obtain your API key.
  2. Create a secret in Cerebrium named RIME_API_KEY with your Rime key.
  3. Initialize your app with the Cerebrium CLI (v1.39.0 or higher):
cerebrium init rime
  1. Define your deployment using a cerebrium.toml file with the [cerebrium.runtime.rime] configuration. Specify your compute resources (e.g., AMPERE_A10 GPU, memory, CPU), scaling parameters, and region (e.g., us-east-1). See Cerebrium’s documentation for full implementation details.
  2. Deploy with:
cerebrium deploy
Once deployed, you can send real-time TTS inference requests using either:
  • REST API, with an endpoint like:
https://api.cortex.cerebrium.ai/v4/<project-id>/rime
Include your RIME API key in the Authorization header and specify your desired speaker, model, and text.
  • WebSocket interface for streaming audio
Rime handles API authentication via the key you provide, and your containers will scale automatically based on the concurrency and replica settings defined in the TOML file. For full setup instructions, see the Cerebrium documentation.