Question 1

What is say?

Accepted Answer

The 'say' skill adds Text-to-Speech (TTS) to Claude through the r9s audio API. It lets the agent convert text into natural-sounding speech, with a choice of model, voice, and playback speed. The resulting audio is played through a local audio player such as mpv or afplay, giving users spoken feedback and narration.

Question 2

When should I use say?

Accepted Answer

say is useful in the following scenarios:

• Language Learning: Helping users master pronunciation by speaking vocabulary, phonetic transcriptions, and complex sentences aloud.
• Accessibility Support: Providing an audio-based interface for visually impaired users or those who prefer consuming information through listening.
• Content Narration: Automatically reading out long-form articles, summaries, or scripts to allow for hands-free information consumption.
• Auditory Notifications: Using voice output to alert users about task completions, status updates, or important milestones in a workflow.

name	say
description	Text-to-speech output using r9s audio API
compatibility	requires r9s CLI with audio API access and audio player (mpv, ffplay, afplay, or paplay)
author	r9s-ai
version	2.0.0
tags	[tts, audio, speech]
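
The compatibility line names four audio players, any one of which is enough. As a rough illustration of how a caller might check which one is installed before attempting playback, here is a minimal Python sketch. It is not taken from the skill itself: the function name pick_player, the PLAYERS tuple, and the preference order (simply the order in the compatibility line) are assumptions made for this example, and it does not call the r9s CLI at all.

```python
import shutil
from typing import Callable, Optional, Sequence

# Players named in the compatibility line, kept in the order listed there.
# Treating that order as a preference order is an assumption of this sketch.
PLAYERS: Sequence[str] = ("mpv", "ffplay", "afplay", "paplay")


def pick_player(
    candidates: Sequence[str] = PLAYERS,
    which: Callable[[str], Optional[str]] = shutil.which,
) -> Optional[str]:
    """Return the first candidate found on PATH, or None if none is installed.

    The `which` parameter exists only so the lookup can be replaced in tests.
    """
    for name in candidates:
        if which(name) is not None:
            return name
    return None


if __name__ == "__main__":
    player = pick_player()
    print(player or "no supported audio player found on PATH")
```

Run on its own, the script prints the name of the first player it finds, or a short message when none of the four is on PATH, which is a quick way to confirm the audio-player half of the requirement before testing speech output.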

say

When & Why to Use This Skill

Use Cases

Text-to-Speech

Syntax

Configuration

Guidelines

Examples

Requirements