Go to file

Steve White 0e522feddf Clean up memory management in cbx-audiobook.py - Use singleton pattern from TTSService for efficient model management - Remove complex manual memory cleanup code - Simplify CLI arguments by removing redundant memory management options - Load model once at start, let singleton handle efficient reuse - Remove keep-model-loaded and cleanup-interval options - Streamline generation logic to match backend service patterns 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com>		2025-06-27 00:01:13 -05:00
.aider.tags.cache.v4	Singleton manages memory well and fast.	2025-06-26 14:56:53 -05:00
.note	Working layout.	2025-06-05 17:38:12 -05:00
.opencode	Singleton manages memory well and fast.	2025-06-26 14:56:53 -05:00
backend	Singleton manages memory well and fast.	2025-06-26 14:56:53 -05:00
frontend	updated with startup script	2025-06-17 16:26:55 -05:00
speaker_data	Singleton manages memory well and fast.	2025-06-26 14:56:53 -05:00
.aider.chat.history.md	Singleton manages memory well and fast.	2025-06-26 14:56:53 -05:00
.aider.input.history	Singleton manages memory well and fast.	2025-06-26 14:56:53 -05:00
.env.example	updated with startup script	2025-06-17 16:26:55 -05:00
.gitignore	updated with startup script	2025-06-17 16:26:55 -05:00
AGENTS.md	updated with startup script	2025-06-17 16:26:55 -05:00
API_REFERENCE.md	Add API reference	2025-06-06 23:18:47 -05:00
CLAUDE.md	Add dialog script save/load functionality and CLAUDE.md	2025-06-06 10:05:58 -05:00
ENVIRONMENT_SETUP.md	updated with startup script	2025-06-17 16:26:55 -05:00
OpenCode.md	Singleton manages memory well and fast.	2025-06-26 14:56:53 -05:00
README-dialog-generator.md	Gradio app added, cbx-dialog-generate.py added	2025-06-04 08:30:07 -05:00
README.md	Major update: Enhanced memory management, configurable silence gaps, and file organization	2025-06-04 12:37:52 -05:00
babel.config.cjs	Working layout.	2025-06-05 17:38:12 -05:00
cbx-audiobook.py	Clean up memory management in cbx-audiobook.py	2025-06-27 00:01:13 -05:00
cbx-dialog-generate.py	Gradio app added, cbx-dialog-generate.py added	2025-06-04 08:30:07 -05:00
cbx-generate.py	Gradio app added, cbx-dialog-generate.py added	2025-06-04 08:30:07 -05:00
chatterbox-test.py	Gradio app added, cbx-dialog-generate.py added	2025-06-04 08:30:07 -05:00
chatterbox_tts.py.bak	Gradio app added, cbx-dialog-generate.py added	2025-06-04 08:30:07 -05:00
gradio_app.py	Major update: Enhanced memory management, configurable silence gaps, and file organization	2025-06-04 12:37:52 -05:00
import_helper.py	Add cbx-audiobook.py and import_helper.py	2025-06-26 15:04:55 -05:00
package-lock.json	Working layout.	2025-06-05 17:38:12 -05:00
package.json	Working layout.	2025-06-05 17:38:12 -05:00
requirements.txt	Patched up to work on m3 laptop. Need to fix the location specific shit.	2025-06-07 16:06:38 -05:00
sample-dialog.md	Gradio app added, cbx-dialog-generate.py added	2025-06-04 08:30:07 -05:00
setup.py	updated with startup script	2025-06-17 16:26:55 -05:00
speakers.yaml	Major update: Enhanced memory management, configurable silence gaps, and file organization	2025-06-04 12:37:52 -05:00
start_servers.py	Singleton manages memory well and fast.	2025-06-26 14:56:53 -05:00
storage_service.py	Updated note directory- gradio interface working.	2025-06-05 09:20:19 -05:00
test.py	Patched up to work on m3 laptop. Need to fix the location specific shit.	2025-06-07 16:06:38 -05:00
test1-wav	Gradio app added, cbx-dialog-generate.py added	2025-06-04 08:30:07 -05:00

README.md

Chatterbox TTS Gradio App

This Gradio application provides a user interface for text-to-speech generation using the Chatterbox TTS model. It supports both single utterance generation and multi-speaker dialog generation with configurable silence gaps.

Features

Single Utterance Generation: Generate speech from text using a selected speaker
Dialog Generation: Create multi-speaker conversations with configurable silence gaps
Speaker Management: Add/remove speakers with custom audio samples
Memory Optimization: Automatic model cleanup after generation
Output Organization: Files saved in single_output/ and dialog_output/ directories

Getting Started

Clone the repository:

git clone https://github.com/your-username/chatterbox-test.git

Install dependencies:
```
pip install -r requirements.txt
```
Prepare speaker samples:
- Create a speaker_samples/ directory
- Add audio samples (WAV format) for each speaker
- Update speakers.yaml with speaker names and file paths
Run the app:
```
python gradio_app.py
```

Usage

Single Utterance Tab

Select a speaker from the dropdown
Enter text to synthesize
Adjust generation parameters as needed
Click "Generate Speech"

Dialog Generation Tab

Add speakers using the speaker configuration section

Enter dialog in the format:

Speaker1: "Hello, how are you?"
Speaker2: "I'm doing well!"
Silence: 0.5
Speaker1: "What are your plans for today?"

Set output base name
Click "Generate Dialog"

File Organization

Generated single utterances are saved to single_output/
Dialog generation files are saved to dialog_output/
Concatenated dialog files have _concatenated.wav suffix
All files are zipped together for download

Memory Management

The app automatically:

Cleans up the TTS model after each generation
Frees GPU memory (for CUDA/MPS devices)
Deletes intermediate tensors to minimize memory footprint

Troubleshooting

"Skipping unknown speaker": Add the speaker first using the speaker configuration
"Sample file not found": Verify the audio file exists in speaker_samples/
Memory issues: Try enabling "Re-initialize model each line" for long dialogs