Second Brain

Brain

A working collection of raw notes, linked insights, source summaries, diagrams, tables, and post seeds before they become polished writing.

Notes / links / research

52 / 52 notes

Folder

brain/insights

3 folders · 2 files

Chatterbox Is A Speech-Token LLM Wrapped Around A Flow-Matching Decoder

working

Chatterbox is easiest to understand as a two-stage speech system. The first stage, `T3`, is a transformer language-model-style token generator that maps text plus voice conditions into discrete S3 speech tokens. The seco

chatterbox-tts-under-the-hood.md

F5-TTS Shows That Alignment Can Be Learned as Flow-Matched Speech Infilling

working

F5-TTS is best understood as an engineering refinement of E2 TTS, not as a separate conceptual break. E2 TTS showed that zero-shot text-to-speech can drop the usual duration model, grapheme-to-phoneme conversion, and pho

f5-tts-e2-tts-flow-matching.md

Brain

Notes / links / research

brain/insights

static-analysis

voice-agents

write-code-ai-agents-love

Chatterbox Is A Speech-Token LLM Wrapped Around A Flow-Matching Decoder

F5-TTS Shows That Alignment Can Be Learned as Flow-Matched Speech Infilling