03 β Repository Layout
Vocal10n/
βββ README.md # User-facing quick start.
βββ LICENSE # MIT.
βββ pyproject.toml # Package metadata for `vocal10n`.
βββ setup_env.ps1 # Creates venv_main + venv_tts, installs deps.
βββ start.ps1 / start.bat # Launches GPT-SoVITS subprocess + main app.
βββ initialplan.md # Original project plan (Chinese + English).
βββ promptfile.md # Prompt history / scratchpad.
βββ simple_ui_plan.md # Simple/Pro mode design doc.
βββ simple_ui_validation.md # Manual QA checklist for Simple mode.
βββ current.logs # Last run log capture (transient).
β
βββ config/
β βββ default.yaml # Single source of truth for runtime config.
β βββ context_gaming.txt # Example domain context for translation.
β βββ filters.txt # Hallucination filter list.
β
βββ knowledge_base/
β βββ glossary_general.txt # Default glossary used by the corrector.
β
βββ stt_terms/
β βββ context_gaming.txt # Example STT initial-prompt term list.
β
βββ reference_audio/ # User-provided voice clone references.
β βββ audio_03.txt
β βββ README.md
β
βββ models/ # Local model storage (git-ignored).
β βββ stt/ # FasterWhisper caches.
β βββ llm/Qwen3-4B-Instruct-2507.Q4_K_M.gguf
β βββ tts/ # GPT-SoVITS pretrained weights.
β
βββ output/
β βββ audio/ # WAV recordings (when save_wav).
β βββ subtitles/ # *_source.srt / *_target.srt pairs.
β βββ training_data/ # Future training output.
β
βββ requirements/
β βββ requirements-main.txt # venv_main deps.
β βββ requirements-tts.txt # venv_tts deps (in addition to vendor reqs).
β
βββ src/vocal10n/ # Application package β see chapter 02.
βββ src/vocal10n.egg-info/ # Generated by editable install.
β
βββ vendor/
β βββ GPT-SoVITS/ # Vendored upstream repository.
β βββ Qwen3/ # Vendored Qwen3-TTS source.
β
βββ venvs/
β βββ venv_main/ # Python 3.11 β STT + LLM + UI.
β βββ venv_tts/ # Python 3.11 β GPT-SoVITS server.
β βββ venv_qwen3tts/ # Python 3.10/3.11 β Qwen3-TTS server.
β
βββ temp_qwen3tts/ # Scratch dir used during Qwen3-TTS bring-up.
βββ training/ # Reserved for future training tooling.
β
βββ Vocal10n-prebuild/ # Legacy reference implementation. Not packaged.
βββ doc/ # This documentation set.
Notable Conventions
src/ layout with pyproject.toml. Install with
pip install -e . inside the venv (the setup script does this for you
via the requirements file pulling in the package).
- One YAML config.
config/default.yaml is the only config file
loaded at runtime. UI changes that should persist write back here.
- Vendored models, not submodules. GPT-SoVITS lives under
vendor/GPT-SoVITS/ to pin a known-good revision and to avoid a
network dependency at install time.
- Output filenames are
YYYY-MM-DD_HH-MM-SS_source.srt /
_target.srt, so each session is grouped by start timestamp.