# Language Learning Research Collection of open-source language learning tools and resources for evaluation. ## Cloning This repo uses git submodules. To get everything in one shot: ```bash git clone --recurse-submodules https://github.com/YOU/language-learning.git ``` If you already cloned without `--recurse-submodules`: ```bash git submodule update --init --recursive ``` ## What's In Here ### repos/ | Repo | What It Does | | -------------------------------------------------------------------------------- | -------------------------------------------------------------------------------------------------------------------- | | [companion](https://github.com/shakedzy/companion) | Private AI language tutor — Whisper + ChatGPT + Google TTS. Read, write, talk, listen in native and target language. | | [discute](https://github.com/5uru/Discute) | Speaking practice app — designed specifically for conversational practice in a new language. | | [ai-english-tutor](https://github.com/jasonkang14/ai-english-tutor) | Interactive AI English tutor using OpenAI APIs. | | [linguacafe](https://github.com/simjanos-dev/LinguaCafe) | Self-hosted web app for reading-based vocabulary acquisition. 27+ languages. Docker-based. | | [librelingo](https://github.com/kantord/LibreLingo) | Community-driven open language learning platform. Course editor + web app. | | [libretranslate](https://github.com/LibreTranslate/LibreTranslate) | Self-hosted machine translation API. No Google/Azure dependency. | | [speechbrain](https://github.com/speechbrain/speechbrain) | Conversational AI toolkit — speech recognition, speaker verification, TTS. Building block for custom tools. | | [awesome-language-learning](https://github.com/Vuizur/awesome-language-learning) | Curated list of open-source language learning projects and resources. | ### Speaking-Focused (most relevant to mic-based learning) - **companion** — closest to a full "talk to practice" solution out of the box - **discute** — purpose-built for speaking practice - **ai-english-tutor** — English-specific but demonstrates the tutor pattern ### Infrastructure / Building Blocks - **speechbrain** — STT/TTS toolkit if building something custom - **libretranslate** — self-hosted translation backend - **linguacafe** — reading/vocab complement to speaking practice ### Reference - **awesome-language-learning** — index of more projects worth exploring ## Commercial Apps Worth Knowing About These can't be cloned but are worth evaluating alongside the open-source options: - [ChatGPT Voice Mode](https://chatgpt.com) — free-form conversation in any language (needs Plus sub) - [Langua](https://langua.io) (~$8-12/mo) — native-speaker-cloned voices, 23 languages, real-time feedback - [Talkio AI](https://www.talkio.ai) (~$10-15/mo) — 40+ languages, 134 dialects, pronunciation feedback - [Speak](https://www.speak.com) (~$15-20/mo) — structured curriculum, good for beginners - [ELSA Speak](https://elsaspeak.com) (~$6-10/mo) — phoneme-level pronunciation analysis (English only) ## The DIY Stack If none of these fit, the common architecture for a custom speaking tutor: 1. **STT**: [Whisper](https://github.com/openai/whisper) (local or API) 2. **LLM**: [Claude API](https://docs.anthropic.com/en/docs) or [GPT-4o](https://platform.openai.com/docs) with a tutor system prompt 3. **TTS**: [OpenAI TTS API](https://platform.openai.com/docs/guides/text-to-speech), [Coqui TTS](https://github.com/coqui-ai/TTS) (open source), or speechbrain ## What Linguists Recommend (2025-2026) - Massive input first (listening + reading), then speaking practice - AI conversation daily for low-pressure reps - Human tutor 1-2x/week for nuance - Spaced repetition for vocab ([Anki](https://apps.ankiweb.net), etc.) - Consistency over intensity