TalkNatives POC

A conversational language learning app for Yoruba, Hausa, and Igbo using Gemini 2.5 Flash 8B and YarnGPT.

Tech Stack

Frontend: React + Vite + react-media-recorder
Backend: FastAPI + Pydantic AI
AI/ASR: Google Gemini 2.5 Flash (via Pydantic AI SDK)
TTS: YarnGPT API
Database: PostgreSQL (Supabase)
Deployment: Google Cloud Run (2 services)

Local Development

Prerequisites

Docker & Docker Compose
Python 3.11+
Node.js 20+

Environment Setup

Copy the example environment file:

cp backend/.env.example backend/.env

Fill in your API keys in backend/.env:

GOOGLE_API_KEY=your_google_api_key
YARNGPT_API_KEY=your_yarngpt_api_key
DATABASE_URL=your_supabase_postgres_url
CORS_ALLOW_ORIGINS=["http://localhost:5173"]

Run with Docker Compose

docker-compose up --build

Frontend: http://localhost:5173
Backend: http://localhost:8080
Health Check: http://localhost:8080/healthz

Run Locally (without Docker)

Backend:

cd backend
pip install -r requirements.txt
alembic upgrade head
uvicorn app.main:app --reload --port 8080

Frontend:

cd frontend
npm install
npm run dev

Testing the POC Chain

Test the full Gemini → YarnGPT pipeline:

cd backend
python scripts/poc_chain.py path/to/audio.webm yoruba

Target: Total latency < 4 seconds

Database Migrations

Create a new migration:

cd backend
alembic revision --autogenerate -m "description"

Apply migrations:

alembic upgrade head

Deployment

GitHub Secrets Required

Set these in your GitHub repository settings:

DOCKERHUB_USERNAME
DOCKERHUB_TOKEN
GCP_SA_KEY (Service Account JSON)
GCP_PROJECT_ID
GCP_REGION (e.g., us-central1)
GOOGLE_API_KEY
YARNGPT_API_KEY
DATABASE_URL
CORS_ALLOW_ORIGINS (JSON array string, e.g., ["https://your-frontend-url"])

Deploy

Push to main branch:

git push origin main

The GitHub Actions workflow will:

Build and push Docker images to DockerHub
Deploy backend to Cloud Run
Deploy frontend to Cloud Run with backend URL

Project Structure

.
├── backend/
│   ├── app/
│   │   ├── api/v1/
│   │   │   └── chat.py          # Chat endpoint
│   │   ├── ai/
│   │   │   └── agent.py         # Pydantic AI Agent
│   │   ├── core/
│   │   │   └── config.py        # Settings
│   │   ├── db/
│   │   │   ├── base.py          # SQLAlchemy setup
│   │   │   └── session.py       # DB session
│   │   ├── models/
│   │   │   ├── conversation.py
│   │   │   └── turn.py
│   │   ├── tts/
│   │   │   └── yarngpt.py       # YarnGPT client
│   │   └── main.py              # FastAPI app
│   ├── alembic/                 # Migrations
│   ├── scripts/
│   │   └── poc_chain.py         # Test script
│   ├── Dockerfile
│   ├── entrypoint.sh
│   └── requirements.txt
├── frontend/
│   ├── src/
│   │   ├── App.tsx              # Main UI
│   │   └── main.tsx
│   ├── Dockerfile
│   ├── package.json
│   └── vite.config.ts
├── .github/workflows/
│   └── deploy.yml               # CI/CD
└── docker-compose.yml

Voice Mapping

Yoruba: idera (female)
Hausa: zainab (female)
Igbo: amaka (female)

API Endpoints

`POST /api/v1/chat`

Query Parameters:

language: yoruba | hausa | igbo (default: yoruba)

Body:

file: Audio file (webm/wav)

Response:

{
  "transcription": "User's speech transcribed",
  "correction": "Grammar feedback (if needed)",
  "reply": "Tutor's response in target language",
  "audio": "base64-encoded audio"
}

Performance Targets

Total Latency: < 4 seconds (Gemini + YarnGPT)
Gemini Response: ~1-2s
YarnGPT TTS: ~1-2s

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 30 Commits
.github/workflows		.github/workflows
backend		backend
frontend		frontend
.env.example		.env.example
.gitignore		.gitignore
IMPLEMENTATION_SUMMARY.md		IMPLEMENTATION_SUMMARY.md
README.md		README.md
SCENARIO_UPGRADE_SUMMARY.md		SCENARIO_UPGRADE_SUMMARY.md
START_HERE.md		START_HERE.md
docker-compose.dev.yml		docker-compose.dev.yml
docker-compose.yml		docker-compose.yml
repomix-output.md		repomix-output.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

TalkNatives POC

Tech Stack

Local Development

Prerequisites

Environment Setup

Run with Docker Compose

Run Locally (without Docker)

Testing the POC Chain

Database Migrations

Deployment

GitHub Secrets Required

Deploy

Project Structure

Voice Mapping

API Endpoints

`POST /api/v1/chat`

Performance Targets

License

About

Uh oh!

Releases

Packages

Languages

jaypee15/talknative

Folders and files

Latest commit

History

Repository files navigation

TalkNatives POC

Tech Stack

Local Development

Prerequisites

Environment Setup

Run with Docker Compose

Run Locally (without Docker)

Testing the POC Chain

Database Migrations

Deployment

GitHub Secrets Required

Deploy

Project Structure

Voice Mapping

API Endpoints

POST /api/v1/chat

Performance Targets

License

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

`POST /api/v1/chat`

Packages