This tutorial walks you through a complete translation session — from setup to submitting your results and seeing them on the dashboard.
Make sure you've completed Getting Started:
gh auth logindone- Repo cloned and
uv syncrun uv run qebench doctorpasses all checks
Always start by pulling the latest code and data:
uv run qebench updateThis runs git pull --rebase and uv sync so you have everyone's latest
contributions and any CLI updates.
uv run qebench statsYou should see the coverage panel and domain table. The dataset is pre-seeded with 314 economics and math terms — enough to start practicing immediately.
Let's try a 3-entry session with basic-level terms:
uv run qebench translate -n 3 --difficulty basicYour GitHub username is detected automatically from gh auth.
For each entry, you'll see a panel like:
╭──────────────── TERM ────────────────╮
│ │
│ inflation │
│ │
│ Context: "The rate of inflation │
│ determines the real interest rate." │
│ │
│ term-001 · economics · basic │
╰──────────────────────────────────────╯
? Your translation (中文):
Terms with context sentences (populated by qebench update) show an example
from a QuantEcon lecture to help you choose the right translation.
Type your Chinese translation and press Enter.
After typing your translation, you'll be asked how confident you are:
? How confident are you?
1 — Guessing
2 — Uncertain
❯ 3 — Reasonable
4 — Confident
5 — Very confident
Pick the level that matches your certainty. You can also add optional notes (context, reasoning, alternative phrasing) — just press Enter to skip.
After you commit your answer, the reference translation is revealed with a similarity score:
╭────────── Reference ──────────╮
│ Your answer: 通货膨胀 │
│ Reference: 通货膨胀 │
│ Similarity: 100% │
│ Matches the reference. │
╰───────────────────────────────╯
If your translation differs (below 85% similarity), you'll be asked why:
? Why does your translation differ?
Formal/written register (书面语)
Informal/spoken register (口语)
❯ Regional preference
Contextual — depends on usage
Abbreviation or shorthand
Alternative technical term
Other (explain in notes)
? Notes (optional — explain further):
This is the most valuable data you can provide. Chinese has formal and informal representations, regional variants, and context-dependent translations. Documenting why translations vary helps us improve the translator.
After completing all entries, you'll see:
╭─── Session Summary ───╮
│ Completed: 3/3 │
│ XP earned: +30 │
│ Total XP: 30 │
╰────────────────────────╯
Each completed translation earns 10 XP. Your results are saved locally to
results/translations/{your-username}.jsonl and your XP to
results/xp/{your-username}.json.
Push your work to GitHub so it appears on the dashboard:
uv run qebench submitThis pulls the latest changes, commits your results, and pushes — all in one command.
╭────── Submitted as alice ──────╮
│ ✓ 1 result file(s) │
╰────────────────────────────────╯
Once pushed, the CI workflow rebuilds the dashboard with your activity and XP.
Focus on a specific domain:
uv run qebench translate -d mathematicsOr crank up the difficulty:
uv run qebench translate --difficulty advanced -n 10- Add entries: See Contributing Entries to grow the dataset
- Check progress:
qebench statsshows overall dataset coverage - View dashboard: check the live dashboard for leaderboard and activity
- See the full CLI Reference for all options