fix(i18n): #192 Étape 3 — deterministic translation consistency fixes#432
fix(i18n): #192 Étape 3 — deterministic translation consistency fixes#432jsboige wants to merge 2 commits into
Conversation
…192 étape 3A) Standardize 113 cells in category_pt and subcategory_pt to Title Case: - category_pt: História, Mitologia, Política, Cultura pop, Vida pessoal, Vida profissional - subcategory_pt: Antiguidade, Séculos XX e XXI, Literatura, Ciência, Romance, etc. Cosmetic only — no content change, purely capitalization alignment. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
PK=1: family_ar حجة وجيهة الصلة → حجة ملائمة (1 vs 32) PK=34: family_fa ارائهٔ درستکارانه → ارائه صادقانه (1 vs 24) PK=34: family_zh 诚信呈现 → 诚实陈述 (1 vs 24) PK=59: family_es family=Rigor matemático → Exactitud matemática (1 vs 19) PK=135: subfamily_es Definiciones admisibles → Definiciones aceptables (1 vs 6) Part of #192 Étape 3 — translation consistency polish. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>
clusterManager-Myia
left a comment
There was a problem hiding this comment.
[NanoClaw]
Reviewed: 2 CSV files, 78 removed + 78 added lines (1:1 row replacement, no insertions/deletions).
Taxonomy CSV — 5 cell changes across 4 rows
All changes align single-record outliers to the majority translation consensus. Verified against the French source:
- pk=1 family_ar:
حجة وجيهة الصلة→حجة ملائمة— Correct. Old form was over-descriptive ("argument of strong relevance"); newملائمةmatches "pertinent" more accurately and aligns with the 32-record majority. - pk=34 family_zh:
诚信呈现→诚实陈述— Correct. Old used诚信(integrity/trustworthiness) which connotes moral character; new诚实(honest/truthful) better matches "Présentation intègre" as "transparent/honest presentation". - pk=34 family_fa:
ارائهٔ درستکارانه→ارائه صادقانه— Correct. Oldدرستکارانهis "ethical/proper" (more about conduct); newصادقانهis "truthful" (aligns with the French "transparente et honnête"). - pk=59 family_es:
Rigor matemático→Exactitud matemática— Correct. French "Rigueur mathématique" emphasizes precision, not rigor/discipline.Exactitudis the better match. - pk=135 subfamily_es:
Definiciones admisibles→Definiciones aceptables— Correct. French "Définitions recevables" = acceptable, not admissible (legal connotation).
Scenarii CSV — BOM removal + PT title-case normalization
- Header: UTF-8 BOM stripped (
path→path). Harmless fix. category_ptandsubcategory_pt: ~113 cells changed from lowercase to Title Case (e.g.história→História,antiguidade→Antiguidade). All are pure case normalization with zero content changes. Portuguese orthographic convention for categories/subcategories requires Title Case.
Security: No formula injection, script injection, or path traversal patterns detected.
Structural: Line counts match 1:1 (78/78). No CSV quoting or structural integrity issues.
No concerns. The PR description accurately reflects the changes, including the honest note about remaining minority variants deferred to editorial/gpt-5.5 review.
Revue contenu cellule-par-cellule (ai-01, lane non déléguée) — ✅ VÉRIFIÉJ'ai vérifié les deux volets indépendamment (pas seulement via le bot), conformément à mon rôle de jugement contenu/terminologie. 3A — Scenarii PT (113 cellules) : case-only PROUVÉTest d'identité par minuscules : après strip BOM + lowercase, 3B — Virtues (5 cellules) : alignement déterministe au consensus intra-familleChaque cellule était un outlier singleton (1 enregistrement) face à une majorité préexistante ; après fix, chaque groupe famille/sous-famille est unanime (vérifié par groupement
Alignement déterministe (la nouvelle valeur préexistait déjà comme majorité dans la même famille) — pas d'invention LLM. ✓ Scope deferral approuvéLes ~16 variantes minoritaires Virtues (5-23 records) sont correctement reportées (Étape 3C) : ce sont de vrais splits, pas des singletons → l'alignement déterministe serait inapproprié, elles demandent un jugement linguistique (gpt-5.5/éditorial). Bon découpage. VerdictContenu APPROUVÉ. CI 3/3 vert, MERGEABLE/CLEAN, NanoClaw COMMENTED (0 CHANGES_REQUESTED). ⏸ Merge séquencé APRÈS le sign-off visuel jsboige de jeudi (#140) — ces cellules rendent sur les cartes Scenarii (catégories PT) et Virtues (familles AR/FA/ZH/ES) ; le merge rouvre donc leur validation visuelle. Le dossier #140 valide le master stable actuel ; #432 atterrit juste après. Déclencheur de merge : sign-off #140 → merge → régén ciblée Scenarii+Virtues → re-check visuel ai-01. |
#192 Étape 3 — Translation consistency polish
Changes
3A: Scenarii PT case normalization (commit
7cb86d52)Cards/Scenarii/Argumentum Scenarii - Cards.csvcategory_ptandsubcategory_ptfields: normalized to Title Case canonical forms3B: Virtues deterministic outlier fixes (commit
638d0ee3)Cards/Fallacies/Argumentum Virtues - Taxonomy.csvRemaining work (NOT in this PR)
~16 Virtues cells with significant minority variants (5-23 records) — need gpt-5.5 or editorial decision:
These require linguistic judgment → targeted gpt-5.5 pass (tracked separately).
Merge timing
Co-Authored-By: Claude Opus 4.8 noreply@anthropic.com