You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
A real-world deployment study [1] reported mainland-Chinese terms
slipping past the linter in published zh-TW articles, blockquote
citation contexts producing ~50 false positives across a 72-article
corpus, and ASCII quotes auto-converted to 「」 inside YAML
frontmatter breaking downstream parsers.
User-facing additions:
- '--consistency' reports mixed regional usage of one concept (both
線程 and 執行緒 in the same document). Groups by the rule's "english"
anchor; skips TM-suppressed terms.
- '--exempt-blockquotes' (CLI + '[markdown]' config) excludes
pulldown-cmark 'Tag::BlockQuote' ranges from scanning. Off by default:
adopted blockquote prose is real content.
- YAML frontmatter preserves ASCII '"' / ''' scalar delimiters.
Body prose still converts to 「」.
- '[glossary]' section in '.zhtw-mcp.toml': banned / preferred /
proper_nouns lists. Banned terms inject synthetic Errors that TM
cannot downgrade; proper_nouns suppress matching issues; both honor
exclusion zones.
- Per-rule 'editorial_confidence' (low / medium / high) flows through
issue inflation into MCP explain output. Low forces
auto_fix_safe = false and needs_review = true. 優化, 算法, 場景
tagged low because both regional forms are valid zh-TW.
Calque-audit refinements:
- 消息 gains positional_clues; 好消息 / 壞消息 / 消息來源 no longer
fire.
- Symmetric 元資料 rule mirrors 元數據 — both use to: [] plus english:
"metadata", surfacing the English original as the preferred form.
詮釋資料 and 後設資料 (NAER terminology bank) remain unflagged as
acceptable zh-TW alternatives.
- Real-world regression fixture pins the 14 documented blind-spot terms.
[1] https://ai-muninn.com/zh-TW/blog/zhtw-mcp-calque-blindspot-sweep
0 commit comments