Skip to content

Commit fbdfe34

Browse files
Add a vocabulary policy and update issue templates.
PiperOrigin-RevId: 919461838
1 parent f104d11 commit fbdfe34

4 files changed

Lines changed: 101 additions & 3 deletions

File tree

.github/ISSUE_TEMPLATE/typing_issue.yml

Lines changed: 8 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -9,13 +9,19 @@ body:
99
> [!IMPORTANT]
1010
> Please use the Google Form for reporting typing issues.
1111
>
12-
> https://forms.gle/oXgwbqzvSyVHNBSc7
12+
> https://forms.gle/oXgwbqzvSyVHNBSc7 (Conversion accuracy feedback form)
1313
>
1414
> The
1515
> [reports](https://docs.google.com/spreadsheets/d/1AAK8wiTfIhj339Rb98lIRwgGfHndVz-jFXFNXZNQyeo/pubhtml?gid=649291758&single=true)
1616
> will be updated in 10 mins.
1717
>
18-
> See https://github.com/google/mozc/issues/1056
18+
> For inappropriate words, please use the following form.
19+
> The reports are not publicly shared.
20+
>
21+
> https://forms.gle/4wwwhkqsXtiDgPcYA (Inappropriate vocabulary report form)
22+
>
23+
> For our policy on language models and dictionary data, see
24+
> [Vocabulary Policy](https://github.com/google/mozc/blob/master/VOCABULARY_POLICY.md).
1925
2026
- type: input
2127
id: na

CONTRIBUTING.md

Lines changed: 9 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -18,7 +18,15 @@ sign a new one.
1818

1919
## Coding style guide
2020

21-
The Mozc project follows Google Style Guide. You PR is also requested to follow
21+
The Mozc project follows Google Style Guide. Your PR is also requested to follow
2222
the style guide.
2323

2424
https://google.github.io/styleguide/
25+
26+
## Policies on Vocabulary and Conversion Results
27+
28+
Please refer to [Vocabulary Policy](VOCABULARY_POLICY.md) before submitting
29+
suggestions or issues regarding language models, vocabularies (dictionary data),
30+
or conversion results. We accept such suggestions exclusively through dedicated
31+
forms, and do not engage in individual discussions or accept Pull Requests on
32+
these topics.

README.md

Lines changed: 3 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -31,6 +31,9 @@ In this repository, *Mozc* means the second definition unless otherwise noted.
3131

3232
Detailed differences between Google Japanese Input and Mozc are described in [About Branding](docs/about_branding.md).
3333

34+
For policies on vocabulary and conversion results, see
35+
[Vocabulary Policy](VOCABULARY_POLICY.md).
36+
3437
Build Instructions
3538
------------------
3639

VOCABULARY_POLICY.md

Lines changed: 81 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,81 @@
1+
## Policy on Vocabulary and Conversion Results
2+
3+
The Mozc Project establishes the following policy regarding the construction of
4+
Japanese language models and vocabularies (dictionary data).
5+
6+
### 1. Properties of Dictionary Data
7+
8+
* **Creation Based on Statistical Information:** Mozc's language models and
9+
vocabularies are created automatically and mechanically based on proprietary
10+
corpora and objective statistical information.
11+
12+
* **No Position on Correctness or Social Validity:** The vocabularies and
13+
conversion results included in the project do not imply an endorsement or
14+
rejection of the "correctness," "appropriateness," or "social validity" of
15+
any specific words by the project.
16+
17+
### 2. Feedback and Individual Support
18+
19+
* **Use of Dedicated Channels:** Requests regarding the addition or deletion
20+
of vocabulary or conversion results are accepted exclusively through
21+
dedicated forms.
22+
23+
+ Conversion accuracy feedback form: https://forms.gle/oXgwbqzvSyVHNBSc7
24+
+ Inappropriate vocabulary report form:
25+
https://forms.gle/4wwwhkqsXtiDgPcYA
26+
27+
* **No Individual Responses or Discussions:** We do not engage in **individual
28+
responses or discussions** regarding specific vocabularies or conversion
29+
results on GitHub Issues, Discussions, or Pull Requests.
30+
31+
* **Closing and Locking of Posts:** If posts regarding these topics are made
32+
on GitHub Issues, Discussions, etc., they will, in principle, be closed and
33+
locked mechanically without prior notice (such as individual comments) based
34+
on this policy. Thank you for your understanding.
35+
36+
### 3. Background of This Policy and Open Source Options
37+
38+
* **For Sustainable Operation:** This policy is established to maintain and
39+
operate the project in a healthy and sustainable manner under limited
40+
development resources. The maintainer team strives to improve overall
41+
usability and conversion accuracy through statistical information updates,
42+
technical improvements, and feedback.
43+
44+
* **Encouragement to Fork:** Mozc is open-source software. If the standard
45+
vocabulary provided by the project does not align with your specific use
46+
case or philosophy, we welcome you to freely fork the project within the
47+
bounds of the license and build your own vocabulary or dictionary.
48+
49+
--------------------------------------------------------------------------------
50+
51+
## 語彙集合および変換結果に関する方針
52+
53+
Mozc プロジェクトでは、日本語の言語モデルおよび語彙集合(辞書データ)の構築において、以下の方針を定めます。
54+
55+
### 1. 辞書データの性質について
56+
57+
* **統計情報に基づいた作成:** Mozc の言語モデルや語彙集合は、独自のコーパスや客観的な統計情報に基づいて自動的・機械的に作成されています。
58+
59+
* **正誤や社会的妥当性の不保持:**
60+
収録されている語彙や変換結果は、特定の言葉の「正しさ」「適切性」「社会的妥当性」をプロジェクトとして肯定、または否定するものではありません。
61+
62+
### 2. フィードバックと個別対応について
63+
64+
* **専用窓口の利用:** 語彙の追加・削除や変換結果に関する要望は、すべて専用フォームからのみ受け付けます。
65+
66+
+ 変換精度に関するフォーム https://forms.gle/oXgwbqzvSyVHNBSc7
67+
+ 不適切語に関するフォーム https://forms.gle/4wwwhkqsXtiDgPcYA
68+
69+
* **個別回答・議論の不実施:** 変換結果や語彙集合の個別事例について、IssueやDiscussion、Pull
70+
Request等での **個別の回答や議論は一律で行いません。**
71+
72+
* **投稿のクローズとロック:**
73+
GitHub上のIssueやDiscussion等にこれらに関する投稿がなされた場合、本方針に基づき、原則として個別コメント等の事前通告なく機械的にクローズおよび会話のロックの処置がとられます。ご了承ください。
74+
75+
### 3. 本方針の背景とOSSとしての選択肢
76+
77+
* **持続可能な運営のため:**
78+
本措置は、限られた開発リソースのなかで、プロジェクトを健全かつ持続可能に維持・運営していくためのものです。運営チームは、統計情報のアップデートや技術的改善およびフィードバックを通じて、全体の利便性と変換精度の向上に努めています。
79+
80+
* **フォークの推奨:**
81+
Mozcはオープンソースソフトウェア(OSS)です。プロジェクトが提供する標準の語彙集合が特定の用途や思想に合致しない場合は、ライセンスの範囲内で自由にプロジェクトをフォークし、独自の語彙集合や辞書を構築されることを歓迎します。

0 commit comments

Comments
 (0)