Commit cb53beb
En names (#42)
* Add support for Financial year and for years between 1000 BC and 1000AD
Signed-off-by: Anand Joseph <anajoseph@nvidia.com>
* Add support for product names and add abbreviations to whitelist
Signed-off-by: Anand Joseph <anajoseph@nvidia.com>
* Add weights for some sequences, exclude 'a' before numeric sequence
Signed-off-by: Anand Joseph <anajoseph@nvidia.com>
* Add tests
Signed-off-by: Anand Joseph <anajoseph@nvidia.com>
* Update cache folder for EN
Signed-off-by: Anand Joseph <anajoseph@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Update FR Cache path
Signed-off-by: Anand Joseph <anajoseph@nvidia.com>
* Move text to TSV files, and some code cleanup
Signed-off-by: Anand Joseph <anajoseph@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Add additional vocabulary, allow singular usage of units to support adjective phrases
Signed-off-by: Anand Joseph <anajoseph@nvidia.com>
* Fix issue with whitelist loader not handling weights correctly
Move cased loader file to graph_utils
Signed-off-by: Anand Joseph <anajoseph@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* insert space between value and unit
Signed-off-by: Anand Joseph <anajoseph@nvidia.com>
* Insert space between measurement and unit. Adjust weight for ordinal
Signed-off-by: Anand Joseph <anajoseph@nvidia.com>
* Update tests
Signed-off-by: Anand Joseph <anajoseph@nvidia.com>
---------
Signed-off-by: Anand Joseph <anajoseph@nvidia.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>1 parent f806134 commit cb53beb
18 files changed
Lines changed: 625 additions & 89 deletions
File tree
- nemo_text_processing
- inverse_text_normalization/en
- data
- taggers
- verbalizers
- text_normalization/en
- tests/nemo_text_processing/en/data_inverse_text_normalization
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
10 | 10 | | |
11 | 11 | | |
12 | 12 | | |
| 13 | + | |
13 | 14 | | |
14 | 15 | | |
15 | | - | |
| 16 | + | |
16 | 17 | | |
17 | 18 | | |
18 | 19 | | |
| |||
21 | 22 | | |
22 | 23 | | |
23 | 24 | | |
| 25 | + | |
24 | 26 | | |
25 | 27 | | |
26 | 28 | | |
| |||
Lines changed: 10 additions & 0 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
| 1 | + | |
| 2 | + | |
| 3 | + | |
| 4 | + | |
| 5 | + | |
| 6 | + | |
| 7 | + | |
| 8 | + | |
| 9 | + | |
| 10 | + | |
Lines changed: 19 additions & 8 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
14 | 14 | | |
15 | 15 | | |
16 | 16 | | |
17 | | - | |
| 17 | + | |
| 18 | + | |
| 19 | + | |
| 20 | + | |
| 21 | + | |
| 22 | + | |
18 | 23 | | |
| 24 | + | |
19 | 25 | | |
20 | 26 | | |
21 | 27 | | |
| |||
30 | 36 | | |
31 | 37 | | |
32 | 38 | | |
33 | | - | |
34 | 39 | | |
35 | 40 | | |
36 | | - | |
| 41 | + | |
37 | 42 | | |
38 | | - | |
| 43 | + | |
| 44 | + | |
39 | 45 | | |
40 | 46 | | |
41 | 47 | | |
| |||
86 | 92 | | |
87 | 93 | | |
88 | 94 | | |
| 95 | + | |
89 | 96 | | |
90 | 97 | | |
91 | 98 | | |
| |||
96 | 103 | | |
97 | 104 | | |
98 | 105 | | |
99 | | - | |
100 | | - | |
| 106 | + | |
| 107 | + | |
| 108 | + | |
| 109 | + | |
| 110 | + | |
| 111 | + | |
101 | 112 | | |
102 | 113 | | |
103 | 114 | | |
| |||
106 | 117 | | |
107 | 118 | | |
108 | 119 | | |
109 | | - | |
| 120 | + | |
110 | 121 | | |
111 | 122 | | |
112 | 123 | | |
| |||
126 | 137 | | |
127 | 138 | | |
128 | 139 | | |
129 | | - | |
| 140 | + | |
130 | 141 | | |
131 | 142 | | |
132 | 143 | | |
| |||
0 commit comments