Commit 6e5601d
Staging pt_br TN to main (#421)
* Add Portuguese (PT) text normalization: cardinal, ordinal, decimal, fraction (#403)
* Add Portuguese (PT) text normalization: cardinal, ordinal, decimal, fraction
Signed-off-by: Mai Anh <palasek182@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
Signed-off-by: Mai Anh <palasek182@gmail.com>
* date and time semiotic classes
Signed-off-by: Mai Anh <palasek182@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
Signed-off-by: Mai Anh <palasek182@gmail.com>
* update sh files
Signed-off-by: Mai Anh <palasek182@gmail.com>
* Update Portuguese text normalization tutorial with enhanced examples and outputs
- Changed the language parameter in the Normalizer instance from 'en' to 'pt'.
- Added detailed output examples for the normalizer's methods, including documentation for `__doc__` and `normalize()`.
- Updated example input string to reflect a more complex Portuguese sentence for normalization.
- Adjusted execution counts for code cells to ensure proper order of execution.
This update aims to improve the clarity and usability of the tutorial for Portuguese text normalization.
Signed-off-by: Mai Anh <palasek182@gmail.com>
* remove currently unused file
Signed-off-by: Mai Anh <palasek182@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* minor updates and punctuation
Signed-off-by: Mai Anh <palasek182@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
---------
Signed-off-by: Mai Anh <palasek182@gmail.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
* PT TN: money, measure, telephone, electronic (#416)
* PT TN: money, measure, telephone, electronic
Adds semiotic classes and tests on top of staging/pt-br_tn; includes
cardinal fix for X00 + 01–09 and Sparrowhawk script updates.
Signed-off-by: Mai Anh <palasek182@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* fix bugs based on review cardinal, fraction, money, measure
Signed-off-by: Mai Anh <palasek182@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* modify test case time
Signed-off-by: Mai Anh <palasek182@gmail.com>
* modify per Mariana's review
Signed-off-by: Mai Anh <palasek182@gmail.com>
---------
Signed-off-by: Mai Anh <palasek182@gmail.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
* update jenkins cache
Signed-off-by: Mariana Graterol Fuenmayor <marianag@nvidia.com>
* add init to whitelist
Signed-off-by: Mariana Graterol Fuenmayor <marianag@nvidia.com>
---------
Signed-off-by: Mai Anh <palasek182@gmail.com>
Signed-off-by: Mariana Graterol Fuenmayor <marianag@nvidia.com>
Signed-off-by: Mariana <47233618+mgrafu@users.noreply.github.com>
Co-authored-by: Mai Anh <palasek182@gmail.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
1 parent 1a38df0 commit 6e5601d
107 files changed
Lines changed: 4647 additions & 17 deletions
File tree
- nemo_text_processing/text_normalization
- pt
- data
- date
- electronic
- fractions
- measure
- money
- numbers
- ordinals
- telephone
- time
- whitelist
- taggers
- verbalizers
- tests/nemo_text_processing/pt
- data_text_normalization
- tools/text_processing_deployment
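The commit message mentions a "cardinal fix for X00 + 01–09" without saying what was wrong. The sketch below is only an illustration of the Portuguese rule such a fix presumably concerns: a hundreds word joined to a single-digit remainder with "e" (101 is "cento e um", while exactly 100 is "cem"). It is plain Python, not the pynini grammar added in this commit. The function name `cardinal_pt` and the word tables are hypothetical, and the Brazilian spellings ("quatorze", "dezesseis") are an assumption about the intended variant.

```python
# Illustrative only: NOT the NeMo grammar from this commit.
# Spells out Brazilian Portuguese cardinals in the range 1-999.

UNITS = [
    "", "um", "dois", "três", "quatro", "cinco", "seis", "sete", "oito", "nove",
    "dez", "onze", "doze", "treze", "quatorze", "quinze", "dezesseis",
    "dezessete", "dezoito", "dezenove",
]
TENS = [
    "", "", "vinte", "trinta", "quarenta", "cinquenta",
    "sessenta", "setenta", "oitenta", "noventa",
]
HUNDREDS = [
    "", "cento", "duzentos", "trezentos", "quatrocentos", "quinhentos",
    "seiscentos", "setecentos", "oitocentos", "novecentos",
]


def cardinal_pt(n: int) -> str:
    """Return the spoken form of n (hypothetical helper, 1 <= n <= 999)."""
    if not 1 <= n <= 999:
        raise ValueError("only 1-999 is covered by this sketch")
    if n == 100:
        return "cem"  # exactly 100 is "cem"; "cento" is used for 101-199

    hundreds, rest = divmod(n, 100)
    parts = []
    if hundreds:
        parts.append(HUNDREDS[hundreds])
    if rest:
        if rest < 20:
            # Covers the X00 + 01-09 case: the remainder is a single word.
            parts.append(UNITS[rest])
        else:
            tens, units = divmod(rest, 10)
            parts.append(TENS[tens] if units == 0 else f"{TENS[tens]} e {UNITS[units]}")
    # Hundreds and remainder are joined with "e".
    return " e ".join(parts)


if __name__ == "__main__":
    for value in (100, 101, 205, 909):
        print(value, cardinal_pt(value))
```

The actual outputs expected by the commit live in the test data under `tests/nemo_text_processing/pt/data_text_normalization`; those files, not this sketch, are the reference.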