@rasbt gentle ping in case this bug report got overlooked 🙂
The line `words = text.replace("\n", " \n ").split()` in the `encode()` function in `ch02/05_bpe-from-scratch/bpe-from-scratch-simple.ipynb` is a bit confusing. Adding `.replace("\n", " \n ")` does not change the split result: `split()` with no arguments splits on any run of whitespace, so `'\n'` is not kept as a separate token but dropped entirely. This seems to be a bug? Can I get clarification on this, please?
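To make the behavior concrete, here is a minimal sketch that reproduces it outside the notebook. The sample string and variable names are made up for illustration. The last part shows one possible way to keep `'\n'` as its own token by splitting on a literal space; that is an assumption about the intended behavior, not necessarily the fix the notebook should adopt.

```python
# Hypothetical sample text, not taken from the notebook
text = "Hello world\nThis is a test"

# The line as written in encode(): pad newlines with spaces, then split()
with_replace = text.replace("\n", " \n ").split()

# The same call without the replace step
without_replace = text.split()

# split() with no arguments splits on any whitespace run, including "\n",
# so both results are identical and neither contains "\n"
print(with_replace == without_replace)
print("\n" in with_replace)

# Possible alternative (assumption): split on a literal space so "\n" survives,
# and drop the empty strings produced by consecutive spaces
kept = [w for w in text.replace("\n", " \n ").split(" ") if w]
print(kept)
```

With this variant, `'\n'` shows up in the word list as a separate entry, which seems to be what the `.replace("\n", " \n ")` step was meant to achieve.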