Add sentence tokenization to process longer texts. by askonivala · Pull Request #71 · kamalkraj/BERT-NER

askonivala · 2020-01-24T11:23:30Z

The supported sequence length of BERT is up to 512 tokens. Adding a simple sentence tokenization to API would enable users to process longer texts.

tanmayag78 · 2020-03-08T12:33:10Z

Any other way to handle longer texts as time complexity is higher and it will be inefficient while handling huge text. Like Mitie Ner and Stanford Ner are more efficient for handling longer texts though not as accurate as BERT-NER

Add sentence tokenization to process longer texts.

cfa8094

ntedgi mentioned this pull request Mar 28, 2020

inconsistency between GPU/CPU inference #78

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add sentence tokenization to process longer texts.#71

Add sentence tokenization to process longer texts.#71
askonivala wants to merge 1 commit into
kamalkraj:devfrom
askonivala:dev

askonivala commented Jan 24, 2020

Uh oh!

tanmayag78 commented Mar 8, 2020

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

askonivala commented Jan 24, 2020

Uh oh!

tanmayag78 commented Mar 8, 2020

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants