Skip to content
This repository was archived by the owner on Sep 25, 2025. It is now read-only.
This repository was archived by the owner on Sep 25, 2025. It is now read-only.

BERT finetuning - call center transcripts - named entity recognition of credit card / account number #1408

@prkumar112451

Description

@prkumar112451

I am trying to finetune BERT for named entity recognition based on annotated data of call center transcript.

This is a dummy call between 2 agents -

Prabhat - Hello Neeraj, How are you
Neeraj - I am good, Thanks, How are you
Prabhat - I am good as well
Neeraj - For HiHi Phones, we would like to offer 10% discount
Prabhat- I am interested
Neeraj - Could you share your credit card number
Prabhat - Sure one two four
Neeraj - Yes
Prabhat - five three.. no two.. five and nine
Neeraj - Okay, what's next
Prabhat - six two
Neeraj - six and then two
Prabhat - twenty five
Neeraj - So it is five, two, five
Prabhat - Yes
Neeraj - nine, six, two, two and five.
Prabhat - Yes
Neeraj - Thanks for sharing your credit card
Prabhat - Do you need anything else
Neeraj - Yes, what's your customer Id
Prabhat - It is five, two
Neeraj - Okay
Prabhat - three, nine
Neeraj - sure, Thanks. I will place the order for you

we want to detect the numbers after the word 'credit' as B-CREDIT/I-CREDIT. but the numbers after the word customer id should go under B-CUSTOMER/I-CUSTOMER.

after training a bert uncases model, getting a very random output of numbers.

needed a clarity on 1 detail - named entity recognition happens on entity level, but does BERT keeps context with it (for example - word credit appeared before the number so assign the number to B-CREDIT/I-CREDIT. and if number is occuring after ACCOUNT then number should be detected as B-ACCOUNT/I-ACCOUNT?

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions