Skip to content

Assignment 1.3 - Rare words #82

@daviddao

Description

@daviddao

In Assignment 1.3 it is written: "This will load the data in a bag-of-words representation where rare words (occurring less than 5 times in the training data) are removed". However, when I sum the word occurrences using the provided training dataset with

scr = srs.SentimentCorpus("books")

I get words, which doesn't appear at all (occurring less than 5 times)

>> scr.train_X.sum(0)
[..., 0.0, ...]

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions