software_normalized retains case variants (e.g. MATLAB / Matlab) #45

@kierangivens

The model extraction appears to produce multiple different software_normalized names that all refer to the same software. For example, in the 5% dataset, I found several case variants of MATLAB, such as MATLAB and Matlab.

[Image: screenshot of the MATLAB case variants]

These all clearly refer to the same software. A quick fix could be lower-casing the software_normalized values during extraction, which would collapse many of these variants into a single name and make downstream grouping/filtering more reliable.
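A minimal sketch of the lower-casing idea, in case it helps the discussion. The function names and sample values below are hypothetical and are not taken from the extraction code; the whitespace stripping is an extra assumption beyond what is proposed above.

```python
from collections import Counter


def normalize_software_name(name: str) -> str:
    """Lower-case a software_normalized value so case variants share one key.

    Stripping surrounding whitespace is an added assumption, not part of
    the original proposal.
    """
    return name.strip().lower()


def collapse_variants(names):
    """Count how many raw values fall under each lower-cased key."""
    return Counter(normalize_software_name(n) for n in names)


# Hypothetical sample values, not real rows from the 5% dataset.
sample = ["MATLAB", "Matlab", "matlab", "R", "Python", "python"]
counts = collapse_variants(sample)
print(counts)
```

One trade-off: the collapsed key is all lower case ("matlab"), so a separate display name would be needed if the conventional spelling matters downstream.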
