Skip to content

tanto/como, both/and, etc #1248

@AngledLuffa

Description

@AngledLuffa

Maybe my first attempt at figuring out como was too broad, didn't get any engagement at any rate, so let's try a smaller version. What to do with both X and Y, tanto X como Y, and similar constructions?

both X and Y in English GUM/EWT is tagged CCONJ/CCONJ

tanto / quanto in it_isdt: ADV / ADV

tanto / como in pt_bosque: ADV / ADV

atât / cât in ro_simonero: ADV / ADV

tant / com in ca_ancora: ADV / SCONJ

which brings us to Spanish tanto/como, naturally not consistent between AnCora and GSD

in es_ancora: ADV / SCONJ is typical

in es_gsd: PRON/CCONJ, ADV/CCONJ, ADV/ADV, ADV/ADP, PRON/ADP

out of these Romance languages, seems ADV/SCONJ is unique to the AnCora datasets, and SCONJ doesn't seem right since Y is often a noun. So it might be better to switch the two AnCora datasets and GSD to ADV/ADV. Still, I'd be okay with using ADV/SCONJ in ES just to make the general annotation of como easier.

Metadata

Metadata

Assignees

No one assigned

    Labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions