Maybe my first attempt at figuring out como was too broad, didn't get any engagement at any rate, so let's try a smaller version. What to do with both X and Y, tanto X como Y, and similar constructions?
both X and Y in English GUM/EWT is tagged CCONJ/CCONJ
tanto / quanto in it_isdt: ADV / ADV
tanto / como in pt_bosque: ADV / ADV
atât / cât in ro_simonero: ADV / ADV
tant / com in ca_ancora: ADV / SCONJ
which brings us to Spanish tanto/como, naturally not consistent between AnCora and GSD
in es_ancora: ADV / SCONJ is typical
in es_gsd: PRON/CCONJ, ADV/CCONJ, ADV/ADV, ADV/ADP, PRON/ADP
out of these Romance languages, seems ADV/SCONJ is unique to the AnCora datasets, and SCONJ doesn't seem right since Y is often a noun. So it might be better to switch the two AnCora datasets and GSD to ADV/ADV. Still, I'd be okay with using ADV/SCONJ in ES just to make the general annotation of como easier.
Maybe my first attempt at figuring out
comowas too broad, didn't get any engagement at any rate, so let's try a smaller version. What to do with both X and Y, tanto X como Y, and similar constructions?both X and Yin English GUM/EWT is tagged CCONJ/CCONJtanto / quantoin it_isdt: ADV / ADVtanto / comoin pt_bosque: ADV / ADVatât / câtin ro_simonero: ADV / ADVtant / comin ca_ancora: ADV / SCONJwhich brings us to Spanish tanto/como, naturally not consistent between AnCora and GSD
in es_ancora: ADV / SCONJ is typical
in es_gsd: PRON/CCONJ, ADV/CCONJ, ADV/ADV, ADV/ADP, PRON/ADP
out of these Romance languages, seems ADV/SCONJ is unique to the AnCora datasets, and SCONJ doesn't seem right since Y is often a noun. So it might be better to switch the two AnCora datasets and GSD to ADV/ADV. Still, I'd be okay with using ADV/SCONJ in ES just to make the general annotation of
comoeasier.