This looks great, but for the first word I checked (Finnish "saada" = to get), there seem to be a lot of missing forms.
This lemma appears in fin/fin1.tsv and the following 18 forms appear therein:
saada sain fi-conj-saada VERB Voice=Act;Tense=Past;Polarity=Pos;Mood=Ind;Person=1;Number=Sing in sa
saada sait fi-conj-saada VERB Voice=Act;Tense=Past;Polarity=Pos;Mood=Ind;Person=2;Number=Sing it sa
saada sai fi-conj-saada VERB Voice=Act;Tense=Past;Polarity=Pos;Mood=Ind;Person=3;Number=Sing i sa
saada saimme fi-conj-saada VERB Voice=Act;Tense=Past;Polarity=Pos;Mood=Ind;Person=1;Number=Plur imme sa
saada saitte fi-conj-saada VERB Voice=Act;Tense=Past;Polarity=Pos;Mood=Ind;Person=2;Number=Plur itte sa
saada saivat fi-conj-saada VERB Voice=Act;Tense=Past;Polarity=Pos;Mood=Ind;Person=3;Number=Plur ivat sa
saada saisin fi-conj-saada VERB Voice=Act;Tense=Pres;Polarity=Pos;Mood=Cnd;Person=1;Number=Sing isin sa
saada en saisi fi-conj-saada VERB Voice=Act;Tense=Pres;Polarity=Neg;Mood=Cnd;Person=1;Number=Sing en isi sa
saada saisit fi-conj-saada VERB Voice=Act;Tense=Pres;Polarity=Pos;Mood=Cnd;Person=2;Number=Sing isit sa
saada et saisi fi-conj-saada VERB Voice=Act;Tense=Pres;Polarity=Neg;Mood=Cnd;Person=2;Number=Sing et isi sa
saada saisi fi-conj-saada VERB Voice=Act;Tense=Pres;Polarity=Pos;Mood=Cnd;Person=3;Number=Sing isi sa
saada ei saisi fi-conj-saada VERB Voice=Act;Tense=Pres;Polarity=Neg;Mood=Cnd;Person=3;Number=Sing ei isi sa
saada saisimme fi-conj-saada VERB Voice=Act;Tense=Pres;Polarity=Pos;Mood=Cnd;Person=1;Number=Plur isimme sa
saada emme saisi fi-conj-saada VERB Voice=Act;Tense=Pres;Polarity=Neg;Mood=Cnd;Person=1;Number=Plur emme isi sa
saada saisitte fi-conj-saada VERB Voice=Act;Tense=Pres;Polarity=Pos;Mood=Cnd;Person=2;Number=Plur isitte sa
saada ette saisi fi-conj-saada VERB Voice=Act;Tense=Pres;Polarity=Neg;Mood=Cnd;Person=2;Number=Plur ette isi sa
saada saisivat fi-conj-saada VERB Voice=Act;Tense=Pres;Polarity=Pos;Mood=Cnd;Person=3;Number=Plur isivat sa
saada eivät saisi fi-conj-saada VERB Voice=Act;Tense=Pres;Polarity=Neg;Mood=Cnd;Person=3;Number=Plur eivät isi sa
However, manually counting on Wiktionary, there seem to be about 158 forms listed, so this corpus seems to be missing about 89% of forms listed there, including the most basic present tense forms like "saan" (I get), "saat" (you get), and so on.
Am I misunderstanding the data, or is this a bug?
Thanks. :)
This looks great, but for the first word I checked (Finnish "saada" = to get), there seem to be a lot of missing forms.
This lemma appears in
fin/fin1.tsvand the following 18 forms appear therein:However, manually counting on Wiktionary, there seem to be about 158 forms listed, so this corpus seems to be missing about 89% of forms listed there, including the most basic present tense forms like "saan" (I get), "saat" (you get), and so on.
Am I misunderstanding the data, or is this a bug?
Thanks. :)