-
Notifications
You must be signed in to change notification settings - Fork 80
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Missing form_of keys for some senses #514
Comments
It's because of this formatting. |
I've committed a kludge (inserting it into a list of kludges) that specifically handles "inflection of" followed by a sublist of entries like these. If you could check the data again later to see how much is improved, it would be appreciated. |
I'll do it with the next released dump. 👍 Thanks a lot for the work! |
I've found quite a few of these, I'm specifically working with the Spanish dataset. These seem to happen with a handful of specific conjugations, and with some verbs with less common morphology (specifically reflexive verbs whose lemmas are recorded under the infinitive + reflexive pronoun, rather than as a sense under the infinitive generically). I found 18534 entries with |
Kaikki has finally updated, and it seems like This doesn't mean everything is now fixed, of course, but it will help to find the next issue. |
What does the situation look like, currently? |
The external hard drive I ran the calculations on seems to be dying, I will have to find another way to repeat it 😅. |
A few senses are missing the form_of key.
I made a heuristic check and searched for glosses containing the word "singular", but where there was no form_of or alt_of key in the sense:
The results are as follows:
A few are false positives, but for languages like Italian and French it is a somewhat widespread occurance.
Example words:
Italian: ami, tuba, impala, copula, replica, musica, pesca.
French: cube, abuse, azure, update, love
Latin: multi, maximum, visa, gemini
Russian: используешь, удачи, малого, змея
Thanks a lot for all the recent commits!
The text was updated successfully, but these errors were encountered: