lowercase infinitive not working in german #47

Open
nnmrts opened this issue Feb 24, 2021 · 7 comments

Comments

@nnmrts

nnmrts commented Feb 24, 2021

I have absolutely no idea how the .dic format works; all I can say is that practically every German infinitive is marked as misspelt when using this dictionary in retext-spell.

For example, "markieren" is a normal word and has several entries in the dictionary, but somehow just "markieren" doesn't work, while "Markieren", "markierend", "markierens" or even "-markieren" work. What is the issue here?

"würde" is another case, even though it isn't an infinitive. "würde" is a verb meaning "would", while "Würde" (which works) is a noun meaning "dignity". It seems like every infinitive you could also write capitalized (so basically any German verb) is rejected when written in lowercase.

@wooorm
Owner

wooorm commented Feb 25, 2021

Hmm, might be nspell related, as it’s not 100% compatible with hunspell.
It might be worth checking whether the dictionary works with hunspell. I would appreciate it a lot if you could check that.

Looking at the .aff file here, it might be duplicated. Those lines are typically at the top, not halfway into a document.

@nnmrts
Author

nnmrts commented Feb 25, 2021

C++ compiling and all that makefile spam is such a joke, but I guess that's how things are done there...

What I did after installing hunspell:

wget -O de.dic https://raw.githubusercontent.com/wooorm/dictionaries/main/dictionaries/de/index.dic
wget -O de.aff https://raw.githubusercontent.com/wooorm/dictionaries/main/dictionaries/de/index.aff

and then testing it:

hunspell -d de
Hunspell 1.7.0
markieren
*

markierend
+ markieren

Markieren
*

markierens
& markierens 8 0: Markierens, markierend, markieren, -markierens, markiere, maskieren, marinieren, massieren

As you can see, "markieren" is correct here. "markierens" is not, although retext-spell accepts it; in hunspell only the uppercase "Markierens" is correct.

It really seems like nspell has some issues with case.
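
For reference, the case handling that hunspell appears to apply in the session above can be modelled roughly as follows. This is a simplified sketch under stated assumptions, not hunspell's or nspell's actual code: the `accepts` helper and the tiny `entries` word list are hypothetical, and affix rules and flags such as KEEPCASE are ignored.

```javascript
// Toy model of hunspell-style case handling (assumption: simplified,
// no affix rules, no KEEPCASE or similar flags).
// `entries` is a hypothetical stand-in for the .dic word list.
const entries = new Set(['markieren', 'würde', 'Würde', 'Markierens'])

function accepts(word) {
  // An exact, case-sensitive match is always accepted.
  if (entries.has(word)) return true

  const lower = word.toLowerCase()
  const capitalized = lower.charAt(0).toUpperCase() + lower.slice(1)

  // ALL CAPS input may match a lowercase or a capitalized entry.
  if (word === word.toUpperCase() && word !== lower) {
    return entries.has(lower) || entries.has(capitalized)
  }

  // Capitalized input (e.g. at the start of a sentence) may match a
  // lowercase entry.
  if (word === capitalized) return entries.has(lower)

  // Lowercase or mixed-case input must match an entry exactly.
  return false
}

const words = ['markieren', 'Markieren', 'markierens', 'Markierens', 'würde', 'Würde']
const results = Object.fromEntries(words.map((word) => [word, accepts(word)]))
console.log(results)
```

Under this model a lowercase entry covers its capitalized form, but a capitalized entry never covers the lowercase form, which matches what hunspell reports for "markieren" and "markierens" above.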

@nnmrts
Author

nnmrts commented Feb 25, 2021

Testing with nspell here:
https://runkit.com/nnmrts/nspell-de-test

yields this:

{
	"markieren": [
		false,
		[
			"Markieren",
			"markierend",
			"markierens",
			"-markieren",
			"markiere",
			"markierten",
			"maskieren"
		]
	],
	"markierend": [
		true,
		[]
	],
	"Markieren": [
		true,
		[]
	],
	"markierens": [
		true,
		[]
	]
}
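
The two sets of results can be compared mechanically. The sketch below hard-codes the verdicts reported in this thread (the Hunspell 1.7.0 session and the RunKit output) and lists the words on which the two checkers disagree; the `disagreements` helper is a hypothetical name used only for illustration.

```javascript
// Verdicts copied from this thread; `true` means "accepted as correct".
const hunspell = {markieren: true, markierend: true, Markieren: true, markierens: false}
const nspell = {markieren: false, markierend: true, Markieren: true, markierens: true}

// Hypothetical helper: return the words whose verdicts differ.
function disagreements(expected, actual) {
  return Object.keys(expected).filter((word) => expected[word] !== actual[word])
}

const differing = disagreements(hunspell, nspell)
console.log(differing)
```

Only the two lowercase forms, "markieren" and "markierens", come out differently, which is consistent with a case-handling problem rather than a broken dictionary.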

@wooorm wooorm transferred this issue from wooorm/dictionaries Feb 25, 2021
@wooorm
Owner

wooorm commented Feb 25, 2021

Thanks, transferred to nspell.

I don’t have the time to prioritize work on nspell currently, so unfortunately this will take a while.

If you’re fine with native dependencies, nodehun might be a good alternative for spell checking in Node!

@nnmrts
Author

nnmrts commented Feb 26, 2021

> If you’re fine with native dependencies, nodehun might be a good alternative for spell checking in Node!

Is there a way to plug nodehun into retext-spell?

@wooorm
Owner

wooorm commented Feb 26, 2021

Nope. Perhaps retext-spell could be changed to allow it, though.

@Joelius300

Any updates on this?
