Integrate sr Wikidata into Unicode Inflection #50

grhoten · 2025-01-21T21:49:58Z

The revised dictionary-parser can parse Wikidata, but some issues need to be resolved.

The initial issues include:

The data is sparse and scarce. Consider contributing a few more words to Wikidata.
No tests are available to test this data.

Tool output that needs to be addressed:

Line 516117: Q2006180 is not a known part of speech grammeme for L6310(онa)

Here is the current generated lexical dictionary files to debug the test failures.

The text was updated successfully, but these errors were encountered:

grhoten · 2025-01-21T21:56:05Z

If you need inspiration for a list of words, consider this list.

nciric · 2025-01-24T19:47:12Z

There are only 22 nouns in Serbian. See this query. We'll need to contribute more before building the lexicon/rules.

nciric · 2025-01-28T03:46:23Z

Denny, here's the ~150 nouns that should go into WIkidata, it would be great if you could do a bulk upload.

grhoten assigned nciric Jan 21, 2025

nciric added this to the 0.1 milestone Jan 21, 2025

Provide feedback