Fix ambiguous features on Swedish verbs #6

andrewtavis · 2022-01-03T20:17:52Z

Terms

I have searched open and closed data issues
I agree to follow Scribe-Data's Code of Conduct

Languages

Swedish

Description

Many Swedish verbs have ambiguous features that don't allow their conjugations to be properly classified. Specifically, there are doubles of many feature sets, as can be seen on the Wikidata page for the verb överge. These duplicates should be distinguished, and the formatting script for Swedish verbs should be updated, as it is now written to remove any verb that has a duplicate value caused by ambiguous features.

Ainali · 2022-08-12T17:17:21Z

Just a quick note that the formatting script for Swedish verbs has moved to src/scribe_data/extract_transform/Swedish/verbs/format_verbs.py.

andrewtavis · 2022-08-12T17:18:41Z

Updated, @Ainali! Thank you 🙏

Ainali · 2022-08-12T18:30:19Z

I'll be using this query to clean up most of the data errors. Around 80-85% of the results there should be split into two separate lexemes. The rest are cases where there really are two acceptable forms. However, in several of these, one of the forms is not modern and should be marked as such. The query should probably check for language style (P6191) and filter some values.

andrewtavis · 2022-08-12T18:43:59Z

This is so epic, @Ainali 😊 Thanks so much! Would be happy to talk with you after the hackathon about what changes need to happen to the query. After a check in I can try to make the changes, or we can do a quick call to talk over what needs to change. Whatever works best for you :)

Really happy to have this issue getting some love!

Ainali · 2022-08-17T20:30:49Z

I have now split all the ones that needed to be split into different lexemes. The ones that are left (21 in the query above) are probably mostly synonyms, but I have asked around to see if there is something grammatical that could be added to them to highlight any eventual nuance between them.

andrewtavis · 2022-08-17T20:58:54Z

I was thinking about messaging you about this 😊 Really thanks so much for your efforts!

Do I need to change anything in the query, or can I just run the normal update process? We still have some minor bug fixes for autocomplete and will add in a basic autosuggest prior to the next release, but we should have it out by say the end of next week :)

Ainali · 2022-08-19T08:26:40Z

For now, it will just be an improvement if you run the normal update process. But I think we should keep the issue open to figure out the last remaining part.

andrewtavis · 2022-08-19T10:19:05Z

Sounds great, thanks @Ainali :)

andrewtavis added good first issue Good for newcomers help wanted Extra attention is needed data Relates to data or Wikidata labels Jan 3, 2022

andrewtavis mentioned this issue Feb 12, 2022

Add Danish keyboard scribe-org/Scribe-iOS#133

Open

11 tasks

andrewtavis transferred this issue from scribe-org/Scribe-iOS Mar 29, 2022

andrewtavis mentioned this issue Apr 13, 2023

Auxiliary verbs for German perfect conjugations #10

Open

2 tasks

andrewtavis removed the good first issue Good for newcomers label Apr 13, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix ambiguous features on Swedish verbs #6

Fix ambiguous features on Swedish verbs #6

andrewtavis commented Jan 3, 2022 •

edited

Loading

Ainali commented Aug 12, 2022

andrewtavis commented Aug 12, 2022

Ainali commented Aug 12, 2022

andrewtavis commented Aug 12, 2022

Ainali commented Aug 17, 2022

andrewtavis commented Aug 17, 2022

Ainali commented Aug 19, 2022

andrewtavis commented Aug 19, 2022

Fix ambiguous features on Swedish verbs #6

Fix ambiguous features on Swedish verbs #6

Comments

andrewtavis commented Jan 3, 2022 • edited Loading

Terms

Languages

Description

Ainali commented Aug 12, 2022

andrewtavis commented Aug 12, 2022

Ainali commented Aug 12, 2022

andrewtavis commented Aug 12, 2022

Ainali commented Aug 17, 2022

andrewtavis commented Aug 17, 2022

Ainali commented Aug 19, 2022

andrewtavis commented Aug 19, 2022

andrewtavis commented Jan 3, 2022 •

edited

Loading