[Deleted] Explore formatting data with SQLite rather than Python directly #47
Labels
feature
New feature or request
help wanted
Extra attention is needed
refactor
Refactor code to improve quality
wontfix
This will not be worked on
Terms
Description
One of the major issues with Scribe-Data at time of writing is that we have the formatting for all the language data within relatively large/complex
format_WORD_TYPE.py
scripts. A general thought within the team is that this could be simplified by converting these processes over to use SQLite via sqlite3. Rather than loading in JSON files and formatting them using conditionals in a dictionary structure, the raw JSONs could be loaded as a table with the final output being a conditional selection from this table.This issue could just be the creation of a proof of concept that this cane work, and from there we expand to converting the formatting processes over 🚀
There's also the potential to do this with SPARQL on the Wikidata end, but we already are needing to break up the files because the rate limits are hit, which would only get worse with more complex selections. I'd say that this would be the ideal way of doing this :)
Contribution
Happy to work on this myself or support someone who'd like to contribute! 😊
The text was updated successfully, but these errors were encountered: