Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Basic Suggestions for the Data Handling #1

Open
4 tasks
LinguList opened this issue Dec 17, 2024 · 0 comments
Open
4 tasks

Basic Suggestions for the Data Handling #1

LinguList opened this issue Dec 17, 2024 · 0 comments

Comments

@LinguList
Copy link

Hi @verenablaschke, here are some basic suggestions for your repo:

  • make definite versions and update them on a yearly basis (git tag, etc.)
  • provide data in tabular form, ideally with metadata, check csvw for this purpose, there is also a Python library csvw, which is the core of our way to handle data at Leipzig / Passau
  • be aware that you are not allowed to use data by soundcomparisons in any way, the license is pretty clear in this regard, the question is, if it is worth to list resources that cannot be used for research
  • If you list wordlists, such as soundcomparisons, there are many more datasets you do not list, particularly on Dutch dialects, and German dialects, but also major resources such as lexibank that serve as data aggregator and basic repository for standardization
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant