Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Expansion of source metadata fields #25

Open
sadiekelly opened this issue Dec 5, 2023 · 3 comments
Open

Expansion of source metadata fields #25

sadiekelly opened this issue Dec 5, 2023 · 3 comments
Labels
help wanted Extra attention is needed P1 Priority: high

Comments

@sadiekelly
Copy link
Contributor

Currently the url for the source document is captured. The information obtained from the source for the case record can be buried within the document, perhaps requiring translation. In addition there can be more than one source per case and curator decisions made upon which part of which source document should be used in the line list.
It might be useful to somehow provide an audit trail for this, to show where the information came from in the source and how the line list was created to help others understand the data. Perhaps something for discussion with the curation team.

@sadiekelly
Copy link
Contributor Author

Most important item to capture is whether the source is official or not official.
Do not want to create additional work for curators.
Can ensure that multiple sources are linked within the final dataset and the originator of the data is cited.
Curation process and how discrepancies are dealt with as a general procedure could be made available.

@sadiekelly
Copy link
Contributor Author

Additional source level fields as defined in sheet https://docs.google.com/spreadsheets/d/1WQXuqeRbOctijcQUGqEoFucumwDdRIxCJRBd188Oqks/edit?gid=911832469#gid=911832469
Capture of whether source is official (for each source), where the data has originated from and the country of report origin.

@abhidg abhidg added the help wanted Extra attention is needed label Aug 28, 2024
@sadiekelly sadiekelly added the P1 Priority: high label Aug 30, 2024
@sadiekelly
Copy link
Contributor Author

Recommendation: addition of source level variables to capture data originator, country of origin, official report. Could also include type of report (aggregate, map, individual level, situation report) to give context to the curation effort required and therefore the limitations and assumptions made. Original language of report. Detail of curation process in general would be good to share to more widely understand how the data are curated.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
help wanted Extra attention is needed P1 Priority: high
Projects
None yet
Development

No branches or pull requests

2 participants