Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Indeterminate date produces odd output in DCAT #254

Closed
torrin47 opened this issue Nov 29, 2016 · 6 comments
Closed

Indeterminate date produces odd output in DCAT #254

torrin47 opened this issue Nov 29, 2016 · 6 comments
Assignees
Milestone

Comments

@torrin47
Copy link

We implemented 1.2.7 in part to address issue #224 and are now seeing great results with determinate dates in our dcat output:
https://edg.epa.gov/metadata/rest/find/document?f=dcat&searchText=timeperiod.meta%3Aisdeterminate

and unknown and invalid dates are appropriately being suppressed. But we have one example of an indeterminate date (2004-present) which is not being handled appropriately: "2004-01-01/292269055-12-02"
https://edg.epa.gov/metadata/rest/find/document?f=dcat&searchText=timeperiod.meta%3Aisindeterminate

Thoughts on how to address this?

@pandzel-zz
Copy link

The "292269055-12-02" value is a result of translation "present" into some value storeable in the Lucene index. Perhaps this trick is good for searching for metadata.

The real issue is how to express "present" in DCAT terms, or to be more accurate, how to express "present" as ISO 8601 date (292269055-12-02 clearly is not a valid ISO date). It turn's out that there are no means to do that, so I decoded to discard any temporal with "present" as one of it's interval value. The code is in the github already.

@torrin47
Copy link
Author

Definitely an amusing translation, and thanks for the quick fix, discarding it should be fine, but how's this for consideration? If the value is legitimately "Present" (and we assume the data actually is being updated in close to real time) couldn't we translate "Present" into a datestamp of "Now" equivalent to the instant the DCAT file is generated?

@mhogeweg
Copy link
Member

mhogeweg commented Dec 2, 2016

I think that would not be the right thing to do. remember that Data.gov (or other sites) harvest the metadata and they would then see an arbitrary date in the metadata. leaving this blank appears better.

@pandzel-zz
Copy link

To my best understanding this is all a lack of a precise definition of the standard. Instead of leaving this blank as Marten suggested, I would rather talk to DCAT people about the issue and perhaps they may come with a better notation next time. Beside, a similar or related issue has already been discussed here:
project-open-data/project-open-data.github.io#415

@torrin47
Copy link
Author

I agree their proposed solution is awkward - repeating interval as time period? I prefer the first solution they mentioned in passing...

What you can do with 8601 is specify an ambiguous end date (eg 2005-05-30/2016 where you just know the end date is in 2016) or you can use repeating intervals.

@mhogeweg, does that offend your sensibilities less than the specific harvest date?

@mhogeweg
Copy link
Member

mhogeweg commented Mar 2, 2017

torrin will submit a pull request with the EPA modification. we will then include that as a configurable choice for how to deal with 'present' date values

@zguo zguo added this to the 1.2.8 milestone May 24, 2017
pandzel-zz pushed a commit that referenced this issue May 31, 2017
Indeterminate date produces odd output in DCAT #254
@zguo zguo closed this as completed Jun 13, 2017
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants