-
Notifications
You must be signed in to change notification settings - Fork 585
Further accrualPeriodicity values #292
Comments
wouldn't it make sense to just report the expected time interval between updates, with a default unit of days? |
I've got one that's updated every five years. |
@smrazgs I appreciate the desire to move away from textual interval descriptions, otherwise the list could grow exhaustively long (our folks are asking for quadrennial). Using days to specify the interval, however, might imply an inappropriate level of precision. The guidance for the "modified" field already addresses this: If there is a need to reflect that the dataset is continually updated, ISO 8601 formatting can account for this by giving the duration. For instance, P1D for daily, P2W for every two weeks, and PT5M for every five minutes. Why not utilize this standard here as well? It's at least as concise and efficient, if not quite as readable as the text descriptions. |
I like the idea of using the standard, but the duration part of 8601 seems to be less widely used (at least FWIW I was unaware of that part). It certainly provides a way to specify the necessary information. |
A few thoughts:
Does anyone know of alternate standards for this field? |
At Department of Defense, we need Quadrennial added to the list of values. |
Thanks, Bill. We'll hopefully wrap this issue up soon. @waldoj, @JoshData, @dsmorgan77, @benbalter, @philipashlock, @mhogeweg - Any thoughts on my question about alternate standards? If there's not another good answer, it would seem that possibly the move is to say that the field must be a value from DCCDAccrualPeriodicity or one of these other handful of options that we list. |
I do not know of alternate standards, and I've been looking for them, since I need them for another project. For my other project, I've tentatively come to the conclusion that it's necessary to go outside of the standard, basically by doing what you're describing here, allowing select additional values. |
DCCDAccrualPeriodicity appears to be a proposed term that has been a working draft in the 'proposed' state for 10 years... the challenge with using a fixed domain for values that vary greatly. ISO 8601 has a nice definition of time interval that is simple to parse, human readable, and allows all flexibility needed. You could limit variation by further specifying that time intervals be specified using the duration method: P3Y1M4DT1H59M26.5S. |
👍 to the idea of using the ISO 8601 time interval to represent this. Using the time interval offers the most flexibility and allows for any agency that needs a particular permutation to just go ahead and express it. DoD wants quadrennial? That's P4Y. I want every 10 minutes? That's P10M. Speaking of that last example - the ISO 8601 time interval also offers more precision than DCCDAccrualPeriodicity. In the current form, anything that's updated more frequently than daily is shown as "continuously updated." |
👍 ISO8601 |
1 similar comment
👍 ISO8601 |
I vote for ISO8601 |
There seems to be a strong support for migrating to ISO 8601. Some ideas to facilitate this transition would be:
|
Some concerns include:
|
Does 8601 allow for a way to indicate that the interval is irregular? I do not see that it does. Do many records use that option or should? |
We have several datasets that are updated irregularly, so it would be helpful to find a standard that could supports this. |
I'm in favor of ISO8601, but one limitation worth noting: As far as I know ISO 8601 doesn't allow you to specify intervals without end dates unless you specify a repeating interval. I've come across two attempts to address this. One is the Dublin Core Collection Description : Open Date Range Format (DCCD ODRF) which states, "This representation of an open date range is not compatible with the representation of a time-interval defined by ISO8601:2000." and the other is the LOC Extended Date/Time Format (EDTF). What you can do with 8601 is specify an ambiguous end date (eg 2005-05-30/2015 where you just know the end date is in 2015) or if it's a dataset representing data with repeating intervals you can specify that without an enddate, as noted on Project Open Data (http://project-open-data.github.io/schema/#temporal): If there is a need to reflect that the dataset is continually updated, ISO 8601 formatting can account for this with repeating intervals. For instance, updated monthly starting in January 2010 and continuing through the present would be represented as: R/2010-01/P1M |
Thanks for the clarification @philipashlock. I misunderstood ISO8601. I thought it was a unit of time format. But rather it is a date and time format, i.e., a specific date and time. Another option would be to represent units/quantities of time. ISO 80000-3:2006, similar-ish to @smrazgs suggestion. From a practical standpoint, filling in the unit of time is easier without a specific begin/end date, particularly for older surveys, or data collection systems. ISO 80000-3:2006 would also allow for an easier find/replace conversion of the current metadata field. |
There's a been a good deal of digging into this further and here's an update: There's not an answer as to what is required for this field. It would be allowable to specify ISO-8601 as the norm though it'd actually be important to note that the figure must be a frequency, in the form of a duration. It other words, there's some ISO-8601-structured records that would not be appropriate for this, for example, including a start date with the recurring time period. Instead, it should hold closely to the examples shown below.
Notes: This structure would allow for any other needs that are not addressed by the above (e.g. decennial = R/P10Y). On the issue of converting 'completely irregular', it would seem to be allowable to specify that the only alternative to ISO-8601 for this field would be Update: @philipashlock just pointed out that the table should be more along the lines of R/P1Y instead of P1Y. I'm updating the above to that effect. |
Seeking to address #292. Note that this is accompanied by an appendix that is [still under construction](https://github.com/project-open-data/project-open-data.github.io/blob/metadata-v-1.1/iso8601_guidance.md#accrualperiodicity).
This is addressed in this commit - 582fe16 |
Changes that still need to be addressed are changes in structure and should we add usage notes additions here or no?: * Adds optional describedByType field at the dataset and distribution level (#291, #332) * Changes contactPoint field to an object that contains the name (fn) and email address (hasEmail) (#358) * Adds fn field as part of contactPoint replacing earlier use of contactPoint (#358) * Changes publisher field to an object that allows multiple levels of organizations (#296) * Changes accessURL field to represent indirect access and to exist only within distribution (#217, #335) * Changes format field to a human readable description and to exist only within distribution (#272, #293) * Adds optional description field for use within distribution (#248) * Adds optional title field for use within distribution (#248) * Changes accrualPeriodicity field to use ISO 8601 date syntax (#292) * Changes distribution field to become required-if-applicable and to always contain the accessURL or downloadURL fields (#217) * Changes license field to be a URL (#196)
Was it finalized that Project Open Data is extending ISO-8601 vocabulary to include "irregular"? |
@bbrotsos - yes, that is what is currently on track. You can see the proposed language on the staging instance: |
Thank you for driving the conversation around this issue and helping to assemble the v1.1 metadata update. There appears to be strong consensus around this issue, which has been accepted in the v1.1 update and merged into Project Open Data. Project Open Data is a living project though. Please continue any conversations around how the schema can be improved with new issues and pull requests! It's important for government staff as well as the public to continue to collaborate to make the Open Data Policy ever better. Though the v1.1 update is a substantial update, future iterations do not have to be, so whatever your ideas - big or small - please continue to work with this community to improve how government manages and opens its data. |
Moving from GSA/enterprise-data-inventory#92 - cc @dinali @regina-avila
What other values besides decennial are needed by agencies?
The text was updated successfully, but these errors were encountered: