-
Notifications
You must be signed in to change notification settings - Fork 1
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Improve searchability and linkability of AURIN datasets #6
Comments
Draft
Would be good if we can get all metadata fields of all datasets then do some magic using text analysis/some heuristic etc to link similar datasets together. The novelty of this feature would be that the user can discover datasets that they didn't know existed or wanted. |
Here are metadata fields we can play with. library(aurin)
wfs_client <- create_aurinapi_wfs_client()
#> ℹ Creating AURIN WFS Client...
#> Loading ISO 19139 XML schemas...
#> Loading ISO 19115 codelists...
#> Loading IANA mime types...
#> No encoding supplied: defaulting to UTF-8.
fts <- wfs_client$getFeatureTypes(pretty = FALSE)
meta <- do.call("rbind", lapply(fts[1:10], function(x) {
tibble::tibble(
aurin_open_api_id = x$getName(),
title = x$getTitle() %>% trimws(),
keywords = list(x$getKeywords() %>% trimws()),
abstract = x$getAbstract(),
bbox = list(x$getBoundingBox()),
desc = list(x$getDescription(pretty = TRUE))
)
}))
meta
#> # A tibble: 10 × 6
#> aurin_open_api_id title keywords abstract bbox desc
#> <chr> <chr> <list> <chr> <list> <lis>
#> 1 aurin:datasource-NSW_Govt_DPE-UoM_AUR… Addi… <chr> "Additi… <dbl[…]> <df>
#> 2 aurin:datasource-AU_Govt_ABS-UoM_AURI… ABS … <chr> "This d… <dbl[…]> <df>
#> 3 aurin:datasource-AU_Govt_ABS-UoM_AURI… ABS … <chr> "This d… <dbl[…]> <df>
#> 4 aurin:datasource-AU_Govt_ABS-UoM_AURI… ABS … <chr> "This d… <dbl[…]> <df>
#> 5 aurin:datasource-AU_Govt_ABS-UoM_AURI… ABS … <chr> "This d… <dbl[…]> <df>
#> 6 aurin:datasource-AU_Govt_ABS-UoM_AURI… ABS … <chr> "This d… <dbl[…]> <df>
#> 7 aurin:datasource-AU_Govt_ABS-UoM_AURI… ABS … <chr> "This d… <dbl[…]> <df>
#> 8 aurin:datasource-AU_Govt_ABS-UoM_AURI… ABS … <chr> "This d… <dbl[…]> <df>
#> 9 aurin:datasource-AU_Govt_ABS-UoM_AURI… ABS … <chr> "This d… <dbl[…]> <df>
#> 10 aurin:datasource-AU_Govt_ABS-UoM_AURI… ABS … <chr> "This d… <dbl[…]> <df>
meta[1, ] %>% unlist(recursive = FALSE)
#> $aurin_open_api_id
#> [1] "aurin:datasource-NSW_Govt_DPE-UoM_AURIN_DB_nsw_srlup_additional_rural_2014"
#>
#> $title
#> [1] "Additional Rural Village Land 18/01/2014 for NSW"
#>
#> $keywords
#> [1] "planning" "village" "rural" "NSW"
#> [5] "New South Wales"
#>
#> $abstract
#> [1] "Additional Rural Village Land is land zoned R5 large lot residential (or equivalent) that meets defined 'village' criteria. It is proposed that coal seam gas activities will be prohibited in and within two kilometres of these areas. Additional Rural Village relates directly to and adopts the accuracy of its reference dataset, Land and Property Information DCDB. <br/><br/> Copyright attribution: Government of New South Wales - Department of Planning and Environment, (2013): ; accessed from AURIN on 2/9/2022. <br/><br/> Licence type: Creative Commons Attribution 3.0 Australia (CC BY 3.0 AU)"
#>
#> $bbox
#> min max
#> x 140 162
#> y -38 -28
#>
#> $desc
#> name type minOccurs maxOccurs nillable
#> 1 ogc_fid integer 1 1 FALSE
#> 2 sepp_name character 0 1 TRUE
#> 3 sepp_no character 0 1 TRUE
#> 4 amendment_ character 0 1 TRUE
#> 5 amendment1 character 0 1 TRUE
#> 6 feature_co double 0 1 TRUE
#> 7 feature_de character 0 1 TRUE
#> 8 gazetted_d Date 0 1 TRUE
#> 9 effective_ Date 0 1 TRUE
#> 10 currency_d Date 0 1 TRUE
#> 11 comments character 0 1 TRUE
#> 12 urlid character 0 1 TRUE
#> 13 plan_contr character 0 1 TRUE
#> 14 feature__1 character 0 1 TRUE
#> 15 shape_area double 0 1 TRUE
#> 16 shape_len double 0 1 TRUE
#> 17 wkb_geometry geometry 0 1 TRUE Created on 2022-02-10 by the reprex package (v2.0.1) |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Use akc to provide a graph visualisation of keywords.
The text was updated successfully, but these errors were encountered: