Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Improve searchability and linkability of AURIN datasets #6

Open
asiripanich opened this issue Dec 4, 2020 · 2 comments
Open

Improve searchability and linkability of AURIN datasets #6

asiripanich opened this issue Dec 4, 2020 · 2 comments

Comments

@asiripanich
Copy link
Owner

Use akc to provide a graph visualisation of keywords.

@asiripanich asiripanich mentioned this issue Dec 17, 2021
@asiripanich
Copy link
Owner Author

Would be good if we can get all metadata fields of all datasets then do some magic using text analysis/some heuristic etc to link similar datasets together. The novelty of this feature would be that the user can discover datasets that they didn't know existed or wanted.

@asiripanich
Copy link
Owner Author

Here are metadata fields we can play with.

library(aurin)
wfs_client <- create_aurinapi_wfs_client()
#> ℹ Creating AURIN WFS Client...
#> Loading ISO 19139 XML schemas...
#> Loading ISO 19115 codelists...
#> Loading IANA mime types...
#> No encoding supplied: defaulting to UTF-8.
fts <- wfs_client$getFeatureTypes(pretty = FALSE)

meta <- do.call("rbind", lapply(fts[1:10], function(x) {
    tibble::tibble(
        aurin_open_api_id = x$getName(),
        title = x$getTitle() %>% trimws(),
        keywords = list(x$getKeywords() %>% trimws()),
        abstract = x$getAbstract(),
        bbox = list(x$getBoundingBox()),
        desc = list(x$getDescription(pretty = TRUE))
    )
}))

meta
#> # A tibble: 10 × 6
#>    aurin_open_api_id                      title keywords abstract bbox     desc 
#>    <chr>                                  <chr> <list>   <chr>    <list>   <lis>
#>  1 aurin:datasource-NSW_Govt_DPE-UoM_AUR… Addi… <chr>    "Additi… <dbl[…]> <df> 
#>  2 aurin:datasource-AU_Govt_ABS-UoM_AURI… ABS … <chr>    "This d… <dbl[…]> <df> 
#>  3 aurin:datasource-AU_Govt_ABS-UoM_AURI… ABS … <chr>    "This d… <dbl[…]> <df> 
#>  4 aurin:datasource-AU_Govt_ABS-UoM_AURI… ABS … <chr>    "This d… <dbl[…]> <df> 
#>  5 aurin:datasource-AU_Govt_ABS-UoM_AURI… ABS … <chr>    "This d… <dbl[…]> <df> 
#>  6 aurin:datasource-AU_Govt_ABS-UoM_AURI… ABS … <chr>    "This d… <dbl[…]> <df> 
#>  7 aurin:datasource-AU_Govt_ABS-UoM_AURI… ABS … <chr>    "This d… <dbl[…]> <df> 
#>  8 aurin:datasource-AU_Govt_ABS-UoM_AURI… ABS … <chr>    "This d… <dbl[…]> <df> 
#>  9 aurin:datasource-AU_Govt_ABS-UoM_AURI… ABS … <chr>    "This d… <dbl[…]> <df> 
#> 10 aurin:datasource-AU_Govt_ABS-UoM_AURI… ABS … <chr>    "This d… <dbl[…]> <df>

meta[1, ] %>% unlist(recursive = FALSE)
#> $aurin_open_api_id
#> [1] "aurin:datasource-NSW_Govt_DPE-UoM_AURIN_DB_nsw_srlup_additional_rural_2014"
#> 
#> $title
#> [1] "Additional Rural Village Land 18/01/2014 for NSW"
#> 
#> $keywords
#> [1] "planning"        "village"         "rural"           "NSW"            
#> [5] "New South Wales"
#> 
#> $abstract
#> [1] "Additional Rural Village Land is land zoned R5 large lot residential (or equivalent) that meets defined 'village' criteria. It is proposed that coal seam gas activities will be prohibited in and within two kilometres of these areas. Additional Rural Village relates directly to and adopts the accuracy of its reference dataset, Land and Property Information DCDB. <br/><br/> Copyright attribution: Government of New South Wales - Department of Planning and Environment, (2013): ; accessed from AURIN on 2/9/2022. <br/><br/> Licence type: Creative Commons Attribution 3.0 Australia (CC BY 3.0 AU)"
#> 
#> $bbox
#>   min max
#> x 140 162
#> y -38 -28
#> 
#> $desc
#>            name      type minOccurs maxOccurs nillable
#> 1       ogc_fid   integer         1         1    FALSE
#> 2     sepp_name character         0         1     TRUE
#> 3       sepp_no character         0         1     TRUE
#> 4    amendment_ character         0         1     TRUE
#> 5    amendment1 character         0         1     TRUE
#> 6    feature_co    double         0         1     TRUE
#> 7    feature_de character         0         1     TRUE
#> 8    gazetted_d      Date         0         1     TRUE
#> 9    effective_      Date         0         1     TRUE
#> 10   currency_d      Date         0         1     TRUE
#> 11     comments character         0         1     TRUE
#> 12        urlid character         0         1     TRUE
#> 13   plan_contr character         0         1     TRUE
#> 14   feature__1 character         0         1     TRUE
#> 15   shape_area    double         0         1     TRUE
#> 16    shape_len    double         0         1     TRUE
#> 17 wkb_geometry  geometry         0         1     TRUE

Created on 2022-02-10 by the reprex package (v2.0.1)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant