Skip to content

Latest commit

 

History

History
121 lines (80 loc) · 11.1 KB

README.md

File metadata and controls

121 lines (80 loc) · 11.1 KB

World Bank – Climate Change Knowledge Portal Data Collections on AWS

Overview

The World Bank’s Climate Change Knowledge Portal (CCKP) provides open access to a comprehensive suite of climate and climate change resources derived from the latest generation of climate data archives. Products are based on a consistent and transparent approach with a systematic way of pre-processing the raw observed and model-based projection data to enable inter-comparable use across a broad range of applications. Climate products consist of basic climate variables as well as a large collection (70+) of more specialized, application-orientated variables and indices across different scenarios. Precomputed data can be extracted per specified variables, select timeframes, climate projection scenarios, as multi-model ensembles or by individual models. CCKP adheres to data distribution standards defined under the Coupled Model Intercomparison Project (CMIP) and its contributions to the Intergovernmental Panel on Climate Change (IPCC) Assessment Reports and latest scientific methodologies identified by the World Meteorological Organization and climate science community.

Access the World Bank Climate Change Portal: https://climateknowledgeportal.worldbank.org

This data archive consists of global gridded NetCDF files that represent ~30 models, 70+ variables, 5 SSPs, multiple climatology periods, 3 temporal aggregations (annual, seasonal, monthly), plus additional statistics for specific variable presentations.

Climate products are global and are available for the following collections:

cmip6-x0.25: CMIP6 global downscaled, bias corrected 0.25-degree – 1950-2100;
era5-x0.25: EAR5 global 0.25-degree – 1950-2022;
cru-x0.5: CRU global 0.50-degree – 1901-2022;
pop-x0.25: Population global 0.25-degree – 1995-2100 (Gridded Population of the World Version 4).

Accessing Climate Change Knowledge Portal Data On AWS

You can access these data via the link, <aws s3 ls –no-sign-request s3://wbg-cckp/> and using the web interface via the AWS console. To get a list for all variables and climate indicators available for the CMIP6 data at 0.25x0.25 degree resolution, this sub-directory will list them (using the --no-sign-request flag since this bucket is public):

Climate Change Knowledge Portal File Structure

The collections of WB-CCKP raster data stored under the root AWS S3 bucket <aws s3 ls –no-sign-request s3://wbg-cckp/>. The different collections, their variables and datasets with individual raster products (netCDF files) are organized by a hierarchy of facets that follow this structure:

Example file: (with ‘..’ representing the root of the CCKP S3 bucket):

       ../Sub-Level 1/Sub-Level 2/Sub-Level 3/Sub-Level 4 (data)

       ../collection/variable/Model-scenario/filename.nc
        
with filename.nc being:

       product-variable-aggregationperiod-statistic_collection_modelwithscenario_category_percentile_timeperiod.nc

Sub-Level 1 - Collection

Under the CCKP root, the first level is the level of “Collection” (cmip6-x0.25, era5, cru, …). This represents the fundamental dataset collection. Inside each of these collections, the same structure is offered, albeit with potentially different entries.

Sub-Level 2 – Variable

The first sub-level is the level of the variable. These are basic climate variables (tas, pr, …) as well as climate indicators derived from basic variables (txx, rx1day, cdd, hi35, …).

Sub-Level 3 – Dataset

Below each of these “variables” (or indicators), there is the “Dataset”, which represents the source itself. This facet is more granular than the collection, because at this level the different versions and individual simulations are being distinguished. For example, the model access-cm2 with model simulation variant r1i1p1f1 for the simulation covering the historical period is an exactly defined, single simulation (one model run). This allows to distinguish this dataset from an entry of “access-cm2-r2i1p1f1-historical” (note difference in variant label r2 vs r1). For observational data with evolving versions, the dataset level distinguishes these entries: cru-ts4.07-historical is different from cru-ts4.07-historical.

Sub-Level 4 – Data

The final level stores the actual data files where the content is being distinguished in the file naming convention.:

File Naming Convention

The file names are designed to be self-explanatory, meaning they can be copied somewhere and still retain the full information of the collection, variable and dataset source in addition to the actual content. The facets in the file name are as follows:

       ../cmip6-x0.25/tas/access-cm2-r1i1p1f1-historical/
         climatology-tas-annual-mean_cmip6-x0.25_access-cm2-r1i1p1f1-historical_climatology_mean_1995-2014.nc

Examples at various sub-levels and facets:

Collection: {cmip6-x0.25,cru-x0.5,era5-x0.25,pop-x1,…}
   Variable: {tas,tasmax,tasmin,txx,tnn,…}
      Datasetwithscenario: {ensemble-all-historical, access-cm2-r1i1p1f1-historical…}
         ProductType: {timeseries,climatology,heatplot}
            Product: {timeseries,climatology,anomaly,trend,trendsignificance,…}
               AggregationPeriod: {annual,seasonal,monthly}
                  Statistic: {mean,min,max}
                     TimePeriod: {1950-2014,2015-2100,2020-2039,….}
                        Percentile: {mean,median,p10,p90,…}

Note, the “scenario”, which is a sub-level of the Dataset, is also a separate facet that can be searched, so that a specific projection scenario can be selected (historical, ssp126, ssp245, …).

Priority structure for search:

Collection: {cmip6-x0.25,cru-x0.5,era5-x0.25,pop-x1,…}
   Variable: {tas,tasmax,tasmin,txx,tnn,…}
      Product: {timeseries,climatology,anomaly,trend,…}
         Scenario: {historical,ssp119,ssp126,ssp245,ssp370,ssp585}
            Aggregation: {annual,seasonal,monthly
               Model including scenario {ensemble-all-historical,access-cm2-r1i1p1f1-ssp126}
                  TimePeriod: {1950-2014,2015-2100,2020-2039,….}
                     Percentile: {mean,median,p10,p90}
                        ProductType: {timeseries,climatology,heatplot}
                           Statistic: {mean,min,max}

Single file query example:

Collection: cmip6-x0.25
   Variable: tas
      Product: trend
         Scenario: ssp585
            Aggregation: annual
               Model: ensemble-all
                  TimePeriod: 1971-2020
                     Percentile: mean
                        Product Type: climatology
                           Statistic: mean

Available Selections Per Each Collection

Data Documentation

A detailed documentation of the data content can be found in the CCKP Meta Data Document at: https://climateknowledgeportal.worldbank.org/media/document/metatag.pdf

Additional Resources

Contributing

Contributions are welcome! If you'd like to contribute, please follow our contribution guidelines.

License

This documentation is licensed under the Mozilla Public License.