The World Bank’s Climate Change Knowledge Portal (CCKP) provides open access to a comprehensive suite of climate and climate change resources derived from the latest generation of climate data archives. Products are based on a consistent and transparent approach with a systematic way of pre-processing the raw observed and model-based projection data to enable inter-comparable use across a broad range of applications. Climate products consist of basic climate variables as well as a large collection (70+) of more specialized, application-orientated variables and indices across different scenarios. Precomputed data can be extracted per specified variables, select timeframes, climate projection scenarios, as multi-model ensembles or by individual models. CCKP adheres to data distribution standards defined under the Coupled Model Intercomparison Project (CMIP) and its contributions to the Intergovernmental Panel on Climate Change (IPCC) Assessment Reports and latest scientific methodologies identified by the World Meteorological Organization and climate science community.
Access the World Bank Climate Change Portal: https://climateknowledgeportal.worldbank.org
This data archive consists of global gridded NetCDF files that represent ~30 models, 70+ variables, 5 SSPs, multiple climatology periods, 3 temporal aggregations (annual, seasonal, monthly), plus additional statistics for specific variable presentations.
Climate products are global and are available for the following collections:
cmip6-x0.25: CMIP6 global downscaled, bias corrected 0.25-degree – 1950-2100;
era5-x0.25: EAR5 global 0.25-degree – 1950-2022;
cru-x0.5: CRU global 0.50-degree – 1901-2022;
pop-x0.25: Population global 0.25-degree – 1995-2100 (Gridded Population of the World Version 4).
You can access these data via the link, <aws s3 ls –no-sign-request s3://wbg-cckp/>
and using the web interface via the AWS console. To get a list for all variables and climate indicators available for the CMIP6 data at 0.25x0.25 degree resolution, this sub-directory will list them (using the --no-sign-request flag since this bucket is public):
The collections of WB-CCKP raster data stored under the root AWS S3 bucket <aws s3 ls –no-sign-request s3://wbg-cckp/>
. The different collections, their variables and datasets with individual raster products (netCDF files) are organized by a hierarchy of facets that follow this structure:
Example file: (with ‘..’ representing the root of the CCKP S3 bucket):
../Sub-Level 1/Sub-Level 2/Sub-Level 3/Sub-Level 4 (data)
../collection/variable/Model-scenario/filename.nc
with filename.nc being:
product-variable-aggregationperiod-statistic_collection_modelwithscenario_category_percentile_timeperiod.nc
Under the CCKP root, the first level is the level of “Collection” (cmip6-x0.25, era5, cru, …). This represents the fundamental dataset collection. Inside each of these collections, the same structure is offered, albeit with potentially different entries.
The first sub-level is the level of the variable. These are basic climate variables (tas, pr, …) as well as climate indicators derived from basic variables (txx, rx1day, cdd, hi35, …).
Below each of these “variables” (or indicators), there is the “Dataset”, which represents the source itself. This facet is more granular than the collection, because at this level the different versions and individual simulations are being distinguished. For example, the model access-cm2 with model simulation variant r1i1p1f1 for the simulation covering the historical period is an exactly defined, single simulation (one model run). This allows to distinguish this dataset from an entry of “access-cm2-r2i1p1f1-historical” (note difference in variant label r2 vs r1). For observational data with evolving versions, the dataset level distinguishes these entries: cru-ts4.07-historical is different from cru-ts4.07-historical.
The final level stores the actual data files where the content is being distinguished in the file naming convention.:
The file names are designed to be self-explanatory, meaning they can be copied somewhere and still retain the full information of the collection, variable and dataset source in addition to the actual content. The facets in the file name are as follows:
../cmip6-x0.25/tas/access-cm2-r1i1p1f1-historical/
climatology-tas-annual-mean_cmip6-x0.25_access-cm2-r1i1p1f1-historical_climatology_mean_1995-2014.nc
Collection: {cmip6-x0.25,cru-x0.5,era5-x0.25,pop-x1,…}
Variable: {tas,tasmax,tasmin,txx,tnn,…}
Datasetwithscenario: {ensemble-all-historical, access-cm2-r1i1p1f1-historical…}
ProductType: {timeseries,climatology,heatplot}
Product: {timeseries,climatology,anomaly,trend,trendsignificance,…}
AggregationPeriod: {annual,seasonal,monthly}
Statistic: {mean,min,max}
TimePeriod: {1950-2014,2015-2100,2020-2039,….}
Percentile: {mean,median,p10,p90,…}
Note, the “scenario”, which is a sub-level of the Dataset, is also a separate facet that can be searched, so that a specific projection scenario can be selected (historical, ssp126, ssp245, …).
Collection: {cmip6-x0.25,cru-x0.5,era5-x0.25,pop-x1,…}
Variable: {tas,tasmax,tasmin,txx,tnn,…}
Product: {timeseries,climatology,anomaly,trend,…}
Scenario: {historical,ssp119,ssp126,ssp245,ssp370,ssp585}
Aggregation: {annual,seasonal,monthly
Model including scenario {ensemble-all-historical,access-cm2-r1i1p1f1-ssp126}
TimePeriod: {1950-2014,2015-2100,2020-2039,….}
Percentile: {mean,median,p10,p90}
ProductType: {timeseries,climatology,heatplot}
Statistic: {mean,min,max}
Collection: cmip6-x0.25
Variable: tas
Product: trend
Scenario: ssp585
Aggregation: annual
Model: ensemble-all
TimePeriod: 1971-2020
Percentile: mean
Product Type: climatology
Statistic: mean
A detailed documentation of the data content can be found in the CCKP Meta Data Document at: https://climateknowledgeportal.worldbank.org/media/document/metatag.pdf
Contributions are welcome! If you'd like to contribute, please follow our contribution guidelines.
This documentation is licensed under the Mozilla Public License.