The Dendrochronology Program Library (DPL) in Python has its roots in both the original FORTRAN program created by the legendary Richard Holmes and the subsequent R Project package by Andy Bunn, dplR. Our aim is to provide researchers working with tree-ring data the necessary tools in open-source environments, promoting open science practices, enhancing rigor and transparency in dendrochronology, and eventually allowing reproducible research entirely in a single programming language.
The development of dplPy is supported by a grant from the Paleoclimate program of the US National Science Foundation (AGS-2054516) to Andy Bunn, Kevin Anchukaitis, Ed Cook, and Tyson Swetnam.
- dplPy - the Dendrochronology Program Library in Python
Under the hood, dplPy uses numpy
, pandas
, matplotlib
, statsmodels
, scipy
, and csaps
.
dplPy is currently at version v0.1.5
- The project has changing to a new development structure where all development will be on main
and releases and updates to Pypi will be branched to a version number and deployed.
dplPy is now available to install via pip:
pip install dplpy
To ensure you have the latest version of dplPy installed, you can run:
pip install dplpy --upgrade
You can install a conda virtual environment using the environment.yml for the project:
$ conda env create -f environment.yml
You can still still install dplPy firectly from Github if you wish:
1. Clone and change directory to this repository
$ git clone https://github.com/OpenDendro/dplPy.git
$ cd dplPy
2. Create a conda environment through the environment.yml
file. This will ensure all packages required are installed.
$ conda env create -f environment.yml
# if you have mamba installed you could instead do
$ mamba env create -f environment.yml
When prompted for permission to install required packages (with y/n
), select y
.
3. Activate your environment:
$ conda activate dplpy
Your environment should be successfully built.
4. Your python environment should be able to import numpy
, pandas
, matplotlib
, statsmodels
and csaps
.
1. In your VSCode terminal, activate the conda environment with conda activate dplpy3
.
2. Open a Jupyer Notebook (<file>.ipynb
) and select the dplpy3
Kernel when prompted (or from the top right of your screen). This will automatically load the environment we created.
In VSCode:
1. In your VSCode terminal window, activate the conda environment with conda activate dplpy3
.
2. In the same terminal window, start a Jupyter Notebook with jupyter notebook
. Jupyter will then return URLs that you can copy; Copy one of these URLs.
3. Open a Jupyter Notebook (<file>.ipynb
) and from the bottom right of the VSCode screen, click Jupyter Server;
A dropdown menu will open from the top of the screen: select Existing and paste the URL you copied.
4. Jupyter Notebook will now be able to access the environment created.
Import the dplPy tool with
import dplpy as dpl
or alternatively:
import dplpy
This will load the package and its functions.
- Description: reads data from supported file types (
csv
andrwl
) and stores them in a dataframe. - Options:
header
: input files often have a header present; Default isFalse
, useTrue
if input has a header.
- Usage example:
>>> data = dpl.readers("/path/to/file.rwl", header=True)
- Description: generates a summary of each series recorded in
rwl
andcsv
format files - Usage Example:
>>> dpl.summary("/path/to/file.rwl") # or >>> dpl.summary(data)
- Description: generates summary statistics for
rwl
andcsv
format files - Usage Example:
>>> dpl.stats("/path/to/file.rwl") # or >>> dpl.stats(data)
- Description: generates a report about absent rings in the data set
- Usage Example:
>>> dpl.report("/path/to/file.rwl") # or >>> dpl.report(data)
- Description: generates plots of tree ring with data from dataframes. Currently capable of generating
line
(default),spag
(spaghetti) andseg
(segment) plots. - Options:
type="line"
: creates a line plot (default)type="spag"
: creates a spaghetti plottype="seg"
: creates a segment plot
- Usage Example:
>>> dpl.report("/path/to/file.rwl") # or >>> dpl.plot(data) # User is able to select specific series of interests. # In the example below, the user selects SERIES_1, SERIES_2, SERIES_3 # from the "data" dataset and generates a spaghetti plot >>> dpl.plot(data[[SERIES_1, SERIES_2, SERIES_3]], type="spag")
- Description: Detrends a given series or data frame, first by fitting data to curve(s), and then by calculating residuals or differences compared to the original data.
- Options:
fit="spline"
: default detrending method.fit="ModNegEx"
: detrending using negative exponent method.fit="Hugershoff"
: detrending using the Hugenshoff method.fit="linear"
: detrending using the linear method.fit="horizontal"
: detrending using the horizontal method.method="residual"
: calculates residuals vs original data (default).method="difference"
: calculates differences vs original data.Plot
: plotting results default isTrue
, acceptsFalse
.
- Usage Example:
>>> dpl.detrend(data) # or >>> dpl.detrend(data, fit="Hugershoff", method="difference") # User is able to select specific series of interests. >>> dpl.detrend(data[[SERIES_1, SERIES_2, SERIES_3]], fit="Hugershoff", method="difference")
- Description: ontains methods that fit series to autoregressive models and perform functions related to AR modeling.
- Functions:
autoreg(data['Name of series'], max_lag)
: returns parameters of best fit AR model with maxlag of 5 (default) or other specified numberar_func(data['Name of series'], max_lag)
: returns residuals plus mean of best fit from AR models with max lag of either 5 (default) or specified number
- Options:
max_lag
: default 5, can be specified to user's needs.
- Usage Example:
>>> dpl.autoreg(data[SERIES_1]) # or >>> dpl.ar_func(data[SERIES_2], max_lag=7)
- Description: creates a mean value chronology for a dataset, typically the ring width indices of a detrended series. Note: input data has to be detrended first.
- Options:
biweight
: find means using Tukey's biweight robust mean; defaultTrue
.prewhiten
: prewhitens data by fitting to an AR model; defaultFalse
.plot
: plots results; defaultTrue
.
- Usage Example:
# Detrend data first! >>> rwi_data = dpl.detrend(data) # Perform chronology >>> dpl.chron(rwi_data, biweight=False, plot=False)
- Description: evaluate the dating accuracy of a set of tree-ring measurements
- Options: