Cookiecutter for Science Projects

A cookiecutter template for science and data science projects that include data, code, and dissemination.

Optimized for data-based publications
Optimized for use with VS Code
Docker-based, version-controlled environment using VS Code Dev Containers
uv-based environment inside the Dev Container
To add a package, follow the uv workflow: open the VS Code terminal, go to the code folder, and run: uv add pandas
Dev Container Features with Python and LaTeX pre-installed
Set up for Python, but could also be adapted for Julia and R
Make commands for collecting data, generating figures, typesetting LaTeX, cleaning temporary files, and cleaning demo files
VS Code tasks to trigger data collection, plotting, and paper compilation
LaTeX-based paper
Path definitions in the project_package Python module
Kedro-inspired data folder structure
Filled with a demo, which can be removed with "make delete_demo"
Used in at least 5 papers
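
The path definitions mentioned in the list above could look roughly like the following sketch. It is an illustration only, not the template's actual code: the function name `project_paths` and the dictionary keys are hypothetical, and only the folder names are taken from the File Structure section of this README.

```python
from pathlib import Path


def project_paths(root: Path) -> dict[str, Path]:
    """Map short names to project folders.

    Hypothetical helper for illustration; the real project_package
    may define its paths differently.
    """
    data = root / "data"
    dissemination = root / "dissemination"
    return {
        "raw": data / "01_raw",
        "intermediate": data / "02_intermediate",
        "primary": data / "03_primary",
        "figures": dissemination / "figures",
        "papers": dissemination / "papers",
    }


if __name__ == "__main__":
    # Resolve the folders relative to the current working directory.
    for name, path in project_paths(Path(".")).items():
        print(f"{name}: {path}")
```

Defining the folders once and importing them elsewhere keeps notebooks and scripts free of hard-coded relative paths.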

For more detailed information, please see the README of the resulting project.

Quick Start

cookiecutter https://github.com/tgoelles/cookiecutter_science

File Structure

├── Makefile                               #  Automation script for common tasks
├── README.md                              #  Project overview and instructions
├── code                                   #  Python source code and notebooks
│   ├── notebooks                          #  Jupyter notebooks for analysis
│   │   └── exploratory                    #  Exploratory data analysis
│   │       └── 1.0-tg-example.ipynb       #  Example exploratory notebook
│   └── project_package                    #  The project package where refined code goes
│       ├── pyproject.toml                 #  project_package dependencies and configuration
│       └── src                            #  Source code directory
│           └── project_package            #
│               ├── __init__.py            #
│               ├── data                   #  Data processing module and scripts
│               │   ├── __init__.py        #
│               │   ├── config.py          #  Configuration settings
│               │   ├── example.py         #  Example script
│               │   ├── import_data.py     #  Data import functions
│               │   └── make_dataset.py    #  Dataset creation script, used by make data
│               ├── tools                  #  Utility scripts
│               │   ├── __init__.py        #
│               │   └── convert_latex.py   #  LaTeX conversion script
│               └── visualization          #  Visualization module and scripts
│                   ├── __init__.py        #
│                   ├── make_plots.py      #  Plot generation functions
│                   └── visualize.py       #  Data visualization utilities
├── data                                   #
│   ├── 01_raw                             #  Raw data; do not modify the files in here
│   │   └── demo.csv                       #  Example raw data file
│   ├── 02_intermediate                    #  Processed but unrefined data
│   │   └── demo_clean.csv                 #  Example cleaned data file
│   ├── 03_primary                         #  Primary processed datasets
│   ├── 04_feature                         #  Feature-engineered datasets
│   ├── 05_model_input                     #  Data ready for modeling
│   ├── 06_models                          #  Trained models
│   ├── 07_model_output                    #  Model predictions/results
│   └── 08_reporting                       #  Reports and summaries
├── dissemination                          #  Outputs for publication/presentation
│   ├── figures                            #  Figures and plots go in here
│   │   └── demo.png                       #  Example figure
│   ├── papers                             #  LaTeX sources for a paper or thesis
│   │   ├── paper.pdf                      #  Final paper output
│   │   └── paper.tex                      #  LaTeX source for the paper
│   └── presentations                      #  Presentation slides and materials
├── literature                             #  References and related work
│   └── references.bib                     #  Bibliography file
├── pyproject.toml                         #  All project dependencies and tool settings, managed by uv
└── uv.lock                                #  Dependency lock file for reproducibility
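
The convention that 01_raw is only read while derived files go to a later layer can be sketched as follows. This is an illustration under stated assumptions, not the template's make_dataset.py: the function name `clean_rows`, the cleaning rule (dropping incomplete rows), and the use of the standard-library csv module are all chosen here for the example.

```python
import csv
from pathlib import Path


def clean_rows(raw_file: Path, out_file: Path) -> int:
    """Copy complete rows from raw_file to out_file and return how many were written.

    Hypothetical cleaning rule for illustration; the raw file is only read,
    never modified.
    """
    out_file.parent.mkdir(parents=True, exist_ok=True)
    written = 0
    with raw_file.open(newline="") as src, out_file.open("w", newline="") as dst:
        reader = csv.reader(src)
        writer = csv.writer(dst)
        header = next(reader, None)
        if header is None:
            return 0  # empty input: nothing to write
        writer.writerow(header)
        for row in reader:
            # Keep only rows as wide as the header with no blank cells.
            if len(row) == len(header) and all(cell.strip() for cell in row):
                writer.writerow(row)
                written += 1
    return written
```

With the demo files from the tree above, a call might look like `clean_rows(Path("data/01_raw/demo.csv"), Path("data/02_intermediate/demo_clean.csv"))`.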

Tasks

Use VS Code tasks to trigger data collection, plotting, and paper compilation (image: Tasks.png).

Requirements

Git: usually included with your OS; install it otherwise
GitHub account
GitHub CLI
Docker Desktop
VS Code
VS Code extension: Remote Development
Cookiecutter Python package, installed with:

pip install cookiecutter

For Mac users:

brew install cookiecutter

Getting Started

Navigate to the folder where you want to create the project (on your local drive):
```
cookiecutter https://github.com/tgoelles/cookiecutter_science
```
Answer the questions prompted by cookiecutter.
A new VS Code window will open automatically.
Click "OK" to reopen the folder in a container (only asked the first time).
Read the README.md in the generated project folder.

Git and GitHub

Cookiecutter can generate a GitHub repository for you. This initializes the git repo and pushes it to GitHub. You can then invite your team members to join the project.

Each team member works on their local version of the project, regularly committing and pushing changes.
Avoid working on the same folder over a network.

Note for Windows Users

If you want to use git inside the container (recommended), you need to clone the repo from WSL, as Windows may mess up the .git folder. Git inside the container uses the same .gitconfig as Windows, which is copied into the container.

Ensure user.email and user.name are set (in PowerShell):

git config --global user.name "your_name"
git config --global user.email "you@example.com"
