The overall structure of this repository is as follows:
playbooks
- collections of roles executed in series against a host.roles
- local Ansible functionality grouped by taskinventory
- hosts and variablesall_hosts
- host file with hostnames and host groupsgroup_vars
- group variables for different hosts and host groupsall
- variables shared by all playbooksvars.yml
- unencrypted all variablesvault.yml
-ansible vault
encrypted variables
architecture-decisions
- list of significant architectural decisions, as markdown files
-
Python virtual environment.
- See
.python-version
for the recommended version of Python. If you use pyenv for managing python versions, runpyenv install
. - If you create a python virtualenv in this directory and name it
env
orvenv
, it is included in.gitignore
to be excluded by git - Install python dependencies:
pip install -r requirements.txt
- See
-
Install required Ansible galaxy collections and roles:
ansible-galaxy install -r requirements.yml
-
The CDH Ansible vault keys are stored in LastPass. You need to be added to the appropriate LastPass share and install lastpass-cli. There are two command-line scripts in the
bin/
directory to calllpass
to retrieve the vault keys, and the default configuration is set inansible.cfg
. See below for more details on the vault setup. -
A GitHub personal access token for any playbook that uses the
create_deployment
andclose_deployment
tasks. You can set this as an environment variable asANSIBLE_GITHUB_TOKEN
or pass it on the command line as-e github_token=
. If not specified, a fallback token will be used, which will deploy as theprincetoncdh
user. When running ansible locally, we recommend setting a personal GitHub token.
If you plan to contribute to this repository, you should install the configured pre-commit hooks. (If you installed python dependencies, pre-commit should already be installed)
pre-commit install
This will add install a pre-commit hook to prevent committing an unencrypted vault or private key file. If new encrypted files are added with different names, filename patterns should be added to the pre-commit configuration in .pre-commit-config.yaml
NOTE: If you have a previous installation with the local pre-commit hook script that was included in this repository, you will need to run
pre-commit install -f
to replace it.
To run a playbook, with your python virtual environment activated:
ansible-playbook playbooks/name_of_playbook.yml
Each application has a deploy playbook that can be used to deploy that application to either the production or staging environments. By default, the playbook will deploy to staging. To deploy to production, override the runtime environment with this command line option: -e runtime_env=production
By default, deploying to production will deploy the main
branch of the github repository for that application; deploying to staging will deploy the develop
branch. These may be overridden by specific applications with different conventions.
To deploy a branch or tag other than the default, you can specify an alternate git reference via -e ref=GITREF
where GITREF
is the name of the branch, tag, or a commit hash.
The playbook will run, noting success and failures. The -v
flag adjusts verbosity (adding more v
s will produce more verbosity. Debug tasks are usually written at 2
)
By default, initial provisioning and setup tasks are configured to be skipped. Tasks or groups of tasks should be tagged with both setup
and never
.
To run a playbook without skipping setup tasks, pass the setup
and all
tags, so untagged tasks and tasks tagged setup
run:
ansible-playbook --tags=all,setup playbooks/name_of_playbook.yml
By default, most playbooks create a GitHub deployment, which is used to track which version of the code is deployed to which environment. Through Slack/GitHub integration, GitHub deployments can be used to notify team members when a deploy is taking place and whether or not it succeeds or fails. (This can be very noisy when working on or troubleshooting a deploy.)
The tasks for creating and closing the GitHub deployment are tagged with gh_deploy
. If you want to run a playbook without deploying code, pass --skip-tags gh_deploy
.
If you want to pause the playbook before the new version is switched live, there is an optional pause step you can enable using tags. When running normally and skipping setup tasks, run as follows:
ansible-playbook --tags=all,final-pause playbooks/playbook.yml
When running with setup steps enabled:
ansible-playbook --tags=all,setup,final-pause playbooks/playbook.yml
To revert to previous deploy run call the revert_deploy
playbook with a host_group
matching the deploy you want to revert, e.g.:
ansible-playbook -e host_group=shxco_staging playbooks/revert_deploy.yml
Configurations that are sensitive, such as passwords or API keys, should be
stored in a vault variable file (i.e., inventory/group_vars/*/vault.yml
) and the value of the variable should be encrypted (but not the entire file). For compatibility with Ansible Tower, which loads group variables into inventory, we encrypt individual variables rather than the entire vault.yml file.
To encrypt a single variable, you can use ansible-vault
:
ansible-vault encrypt_string <password_source> '<string_to_encrypt>' --name '<string_name_of_variable>'
To work with multiple encrypted variables, use the local vault_vars.py
helper script.
- If all variables in a vault file are unencrypted, use
encrypt
mode to encrypt them - To view the values of your vaulted variables, use the
decrypt
mode (does not replace content or preserve content) - To check that all variable values in a vault file are encrypted use
check
Example uage:
./bin/vault_vars.py encrypt inventory/group_vars/all/vault.yml
./bin/vault_vars.py decrypt inventory/group_vars/all/vault.yml
./bin/vault_vars.py check inventory/group_vars/all/vault.yml
The check mode of this script is used as a pre-commit hook to prevent sensitive configurations from being checked into version control in plain text.
Ansible vault passwords are stored in shared LastPass vault and loaded using lastpass-cli.
brew install lastpass-cli
lpass login <[email protected]>
For convenience, a local shell script is provided to pull the vault password from lastpass.
Due to compatibility issues with Ansible Tower, the ansible vault identity list configuration cannot be set as a default ansible.cfg.
For local use, set this environment variable:
ANSIBLE_VAULT_IDENTITY_LIST=default@bin/lpass_default.sh
Some encrypted variables are shared across host groups. As with other variables, these may be overridden in a more specific host group vault file.
Copying production data from production to staging can be done for some projects using a special replication playbook. It is defined with multiple hosts, but for typical use you should limit to the host group you want to replicate, e.g.:
ansible-playbook playbooks/replicate.yml --limit=geniza
On subsequent runs for the same host group on the same day, to skip regenerating production database dumps and media archive files, you can limit to just the staging host:
ansible-playbook playbooks/replicate.yml --limit=geniza_staging
Currently replication consists of:
- dumping the production database, restoring it to staging, and running django migrations in the current deploy
- update django sites in the database to match the staging environment
- backing up and restoring any user-uploaded media files and setting correct ownership and permissions
Replication does not yet include restoring Solr indexing or support replication to dev environments.
- Add the appropriate production and staging host names to the source and destination plays
- Define a new variable
replication_source_host
in the staging variables; it should reference the corresponding ansible hostname (e.g., for genizareplication_source_host
is set togeniza_production
) - If database names differ between staging and production, you may need to override the
db_backup_filename
for the staging host variables. - Ensure that the staging host has an
application_url
variable defined; this is needed to correctly set the Django site entry in the migrated database.
Recommended steps for adding a new playbook:
- Copy over an appropriate playbook for a similar application to use as a starting point, and modify the list of roles as needed.
- Define separate staging and production host groups for the application.
- Add the staging host group to the staging group by including them in the list of
[staging:children]
, and create an application group that collects both staging and production host groups. - Add new
group_vars
directories as needed to configure the application or override any defaults. Variables that are relevant for all environments should be set as application group variables; environment-specific configurations can be set in_staging
or_production
group variables.
We use the ADR specification for documenting architectural decisions made over the course of work on this repository - i.e. conventions around our usage of Ansible. Decision documents are stored in the architecture-decisions/
folder as markdown files.
To propose or doucment a new decision, copy the architecture-decisions/template.md
file and rename it using a sequential number and description of the decision that needs making. Decisions that require discussion should be proposed using a pull request for discussion and review.
- Ensure you have access to the geniza ansible vault key in LastPass
- Install lastpass cli
- Set the following environment variables:
ANSIBLE_VAULT_IDENTITY_LIST=geniza@bin/lpass_geniza.sh
GENIZA_DEPLOY_ONLY=1
Note that you will not be able to run setup tasks or decrypt setup vault secrets.