This repository contains SIFT's LACROSSE Cyber Reasoning System (CRS) as submitted to the AIxCC semi-final competition. The latest version of LACROSSE can be found here: https://github.com/siftech/afc-crs-lacrosse
This repository, the CRS Sandbox, includes a ./compose.yaml file.
This file is the only resource competitors will have for infrastructure automation at competition time.
Environment variables and secrets will be injected into ./compose.yaml from each competitor's private copy of the CRS Sandbox.
Competitor SSO accounts to GitHub will be limited to a basic set of actions for making modifications and merging PRs within the GitHub repository.
Competitors should use GitHub issues to report bugs on the respective repositories.
Competitors are also welcome to comment in tickets assisting others.
We encourage all competitors to read through issues (open & closed) within the following repos.
Date: 2024-05-30
On the above date, teams will be provided access to their private CRS repositories.
This repository will be generated from the CRS Sandbox reference repository which will be treated as the template repository.
Merging into main will require the workflows specified in .github/workflows/evaluator.yml and .github/workflows/package.yml to pass.
Competitors MUST release at least one version of their CRS during Phase 1 to validate their package workflow correctly executes.
Failure to do so will prevent a team's CRS from moving forward to Phase 2.
During Phase 1, teams must use their own secret keys and tokens to access collaborator resources (LLM APIs) and authenticate against GitHub.
The job that evaluates the CRS's performance is part of the CRS Evaluator and is called run-validate-crs-submissions.
It runs the CRS as defined in the ./compose.yaml and evaluates its submitted vulnerability discoveries and generated patches. Check the output of the validation steps, CRS submission log step, and CRS logs step for introspection into what happened.
Demo video: phase1.mp4
Date: 2024-06-21
On the above date, the AIxCC Game Architecture team will automatically execute competitors' CRSs against a subset of published challenge problems.
The CRS MUST be released via GitHub Release and you MUST merge at least one pull request with a passing Evaluator workflow.
Competitors must release new versions of their CRS with an updated tag from main after the start of Phase 2 in order to trigger provisioning of their cluster.
Teams MUST merge the automated upstream pull requests in their repos OR rebase for CRS Sandbox version >= v2.5.0.
Each new release of a competitor's CRS will be automatically provisioned.
Only the latest semantic version of a competitor's CRS that is properly tagged from main will be tested in Phase 2.
During Phase 2, secret keys and tokens for collaborator resources (LLM APIs) and GitHub access will be set by the AIxCC infrastructure team.
Competitors will receive access to a live vCluster environment at the start of Phase 2.
Competitors will be able to evaluate their CRS in this environment each time they make a new release of their CRS.
The vCluster environment will use the same SSO from the AIxCC Dashboard.
We plan to add another button to the Dashboard for this environment soon.
Competitors MUST modify their classic PAT to add repo and read:packages access.
Competitors MUST add their PAT via the GitHub CLI as a variable named GHCR_PULL_TOKEN.
They may do this by running gh variable set GHCR_PULL_TOKEN and adding the PAT from above.
The process for creating the PAT is outlined under GitHub Personal Access Token.
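For example, assuming the classic PAT has already been created, the variable can be set non-interactively (the token value below is a placeholder):

```bash
# Set the repository variable the Game Architecture team uses to pull images.
gh variable set GHCR_PULL_TOKEN --body "ghp_examplePAT1234"
```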
However, once we announce Phase 2 is live, teams will be able to log into their CRS at
https://vcluster-platform.aixcc.tech/login
During competition, CRSs may only submit a single working vulnerability discovery on any single commit, and must use that issued CPV UUID for any generated patches. Any further VDSs will be rejected as duplicates. During phase 2, however, duplicate submissions will not be rejected in order to facilitate rapid CRS testing. We may turn rejection back on towards the end of phase 2.
By modifying cp_config/cp_config.yaml, competitors can change the CPs presented to their CRS during phase 2.
Demo video: phase2.mp4
Please review the .github/CODEOWNERS file.
This file shows all the files that require pull request approval by the Game Architecture team.
The main branch protections will prevent making changes to these files.
The following paths have push protections in place. They cannot be modified even within a private branch or pull request.
If you feel that one of these items needs to be modified, please open a CRS Sandbox issue.
The Game Architecture team will review the request and respond accordingly.
.github/actions/trigger-downstream-sync.mjs
.github/workflows/evaluator.yml
.github/workflows/README.md
.github/workflows/template-sync.yml
.github/workflows/trigger-sync-on-release.yml
.tool-versions
charts/*
cp_root/*
crs_scratch/*
dind_cache/*
LICENSE
Makefile
README.md
sandbox/*

Competitors MUST push all container images that are contained in compose.yaml to their CRS repository.
All container images MUST contain a tag.
Docker Compose services that contain a build section MUST be added to .github/workflows/package.yml.
If your solution is referencing a public container like PostgreSQL or MongoDB, you MUST push this image to your CRS repository.
You MUST push these images with a tag to your CRS OCI repository and reference this image using the ghcr.io link.
GitHub has the following Container Registry instructions.
Failure to follow these steps will prevent your CRS images from being able to execute at the competition.
Competitors SHOULD use a tag similar to :${RELEASE_TAG-v1.0.0} for all images in their ./compose.yaml that are built
and pushed automatically with .github/workflows/package.yml.
This will make releases update automatically in the Kubernetes resources.
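As an illustration only (the service and image names below are hypothetical), a compose.yaml entry following this guidance could look like:

```yaml
services:
  my-analysis-service:
    # Built and pushed by .github/workflows/package.yml; the tag falls back to
    # v1.0.0 locally and is replaced by the release tag when you cut a release.
    image: ghcr.io/aixcc-sc/crs-sandbox-example/my-analysis-service:${RELEASE_TAG-v1.0.0}
```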
In the competition environment, a CRS is expected to use Docker (via run.sh)
to exercise the CPs that are packaged and configured to be built, tested, and
patched using the provided Docker container.
One CP (the public Linux kernel CP) includes virtme-ng in its CP-specific
Docker container for the purposes of testing the built kernel.
The virtme-ng program will automatically use /dev/kvm for acceleration if it is present and the CRS is running as root (see Linux CP #10).
Competitors are permitted to add privileged: true to any container under ./compose.yaml.
The Game Architecture team has confirmed the CRS execution environment supports nested virtualization for KVM.
There is no need or support for competitors to map devices directly; instead, they must add privileged: true to the containers that need it.
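For example, a hypothetical service that needs /dev/kvm for virtme-ng could be marked as privileged in ./compose.yaml:

```yaml
services:
  kernel-tester:
    image: ghcr.io/aixcc-sc/crs-sandbox-example/kernel-tester:${RELEASE_TAG-v1.0.0}
    # Grants access to KVM through nested virtualization; no device mapping required.
    privileged: true
```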
Each competitor CRS repository will come pre-packaged with a list of GitHub secrets and environment variables. Teams may change the values of these secrets (e.g. to their own collaborator API keys); however, teams must not change the variable names. Also, teams must ensure their services use the core variables related to the iAPI and LiteLLM connections.
For local development and during Phase 1 of the Evaluation Window, competitors are expected to use / provide their own keys and secrets. During subsequent phases of the evaluation window and at competition, the AIxCC infrastructure team will override these values with their own.
There are currently 5 LLM provider environment variables declared but not populated in example.env, which will be populated at competition time:

- OPENAI_API_KEY
- AZURE_API_KEY
- AZURE_API_BASE
- GOOGLE_APPLICATION_CREDENTIALS
- ANTHROPIC_API_KEY
Note: For local development, the ./sandbox/example.env file should be
copied to ./sandbox/env. This file is included in the .gitignore so competitors don't
accidentally push it to their repository.
Also note: GOOGLE_APPLICATION_CREDENTIALS does not directly contain the Google credential. It
contains a path to vertex_key.json, which contains the actual credentials. To get the content of
vertex_key.json, use the instructions to create a GCP Service
Account in combination
with this document about creating the credential file
itself.
TBD - These variables and the LiteLLM configuration file are not yet complete. This will be released in a CRS sandbox update. We will continue iterating on the CRS sandbox as we grow closer to the competition in order to support newer versions of components.
Please see the competition rules and technical release, as the cut-off dates for changes will be described there.
Using the GitHub CLI, you are able to set repository-level secrets despite not being able to view or edit them in the web UI.
Your GitHub Classic PAT will need the Full control of private repositories permission, and you
will need it set in the GITHUB_TOKEN environment variable. Once you have that configured, try gh secret list. You might get a 403 error requiring SSO sign-in:
Open the link and complete the SSO flow. Then you should be able to use gh secret set to set
secrets on your repository and gh secret list to show which ones exist and when they were most
recently set.
You can now also set variables with gh variable set MY_EXAMPLE_VARIABLE and list with gh variable list
The GitHub CRS Validation workflow expects the repo-level
secrets to have the same names as in sandbox/env (OPENAI_API_KEY, etc). The only exception to
this is Google's LLM credential, which should be stored in VERTEX_KEY_JSON.
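For example, with the PAT and SSO flow from above completed, setting the secrets might look like this (the values are placeholders):

```bash
gh secret set OPENAI_API_KEY --body "sk-example-key"   # names match sandbox/env
gh secret set VERTEX_KEY_JSON < vertex_key.json        # Google's credential is the file contents
gh secret list                                         # confirm which secrets exist
```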
| Provider | Model | Pinned Version | Requests per Minute (RPM) | Tokens per Minute (TPM) |
|---|---|---|---|---|
| OpenAI | gpt-3.5-turbo | gpt-3.5-turbo-0125 | 800 | 80,000 |
| OpenAI | gpt-4 | gpt-4-0613 | 200 | 20,000 |
| OpenAI | gpt-4-turbo | gpt-4-turbo-2024-04-09 | 400 | 60,000 |
| OpenAI | gpt-4o | gpt-4o-2024-05-13 | 400 | 300,000 |
| OpenAI | text-embedding-3-large | text-embedding-3-large | 500 | 200,000 |
| OpenAI | text-embedding-3-small | text-embedding-3-small | 500 | 200,000 |
| Anthropic | claude-3-sonnet | claude-3-sonnet-20240229 | 1,000 | 80,000 |
| Anthropic | claude-3.5-sonnet | claude-3-5-sonnet-20240620 | 1,000 | 80,000 |
| Anthropic | claude-3-opus | claude-3-opus-20240229 | 1,000 | 40,000 |
| Anthropic | claude-3-haiku | claude-3-haiku-20240307 | 1,000 | 100,000 |
| Google | gemini-pro | gemini-1.0-pro-002 | 120 | pending (as of 20240610) |
| Google | gemini-1.5-pro | gemini-1.5-pro-preview-0514 | 120 | pending (as of 20240610) |
| Google | textembedding-gecko* | textembedding-gecko@003* | pending (as of 20240610) | pending (as of 20240610) |
Note: OpenAI embedding models have currently been released in only a single version, so the pinned version and model name strings are identical.
Some OpenAI models will also be available as an Azure-hosted version:
| Provider | Model | Pinned Version | Requests per Minute (RPM) | Tokens per Minute (TPM) |
|---|---|---|---|---|
| Azure | gpt-3.5-turbo | gpt-3.5-turbo-0613 | 100 | 80,000 |
| Azure | gpt-4o | gpt-4o-2024-05-13 | 100 | 300,000 |
| Azure | text-embedding-3-large | text-embedding-3-large | 100 | 120,000 |
| Azure | text-embedding-3-small | text-embedding-3-small | 100 | 120,000 |
Competitors will be able to freely request whichever model they like by the model name in the chart above, plus a prefix "oai-" or "azure-", e.g. "oai-gpt-4o". This was done because of performance differences between the models as hosted on OpenAI vs. Azure infrastructure. The models themselves are guaranteed to be identical, but no such promises can be made regarding the supporting provider infrastructure.
These models are all utilized by hitting the LiteLLM /chat/completions endpoint, specifying model and message using the OpenAI JSON request format. This is the tentative complete list of models.
The Requests per Minute (RPM) and Tokens per Minute (TPM) columns in the table above are rate limits that are enforced per CRS for the ASC. The LiteLLM proxy will be responsible for implementing these limits. The RPM and TPM limits are enforced per model, not in aggregate across models or providers.
Note: the "*" next to model "textembedding-gecko" indicates this model target is still in flux. The AIxCC infrastructure team is still waiting on LiteLLM to finalize support for the model "text-embedding-004". If this newer model is not integrated in time to support its use during the ASC, then the fallback will likely be "textembedding-gecko@003".
We recommend using Ubuntu 22.04 LTS for CRS Sandbox development and will be unable to investigate issues with other base operating systems.
In order to work with the CRS Sandbox, you must set up your GitHub personal access token (PAT) by following these steps.
1. Configure a personal access token (PAT) with `repo` (all checks) and `read:packages` permissions by following this guide.
2. Authorize the generated PAT for the `aixcc-sc` organization by following this guide.
3. Run `echo "example-token-1234" | docker login ghcr.io -u USERNAME --password-stdin`, replacing example-token-1234 with your generated PAT.
4. Confirm that you see `> Login Succeeded` in your output from step 3.
5. Competitors MUST add this key as a repository variable called `GHCR_PULL_TOKEN`. This MUST be a variable and NOT a secret. The Game Architecture team will use this variable to pull your repository images at competition time.
- Generate an SSH key by following this guide
- Upload the generated SSH key to your AIxCC GitHub account by following this guide
- Follow this guide to authorize the SSH key for the `aixcc-sc` organization
This repository has a .pre-commit-config.yaml file for assisting with local development.
While competitors are not required to use this, they may find it easier to pass the mandatory evaluation checks.
You can install the command-line tool by going here
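Once the pre-commit tool is installed, a typical setup looks like this:

```bash
pre-commit install          # install the git hook so checks run on each commit
pre-commit run --all-files  # run every configured hook against the whole repo
```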
Most dependencies in this repository can be automatically managed by mise, but you'll have to install the following yourself:
- docker >= 24.0.5
- docker-compose >= 2.26.1
- GNU make >= 4.3
Optional (for local Kubernetes testing):
- k3s >= v1.29.5
- nfs-common >= 1:2.6.3ubuntu1
- open-iscsi >= 2.1.8-1ubuntu2
We've added a Makefile target, make install, which will set up the required dependencies.
This is the exact same target used by the GitHub workflow evaluator.
Additionally, you will need permissions to interact with the Docker daemon.
Typically this means adding your user to the docker group.
The crs-sandbox contains its own Docker daemon inside of a container. With make up this runs Docker-in-Docker.
However, with Kubernetes via make k8s and at competition, this runs the dockerd daemon container within Kubernetes. By default this is not accessible on the host machine, but you can enable the port mapping by editing ./compose_local_overrides.yaml. Note that by doing this, you are exposing the Docker daemon on your host without authentication enabled.
Once you've done that, set DOCKER_HOST=tcp://127.0.0.1:2375.
```bash
export DOCKER_HOST=tcp://127.0.0.1:2375
docker logs <container name>
```

We now use K3S for our local Kubernetes with the Longhorn storage driver.
We use a Kubernetes context named crs for all kubectl targets in the Makefile to prevent modification to other Kubernetes environments.
You MUST set your GitHub PAT in the env file so that Kubernetes can use this to pull images.
```bash
make install
sudo cp /etc/rancher/k3s/k3s.yaml /tmp/k3s.yaml
sudo chown $USER /tmp/k3s.yaml
KUBECONFIG=/tmp/k3s.yaml:~/.kube/config kubectl config view --flatten > ~/.kube/config
kubectl config rename-context default k3s
kubectl config use-context k3s
```

To clean up the local K3S resources:

```bash
make k8s/k3s/clean
```
Several teams inquired about the ability of their CRS to work directly with the Kubernetes API in a few tickets.
This functionality has now been added to the CRS Sandbox.
This approach is purely optional and should be considered an unsupported expert mode so teams can perform dynamic orchestration of their CRS.
Unsupported means that GitHub issues related to Kubernetes API access will receive a lower priority.
Teams using the Kubernetes API MUST manage their own dynamic resources, and their CRS approach MUST have the ability to recover from memory exhaustion, etc.
To enable this feature the compose.yaml file must contain the following for each service that needs Kubernetes API access.
```yaml
labels:
  kompose.serviceaccount-name: crs
```

This repository defines its dependencies in a .tool-versions file.
mise can read this file and automatically install the tools at the required versions.
Install mise, set it up in your shell, and then run mise install.
mise will then manage your PATH variable to make the tools available whenever you cd into this repository.
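For example, on a typical bash setup this might look like the following (the install path is the installer's default and may differ on your machine):

```bash
curl https://mise.run | sh                   # install mise
eval "$(~/.local/bin/mise activate bash)"    # hook mise into the current shell
mise install                                 # install the versions pinned in .tool-versions
```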
We've included a Makefile with helpful targets to make working with the CRS Sandbox easier.
However, you can copy any commands and run them on your own.
Please note the use of --profile with all docker compose commands.
This is so we can easily swap --profile development with --profile competition at competition time, but competitors can use the --profile development to run the local copy of emulated resources.
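If you do run docker compose by hand rather than through the Makefile, the profile selection looks roughly like this (illustrative only; the Makefile may pass additional flags and override files):

```bash
# Local development with the emulated competition resources
docker compose --profile development up --build -d

# Competition-time profile, where the emulated resources are replaced
docker compose --profile competition up -d
```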
A CRS will find the CPs under evaluation in the volume indicated by the environment variable
${AIXCC_CP_ROOT}. At competition time and likely during some part of the evaluation
window, this volume will be configured as read-only. As such, a CRS MUST copy a CP
from ${AIXCC_CP_ROOT} to a writable location in order to build or test it.
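For example (the CP folder name and run.sh subcommand are illustrative), a CRS service might stage a CP like this:

```bash
# ${AIXCC_CP_ROOT} is read-only; copy the CP into the writable scratch volume first.
cp -r "${AIXCC_CP_ROOT}/example-cp" "${AIXCC_CRS_SCRATCH_SPACE}/example-cp"
cd "${AIXCC_CRS_SCRATCH_SPACE}/example-cp"
./run.sh build
```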
The volume indicated by the environment variable ${AIXCC_CRS_SCRATCH_SPACE} will be writable
by the CRS and CPs. Moreover, this volume can be shared among the CRS services as a
shared file system. It is the responsibility of the CRS developers to ensure that
use of this shared volume is coordinated between its services to prevent data corruption
via collisions or race conditions. No other folders or volumes will be shared between
containers for competitor use during competition.
As stated previously, a CRS will NOT have internet access except for via the LiteLLM proxy to the configured LLM providers.
Because of this, competitors MUST provide all artifacts within their Docker container images.
All images needed to execute a CRS MUST be included under .github/workflows/package.yml under the jobs.build-and-push-image.strategy.matrix.include section.
The Game Architecture team will migrate these images to the competition environment prior to starting your CRS.
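A sketch of that section is shown below; the entry keys and service names are hypothetical and should match whatever your workflow's build steps expect:

```yaml
jobs:
  build-and-push-image:
    strategy:
      matrix:
        include:
          # One entry per compose.yaml service that has a build section.
          - image: my-analysis-service
            context: ./my-analysis-service
          - image: kernel-tester
            context: ./kernel-tester
```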
We've modified our original guidance on the tagging process.
All teams should be using SemVer 2.0.0 to tag releases.
A team MUST have a tag of v1.0.0 OR greater within their private CRS repository at competition.
Teams MUST use a v prefix in their tags.
All releases MUST be from the main branch ONLY. Failure to create release tags from main will lead to a failed release.
Teams can create these tags by following the GitHub Release process with https://docs.github.com/en/repositories/releasing-projects-on-github/managing-releases-in-a-repository
This will automatically tag any Docker images you've specified under .github/workflows/package.yml outlined above.
This will also tag the Helm chart of your CRS automatically.
At competition the AIxCC Game Architecture team will use the latest SemVer tag available on your repository that was present at the end of the submission window.
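For example, after merging to main, a release could be cut with the GitHub CLI (the version number is illustrative):

```bash
# Creates the v1.1.0 tag on main and publishes the release, which triggers
# the package workflow to tag your images and Helm chart.
gh release create v1.1.0 --target main --generate-notes
```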
A Makefile has been provided with a number of commands to make it easy to clone the exemplar repos, stand up the environment, and perform a variety of other actions.
Copy sandbox/example.env to sandbox/env and replace the variables with your own for local development.
If you do not have working GitHub credentials that can pull images from GHCR, make up will fail.
```bash
cp sandbox/example.env sandbox/env
```

- `make cps` - clones the exemplar challenges into the local ./cp_root folder (the source folder for ${AIXCC_CP_ROOT})
- `make up` - brings up the development CRS Sandbox; you can visit http://127.0.0.1:8080/docs to see the iAPI OpenAPI spec
- `make down` - tears down the development CRS Sandbox
- `make force-reset` - performs a full Docker system prune of all local Docker containers, images, networks, and volumes. This can be useful if you accidentally orphaned some Docker process or other resources.

See the Makefile for more commands.
The Makefile includes targets for make k8s, make k8s/development, and make k8s/competition.
This will generate a resources chart in a .k8s/ folder.
The make k8s command uses K3S to run Kubernetes locally and will also apply the generated Kubernetes resources onto your cluster.
This process uses a component called Kompose for translating the Docker Compose file into resources.
The CRS Sandbox will include a CI/CD action which the private repos must also use.
This will generate and push the container images to the respective per-competitor private GitHub.
This will also push the generated manifest file as an OCI compliant manifest to the private GitHub repos.
The evaluator.yml action runs make k8s in every pull request to main.
This is to ensure all resources can be properly translated into manifests and deployed into Kubernetes.
One of Kubernetes' most useful features is autoscaling. Kompose exposes horizontal pod autoscaling, among many other features, via labels set on services. This example produces an HPA configuration that will scale from 3 replicas up to 12, adding and removing replicas to target an average CPU utilization of 80% and memory utilization of 1024 megabytes. Please note these are probably not good default values for your application and you should customize them.
```yaml
services:
  job-runner:
    labels:
      # Thresholds for automatic scale up
      kompose.hpa.cpu: 80 # percentage
      kompose.hpa.memory: 1024Mi
      # High & low limits for number of replicas
      kompose.hpa.replicas.max: 12
      kompose.hpa.replicas.min: 3
```

Docker Compose and Kubernetes both support the concepts of requests and limits.
We recommend that teams review Docker Compose Deploy Specification.
Kompose V3 will automatically convert these requests and limits into requests and limits within Kubernetes.
Nodes will be labeled with node=node1, node=node2, node=node3 at competition time. Therefore in your kompose_competition_overrides.yaml, you will be able to do the following for a service to constrain its placement:
```yaml
deploy:
  placement:
    constraints:
      - node.labels.node == node1
```

Teams may use the following files to add requests and limits onto any containers within a CRS (see the sketch after this list).
- ./compose_local_overrides.yaml
- ./kompose_development_overrides.yaml
- ./kompose_competition_overrides.yaml
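For instance, resource requests and limits for a service could be declared in one of those override files like so (the values are illustrative):

```yaml
services:
  job-runner:
    deploy:
      resources:
        reservations:   # translated by Kompose into Kubernetes requests
          cpus: "1.0"
          memory: 2G
        limits:         # translated by Kompose into Kubernetes limits
          cpus: "4.0"
          memory: 8G
```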
Kompose has some limitations on resource creation. Depending on what attributes you set on your services, you will get different Kubernetes resource types. Here are some typical use cases.
This type of service is never expected to exit cleanly. If it exits, it will be due to an uncaught exception. This might be a database or cache.
Set restart: always on this service. This produces a Deployment in Kubernetes. If you want multiple, you can use the deploy.replicas key to
scale horizontally.
This type of service is intended to run once, typically when initialized, and not restart upon completion.
Set restart: on-failure on this service. This produces a Pod in Kubernetes. If you want multiple, you will need to declare multiple services.
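Putting the two cases together, a compose.yaml fragment might look like this (service and image names are hypothetical):

```yaml
services:
  cache:
    image: ghcr.io/aixcc-sc/crs-sandbox-example/redis:7.2
    restart: always        # long-running service -> Kubernetes Deployment
    deploy:
      replicas: 2          # horizontal scaling via deploy.replicas
  db-init:
    image: ghcr.io/aixcc-sc/crs-sandbox-example/db-init:${RELEASE_TAG-v1.0.0}
    restart: on-failure    # run-once service -> Kubernetes Pod
```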
This diagram depicts the CRS Sandbox during the development phase with --profile development and during the competition phase with --profile competition.
As you can see, the iAPI remains part of the CRS Sandbox but can communicate with the upstream API.
However, the LiteLLM component moves to a centralized component that does NOT run within the CRS Sandbox at competition.
The ASC will be organized into a series of rounds, and in each round a CRS will analyze a
single CP (i.e., a single CP folder will be present in ${AIXCC_CP_ROOT}). Each round will
last four (4) hours.
At the start of each round, the folders ${AIXCC_CP_ROOT} and ${AIXCC_CRS_SCRATCH_SPACE}
will be reset with only the target CP for that round in ${AIXCC_CP_ROOT}. The contents of
${AIXCC_CRS_SCRATCH_SPACE} will not persist between rounds.