gxformat2

Galaxy Workflow Format 2 - library for converting between Format2 (YAML) and native Galaxy (.ga) workflow representations.

Setup

uv sync --group test --group lint --group mypy

Running Tests

uv run --group test pytest tests/ -x -q

Set GXFORMAT2_TEST_IWC_DIRECTORY to a local clone of the IWC repository to enable additional integration tests that exercise real-world workflows (conversion, linting, round-tripping). If you know of an IWC checkout, set it during test runs.

Linting

uv run --group lint ruff check
uv run --group lint flake8
uv run --group lint black --check --diff .
uv run --group mypy mypy gxformat2

Project Structure

gxformat2/converter.py - Format2 → native Galaxy conversion
gxformat2/export.py - native Galaxy → Format2 conversion
gxformat2/normalize.py - Format2 normalization (step outs, anonymous refs)
gxformat2/model.py - Shared utilities (types, source parsing, step helpers)
gxformat2/lint.py - Workflow linting
gxformat2/abstract.py - Abstract CWL export
gxformat2/examples/ - Workflow fixture files and typed catalog (see below)
tests/_helpers.py - Test utilities: to_native(), from_native(), round_trip(), MockGalaxyInterface
tests/example_wfs.py - Legacy inline workflow strings, migrating to gxformat2/examples/

Schema & Pydantic Models

Source YAML schemas (schema-salad format) live in schema/:

schema/v19_09/workflow.yml - Format2 schema definition
schema/native_v0_1/workflow.yml - Native Galaxy workflow schema definition
schema/common/common.yml - Shared record definitions (uses pydantic: namespace for type overrides)

Generated Python models land in gxformat2/schema/ — do not hand-edit:

gxformat2/schema/gxformat2.py / gxformat2_strict.py - Format2 Pydantic v2 models (lax/strict)
gxformat2/schema/native.py / native_strict.py - Native workflow Pydantic v2 models (lax/strict)
gxformat2/schema/v19_09.py / native_v0_1.py - Legacy schema-salad-tool codegen (large files)

Hand-written Pydantic models that extend the generated ones:

gxformat2/normalized/_format2.py - Normalized Format2 models
gxformat2/normalized/_native.py - Normalized native workflow models
gxformat2/normalized/_conversion.py - Expanded workflow models
gxformat2/cytoscape/models.py - Cytoscape.js visualization models

Regenerating Schemas

bash build_schema.sh

Uses schema-salad-plus-pydantic (Pydantic v2) and schema-salad-tool (legacy codegen). Key env vars:

SKIP_PYDANTIC=1 - Skip Pydantic model generation
GXFORMAT2_SCHEMA_BUILD_DRY_RUN=1 - Write to temp dir (used by make lint)
SKIP_JAVA=1 / SKIP_TYPESCRIPT=1 - Skip Java/TS codegen

Generated files are excluded from ruff linting in pyproject.toml.

Workflow Fixtures (`gxformat2/examples/`)

Prefer file-based fixtures in gxformat2/examples/ over inline strings in tests/example_wfs.py. File-based fixtures are visible to docs, downstream consumers (Planemo), and the catalog validation.

Adding a new fixture

Create the file with the naming convention {origin}-{description}[-{variant}].{ext}:
- Origin: real (unmodified Galaxy export), real-hacked (real with manual edits), synthetic (hand-written), converted (machine-generated)
- Extension: .gxwf.yml (Format2) in format2/, .ga (native JSON) in native/
Add an entry to gxformat2/examples/catalog.yml with file, origin, format, and tests fields
Use in tests via the API:

from gxformat2.examples import load, load_contents, get_path

wf_dict = load("synthetic-my-case.gxwf.yml")        # parsed dict
wf_str = load_contents("synthetic-my-case.gxwf.yml") # raw string
wf_path = get_path("synthetic-my-case.gxwf.yml")     # absolute path

Catalog validation

tests/test_examples_catalog.py enforces:

Every catalog entry points to an existing file
Every example file on disk appears in the catalog
Every test file referenced in catalog entries exists

Migration note

tests/example_wfs.py still has ~20 inline workflow constants. Two (BASIC_WORKFLOW, NESTED_WORKFLOW) already delegate to load_contents(). Remaining constants should migrate to files when touched.

Declarative Tests (`gxformat2/examples/expectations/`)

Prefer declarative YAML-driven tests over imperative Python tests for verifying structural properties of operation results (normalization, conversion, ensure). The YAML format is language-agnostic and reusable from a TypeScript project.

Test runner: tests/test_interop_tests.py Expectation files: gxformat2/examples/expectations/*.yml

Expectation file format

test_name:
  fixture: synthetic-basic.gxwf.yml       # from gxformat2/examples/
  operation: to_native                     # function to call on the loaded fixture
  assertions:
    - path: [steps, "1", tool_id]          # navigate the result object
      value: cat1                          # exact equality
    - path: [steps, "1", workflow_outputs, $length]
      value: 1
    - path: [outputs, 0, outputSource]
      value_contains: the_cat              # substring check
    - path: [warnings]
      value_any_contains: "disconnected"   # any list element contains substring
    - path: [unique_tools]
      value_set:                           # unordered set comparison
        - {tool_id: cat1, tool_version: "1.0"}

Path elements

str — dict key lookup (falls back to attribute access)
int — list index
$length — terminal, returns len(current)
{field: value} — find first list item where item.field == value

Available operations

Normalization: normalized_format2, normalized_native, expanded_format2, expanded_native
Conversion: to_format2, to_native, ensure_format2, ensure_native
Validation: validate_format2, validate_format2_strict, validate_native, validate_native_strict
Linting: lint_format2, lint_native — return {errors: [...], warnings: [...], error_count: N, warn_count: N}
Best practices: lint_best_practices_format2, lint_best_practices_native — same return shape

Special keys

assertions may be omitted or empty — the operation succeeding is the test
expect_error: true — the operation must raise an exception (test passes on error, fails on success)

# Positive validation: operation succeeds, no assertions needed
test_basic_valid_format2:
  fixture: synthetic-basic.gxwf.yml
  operation: validate_format2_strict

# Negative validation: operation must fail
test_extra_field_rejected:
  fixture: synthetic-extra-field.gxwf.yml
  operation: validate_format2_strict
  expect_error: true

Adding a declarative test

Pick or create an expectation file named after the operation/feature
Add a named case with fixture, operation, and optionally assertions / expect_error — test IDs must be unique across all expectation files
Ensure the fixture exists in gxformat2/examples/ and its catalog.yml entry lists tests/test_interop_tests.py

When to use imperative tests instead

Mutation safety, mocking/resolvers, cycle detection, and scenarios requiring Python-specific setup (e.g. tmp_path, custom ConversionOptions) stay in test_normalized.py.

Key Patterns

Round-trip testing: round_trip(yaml_string) converts Format2→native→Format2 and validates
Source references: Format2 uses step_label/output_name with / as separator. resolve_source_reference() in model.py handles labels containing /
walk_id_list_or_dict() in normalize.py handles both dict and list step/input representations

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

gxformat2

Setup

Running Tests

Linting

Project Structure

Schema & Pydantic Models

Regenerating Schemas

Workflow Fixtures (`gxformat2/examples/`)

Adding a new fixture

Catalog validation

Migration note

Declarative Tests (`gxformat2/examples/expectations/`)

Expectation file format

Path elements

Available operations

Special keys

Adding a declarative test

When to use imperative tests instead

Key Patterns

FilesExpand file tree

CLAUDE.md

Latest commit

History

CLAUDE.md

File metadata and controls

gxformat2

Setup

Running Tests

Linting

Project Structure

Schema & Pydantic Models

Regenerating Schemas

Workflow Fixtures (gxformat2/examples/)

Adding a new fixture

Catalog validation

Migration note

Declarative Tests (gxformat2/examples/expectations/)

Expectation file format

Path elements

Available operations

Special keys

Adding a declarative test

When to use imperative tests instead

Key Patterns

Workflow Fixtures (`gxformat2/examples/`)

Declarative Tests (`gxformat2/examples/expectations/`)