Synapse Engine v2.0.0 - Fully GPU-Driven & Data-Oriented Architecture by TamasPetii · Pull Request #1 · TamasPetii/SynapseEngine

TamasPetii · 2026-04-03T19:11:34Z

Pull Request: Synapse Engine v2.0.0 - Fully GPU-Driven & Data-Oriented Architecture

Overview

This pull request introduces a massive 150-commit rewrite of the Synapse Engine.

The architecture has been completely migrated from a legacy Object-Oriented, CPU-bound model to a strict Data-Oriented Design (DOD) approach.

The rendering backend is now a 100% GPU-driven, bindless Vulkan pipeline utilizing Multi-Draw Indirect (MDI) and Mesh Shaders.

Core Architecture & ECS

Segmented Entity Pool (ECS)

Replaced the legacy ECS with a cache-friendly memory layout that categorizes entities into:

Static
Dynamic
Streamed

This improves memory locality and significantly reduces cache misses during system iteration.

Dynamic Skipping Optimization

System iterations now track pool states to automatically skip unchanged dynamic components, drastically reducing CPU overhead.

Massive Multithreading (Taskflow)

Integrated Taskflow to fully parallelize the engine loop.

The following operations are now executed asynchronously through a dependency graph:

system updates
Assimp mesh loading
animation loading
GPU buffer staging

MVI Editor UI

Implemented a Model-View-Intent (MVI) architecture for the ImGui editor with full support for:

Undo / Redo
Gizmo state management

Physics & Dependencies

Integrated Jolt Physics as the physics backend.

Dependency management has been transitioned to vcpkg (manifest mode).

Additional serialization support added with Boost.JSON.

GPU-Driven Rendering Pipeline

Bindless MDI Pipeline

The entire scene dispatch now relies on:

a single global instance buffer
a global indirect draw command buffer

Using:

vkCmdDrawIndirectCount
vkCmdDrawMeshTasksIndirectCountEXT

Dual Pipeline Execution

Meshes can dynamically switch between:

Traditional Pipeline → Vertex / Fragment
Mesh Shader Pipeline → Task / Mesh

Both can coexist in the same scene.

WBOIT Integration

Implemented Weighted Blended Order-Independent Transparency (WBOIT) for transparent materials.

This guarantees accurate blending without depth sorting.

O(1) Render Submission

The entire complex scene is submitted in exactly 8 draw calls.

Draw calls are strictly categorized by:

pipeline type
material type
transparency
culling side

Categories:

Traditional Opaque 1-sided
Traditional Opaque 2-sided
Traditional Transparent 1-sided
Traditional Transparent 2-sided
Mesh Opaque 1-sided
Mesh Opaque 2-sided
Mesh Transparent 1-sided
Mesh Transparent 2-sided

Global Material Indirection

Introduced a GPU-side Material Lookup Table.

This allows instances sharing the same base mesh to override materials without breaking the batch.

Advanced Culling Systems

2-Phase Compute Culling

Implemented a fully GPU-driven 2-phase culling pipeline.

Phase 1 — Model-Level Culling

Performs:

frustum culling
occlusion culling

Using bounding spheres and aabb.

Phase 2 — Mesh-Level Culling

Individual meshes are culled through Indirect Compute Dispatch.

Subgroup Optimizations

Replaced standard atomics with Subgroup Operations in compute shaders.

Used specifically for incrementing Indirect Draw Command instance counters.

Task Shader Culling

Integrated meshlet-level culling directly into the Task Shader stage.

Invisible geometry is rejected before reaching the mesh stage.

Light Culling

Implemented dedicated GPU and CPU-driven culling passes for:

Point Lights
Spot Lights

HiZ Map Generation

Rewrote the Hierarchical Z-Buffer generator.

Now safely handles odd-resolution texture edge cases for accurate occlusion culling.

Assets, Dynamic LODs & Animation

Offset-Based Dynamic LODs

Integrated meshoptimizer to generate:

up to 4 LOD levels
meshlets per mesh

LOD selection is performed dynamically at runtime based on camera distance.

Uses offset descriptors, eliminating unnecessary index buffer duplication.

Vertex Pulling Animation

Rewrote skeletal animation to work without baked bone data in the vertex struct.

Animations now use:

bindless vertex pulling
separate animation address buffers

This allows the same vertex shader to process both:

static meshes
animated meshes

Per-Frame Animated Colliders

The ColliderProcessor now calculates accurate:

bounding spheres
AABBs

for animated models every frame on background threads.

This data is fed into GPU culling passes.

Unified Image Architecture

Created a unified image processing pipeline:

ImageBuilder
ImageSource (File / Procedural)
ImageProcessor

Supports:

runtime CPU mipmap generation
runtime GPU mipmap generation
procedural height map generation

Tooling, Profiling & Debugging

Vulkan Timestamp Profiler

Built a custom stall-free, N-buffered GPU Timestamp Query profiler.

Integrated with:

in-engine dashboard
file logger

Tracks pass-by-pass timings for:

Compute
Graphics
Transfer

All measured in milliseconds.

Visual Collider Debugging

Added 1-draw-call collider visualization for:

sphere colliders
AABB colliders
model level
mesh level
meshlet level

Editor Depth Pass for WBOIT

Added a secondary:

depth pass
EntityID pass

for transparent objects.

This solves the WBOIT depth-write issue and enables:

pixel-perfect mouse picking
accurate Gizmo manipulation

inside the editor.

Summary

This PR represents a complete next-generation engine architecture overhaul, focused on:

GPU-driven rendering
data-oriented systems
aggressive batching
advanced compute culling
modern mesh shader workflows
scalable tooling and profiling

The result is a significantly more scalable, cache-efficient, and high-performance rendering engine.

…ings. - Vcpkg will download all the necessary packages, headers, libs, dlls.

…amps

…and storage policies

… on template functions

…t type index manager implemented

…/registry test + CI/CD google test pipeline added

…ping extensions

…set, atomic and normal uint8 flags!!!

…n (VK_EXT_shader_object) - Changed vcpkg to be static-md -> To fix shaderc and spriv reflection diamond lib include problems!!!

…, Utils)

- BinarySemaphore - TimelineSemaphore - Fence

…ns for pipeline layout

…ight pass

…nted

…pu culling shaders

…ojection lod selection

TamasPetii added 30 commits December 26, 2025 18:57

Old engine delete + New project initial setup

31d92e6

Vcpkg submodule, Vcpkg manifest json, Common project build props sett…

b445a64

…ings. - Vcpkg will download all the necessary packages, headers, libs, dlls.

Setup documentation + Vcpkg glfw test

375d24a

Ci/Cd test setup

6cf3fa7

SynMacros + Implemented proper console/file/memory logger with timest…

22090e0

…amps

Initial template based ecs pool implementation, with modular mapping …

899bc12

…and storage policies

Mapping CRTP + Constraint concept

4f6d644

Storage CRTP and Concept constraint + Refactored to use requires void…

f19761f

… on template functions

Logger refactor and log format string fix + Unique dll safe super fas…

70a7b00

…t type index manager implemented

Implemented component registry + Created Google Test project for pool…

42a07ed

…/registry test + CI/CD google test pipeline added

CI pipeline vs22 buildtool test

07f45e5

CI msbuild + test vcpkg explicit integration

2845da4

CI vcpkg package install explicit pipeline

de26f76

Registry seq and par view

d1ae409

Implemented segmented storage "proper" static/dynamic/stream mechanism

6b46697

Refactored storage/mapping/pool: Introduce Policy-based Storage & Map…

8721245

…ping extensions

Refactored flagmixin: Policy based template flag mixin to support bit…

18ae517

…set, atomic and normal uint8 flags!!!

Modular window, application abstraction, sandbox.

af621c9

Vulkan Core abstraction implemented using volk and vma

adfd3d6

SwapChain recreate impleneted on window framebuffer resize

23f2849

Shader abstraction with shaderc and include loader + Shader reflectio…

e55aa90

…n (VK_EXT_shader_object) - Changed vcpkg to be static-md -> To fix shaderc and spriv reflection diamond lib include problems!!!

Small project changes

cdfddcf

CI/CD vulkan pipeline

9e0594b

CI/CD changes

801d013

Vulkan new Buffer and Image abstraction implemented (Handler, Factory…

120890c

…, Utils)

Implemented vulkan synchronization abstraction

b6ee3b8

- BinarySemaphore - TimelineSemaphore - Fence

Implemented CommandPool and CommandBuffer abstraction.

98e20de

Implemented rendering vulkan util functions + shader group abstractio…

b1ffb2b

…ns for pipeline layout

Complex shader test and some fix

6bdbd29

Implemented resource and shader managers.

868eb41

TamasPetii added 27 commits March 30, 2026 16:25

Resolved some sync ecs bugs, and implemented initial deferred point l…

d7e395b

…ight pass

Refactored shaders

ea2301b

Refactored mesh shaders too

89f5fa3

Refactored a lot of shaders

fe8e71e

Refactored wireframe shaders

7895b13

Implemented deferred spot light pass and systems

9e18d73

Refactored Render Passes

82a40a9

Refactored push constants, shared between cpp passes and shaders!

4a8ea69

Missing wireframe files

197e9ee

Direction light systems and new frustum collider.

73288f7

Resolved frustum culling mismatch

3551cb2

Implemented direction light deferred pass and shaders

381cadf

Gpu driven point light culling works fine

0608296

Point and Spot light gpu driven occlusion and 0 pixel culling impleme…

8a886a5

…nted

Refactored renderers, implemented projection based lod selection in g…

5eb76f6

…pu culling shaders

Resolved some bugs

3b76896

Implemented settings window, and cpu 0 pixel triangle culling with pr…

d42e747

…ojection lod selection

Implemented debug camera scene view

daebdbe

Implemented billboards for dir/point/spot lights

d964d0a

Implemented camera billboards

a668000

Cpu fully inside entity bit

ffc2c1d

Resolved cpu-gpu material bug

650261d

Created performance engine mode

2bc4244

Scene changes

8f1df71

Added scene billboard icons

c9810ee

Gitignore

2cb13f3

GpuProfiler implemented

2da8c8b

TamasPetii self-assigned this Apr 3, 2026

TamasPetii merged commit 41a6878 into main Apr 3, 2026
1 check failed

TamasPetii deleted the remake branch May 14, 2026 11:24

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Synapse Engine v2.0.0 - Fully GPU-Driven & Data-Oriented Architecture#1

Synapse Engine v2.0.0 - Fully GPU-Driven & Data-Oriented Architecture#1
TamasPetii merged 144 commits into
mainfrom
remake

TamasPetii commented Apr 3, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

TamasPetii commented Apr 3, 2026

Pull Request: Synapse Engine v2.0.0 - Fully GPU-Driven & Data-Oriented Architecture

Overview

Core Architecture & ECS

Segmented Entity Pool (ECS)

Dynamic Skipping Optimization

Massive Multithreading (Taskflow)

MVI Editor UI

Physics & Dependencies

GPU-Driven Rendering Pipeline

Bindless MDI Pipeline

Dual Pipeline Execution

WBOIT Integration

O(1) Render Submission

Global Material Indirection

Advanced Culling Systems

2-Phase Compute Culling

Phase 1 — Model-Level Culling

Phase 2 — Mesh-Level Culling

Subgroup Optimizations

Task Shader Culling

Light Culling

HiZ Map Generation

Assets, Dynamic LODs & Animation

Offset-Based Dynamic LODs

Vertex Pulling Animation

Per-Frame Animated Colliders

Unified Image Architecture

Tooling, Profiling & Debugging

Vulkan Timestamp Profiler

Visual Collider Debugging

Editor Depth Pass for WBOIT

Summary

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant