
Conversation

Copilot AI (Contributor) commented Jan 10, 2026

LocalAI uses signals to terminate backend child processes (llama.cpp, diffusers, whisper). When running in Kubernetes with a security context that drops all capabilities, the container cannot send those signals to its child processes, so stopped models leak VRAM and leave orphaned processes behind, with permission denied errors in the logs.

Changes

Added documentation to docs/content/getting-started/kubernetes.md and docs/content/installation/kubernetes.md:

  • Security Context Requirements: Explains CAP_KILL necessity and provides example Pod configuration with restrictive security settings
  • Troubleshooting: Documents symptoms, root causes, and verification steps for the permission denied error

Example Configuration

securityContext:
  allowPrivilegeEscalation: false
  capabilities:
    drop:
      - ALL
    add:
      - KILL  # Required for LocalAI to stop backend processes
  seccompProfile:
    type: RuntimeDefault

Without CAP_KILL, the process.Stop() call in LocalAI's go-processmanager library cannot deliver the SIGTERM/SIGKILL signals it sends to child processes, leaving them orphaned and holding GPU memory.
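
For context, here is a minimal sketch of where this snippet sits in a full Deployment spec; the image tag, port, and resource names are illustrative rather than taken from the LocalAI Helm charts:

apiVersion: apps/v1
kind: Deployment
metadata:
  name: local-ai
spec:
  replicas: 1
  selector:
    matchLabels:
      app: local-ai
  template:
    metadata:
      labels:
        app: local-ai
    spec:
      containers:
        - name: local-ai
          image: localai/localai:latest  # illustrative image tag
          ports:
            - containerPort: 8080        # illustrative API port
          securityContext:
            allowPrivilegeEscalation: false
            capabilities:
              drop:
                - ALL
              add:
                - KILL  # required so backend child processes can be signalled
            seccompProfile:
              type: RuntimeDefault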

Original prompt

Problem

When running LocalAI in Kubernetes with restrictive security contexts (such as dropping all capabilities or using seccomp profiles), the backend child processes cannot be properly terminated when models are stopped. This results in:

  1. VRAM not being freed when stopping models
  2. Error messages like (deleteProcess) error while deleting process error=permission denied
  3. Child processes remaining alive and holding GPU memory

This issue was identified in #7958.

Root Cause

LocalAI uses syscall.SIGTERM and syscall.SIGKILL signals to terminate backend processes (via the go-processmanager library). When running in Kubernetes with restrictive security contexts that drop the CAP_KILL capability, the container cannot send signals to child processes.
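
As an illustration, a commonly recommended hardened security context like the following sketch drops CAP_KILL along with everything else, which is enough to trigger the failure (the fix is the add: KILL entry shown in the examples below):

securityContext:
  allowPrivilegeEscalation: false
  capabilities:
    drop:
      - ALL  # CAP_KILL is dropped too; signal delivery to backend processes then fails
  seccompProfile:
    type: RuntimeDefault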

Solution

Update the Kubernetes documentation (docs/content/getting-started/kubernetes.md and docs/content/installation/kubernetes.md) to include:

  1. A new section on Security Context Requirements explaining that LocalAI needs the CAP_KILL capability to properly manage backend processes
  2. Example deployment YAML showing the correct security context configuration
  3. Troubleshooting information for users who encounter the "permission denied" error when stopping models

Proposed Documentation Changes

Add a new section to the Kubernetes docs that includes:

Security Context Requirements

LocalAI spawns child processes to run model backends (e.g., llama.cpp, diffusers). To properly stop these processes and free resources like VRAM, LocalAI needs permission to send signals to its child processes.

If you're using restrictive security contexts, ensure the CAP_KILL capability is available:

securityContext:
  allowPrivilegeEscalation: false
  capabilities:
    drop:
      - ALL
    add:
      - KILL  # Required for LocalAI to stop backend processes
  seccompProfile:
    type: RuntimeDefault

Troubleshooting

Issue: VRAM is not freed when stopping models, and logs show error while deleting process error=permission denied

Cause: The container lacks permission to send signals to child processes. This commonly happens when:

  • All capabilities are dropped without adding back CAP_KILL
  • Using user namespacing (hostUsers: false) with certain configurations
  • Overly restrictive seccomp profiles

Solution: Add the KILL capability to your container's security context as shown above. If running in privileged mode works but the above doesn't, check your cluster's Pod Security Policies or Pod Security Standards for additional restrictions.
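
As a sketch of that last point: the "restricted" Pod Security Standard only permits adding back the NET_BIND_SERVICE capability, so a namespace enforcing it will reject a pod that adds KILL; such pods need to run under at most the "baseline" level (the namespace name here is illustrative):

apiVersion: v1
kind: Namespace
metadata:
  name: local-ai
  labels:
    pod-security.kubernetes.io/enforce: baseline  # "restricted" would reject add: KILL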

Files to Modify

  • docs/content/getting-started/kubernetes.md
  • docs/content/installation/kubernetes.md (if it exists separately, keep them in sync)

This pull request was created from Copilot chat.



netlify bot commented Jan 10, 2026

Deploy Preview for localai ready!

🔨 Latest commit: a148070
🔍 Latest deploy log: https://app.netlify.com/projects/localai/deploys/69629ef86fc4c70008df987d
😎 Deploy Preview: https://deploy-preview-7961--localai.netlify.app

Copilot AI changed the title from "[WIP] Update Kubernetes documentation for process termination in LocalAI" to "Document CAP_KILL requirement for Kubernetes deployments with restrictive security contexts" on Jan 10, 2026
Copilot AI requested a review from mudler January 10, 2026 18:49