Skip to content

Fix performance issue & bug on mps (Apple Silicon)#25

Open
sunnyden wants to merge 3 commits intoNVIDIA:mainfrom
sunnyden:patch-1
Open

Fix performance issue & bug on mps (Apple Silicon)#25
sunnyden wants to merge 3 commits intoNVIDIA:mainfrom
sunnyden:patch-1

Conversation

@sunnyden
Copy link

@sunnyden sunnyden commented Jan 25, 2026

On macOS the inference will have some bug and performance issue when running with args --device mps.

This PR fixed the issue.

@sunnyden sunnyden mentioned this pull request Jan 25, 2026
1 task
@sunnyden sunnyden changed the title Fix inference issue on mps(Apple Silicon) Fix performance issue & bug on mps (Apple Silicon) Jan 25, 2026
alex-audible added a commit to alex-audible/personaplex that referenced this pull request Jan 28, 2026
- Apply PR NVIDIA#25 fixes for MPS compatibility:
  - lm.py: add map_location to torch.load for voice prompt embeddings
  - transformer.py: replace index_copy_ with direct assignment on MPS
- Add MPS performance analysis (optimisation.md, optimisation_ultrathink.md)
  identifying 12 bottlenecks causing 5x slower-than-real-time inference
- Add comprehensive MLX port plan (mlxport.md) with 6-stage approach,
  performance logging framework, and baseline-first methodology using
  existing test assets

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant