[Performance] PyTorch (MPS) is faster than MLX in backward of convolution layer #1313
Same benchmark on an M2 Ultra:
On an M2 Pro:
On an M3 Max:
Thanks for the benchmarks, everyone! There is clearly an unexpected performance cliff on M1 machines here, as MLX is substantially faster on M2 and later. We'll need to take a deeper look to figure out where it's coming from.
On an M1:
M3 Max:
Describe the bug
I recently profiled neural-network layer performance in MLX and compared it with PyTorch. Although MLX's forward pass is consistently faster than PyTorch's, on some chips (M1 Pro, M1 Max) PyTorch is much faster (3x to 6x) for convolution forward + backward, while on other chips such as the M3 Max, MLX is faster than PyTorch.
To Reproduce
To reproduce this, I have attached two minimal examples. The networks consist of just a few convolution layers. Run the two scripts to verify the performance difference.
time_pytorch_mlx.zip
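The attached scripts are not inlined above, so as a rough sketch (not the author's actual code): a framework-agnostic timing harness like the one below can wrap one forward + backward step from each framework and report a median wall-clock time. The `step` workload here is a pure-Python placeholder standing in for the real convolution step.

```python
import time
import statistics

def benchmark(fn, *, warmup=5, iters=20):
    """Run `fn` a few untimed warmup iterations (JIT/cache effects),
    then return the median wall-clock seconds over `iters` timed runs."""
    for _ in range(warmup):
        fn()
    samples = []
    for _ in range(iters):
        t0 = time.perf_counter()
        fn()
        samples.append(time.perf_counter() - t0)
    return statistics.median(samples)

# Placeholder workload; in the real benchmark this would be one
# conv forward + backward step in MLX or PyTorch.
def step():
    sum(i * i for i in range(10_000))

median_s = benchmark(step)
print(f"median step time: {median_s * 1e3:.3f} ms")
```

One caveat when timing the real frameworks: MLX evaluates lazily, so the step must force evaluation (e.g. `mx.eval(...)` on the gradients) before the timer stops, and PyTorch's MPS backend runs asynchronously, so a synchronization call (e.g. `torch.mps.synchronize()`) is needed for the same reason; otherwise both measurements mostly capture kernel-launch overhead.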