Make IQ1_M work for QK_K = 64 #6327

ikawrakow · 2024-03-26T18:22:43Z

As with all other i-quants, AVX2, ARM_NEON, CPU scalar, Metal. CUDA will come later.

ikawrakow · 2024-03-27T07:47:20Z

@ggerganov Perhaps you should disable the nix build? I don't know about you, but for me a check running for 6 hours and eventually cancelled on every commit does not make much sense. If nothing else, lets have some merci with our planet.

ggerganov · 2024-03-27T08:00:04Z

@SomeoneSerge Is there something to be done to speed-up the builds? AFAICT, with the recent workflow concurrency changes (#6243) all Nix builds are bound to be cancelled since the chance of committing something to master within 6h is quite large and this would cancel all running workflows

SomeoneSerge · 2024-03-27T13:24:15Z

@ggerganov thanks for the heads-up; I noticed a few cancelled builds but haven't got around to investigate this. I opened a tracking issue for now: #6346

mscheong01 · 2024-03-28T07:02:26Z

If we don't want nix builds to fail on master, we could exempt the master branch as stated in the #6243 description

But, if we don't want this to happen with our master branch workflows, we can make an exception. Here's how we could set it up:

concurrency: 
  group: ${{ github.workflow }}-${{ github.ref }}-${{ github.event.inputs.sha }}
  cancel-in-progress: true

* iq1_m: make it work for QK_K = 64 (WIP) * iq1_m: make it work for QK_K = 64 (scalar and AVX2) * iq1_m: QK_K = 64 seems to work on Metal and ARM_NEON --------- Co-authored-by: Iwan Kawrakow <[email protected]>

Kawrakow added 3 commits March 26, 2024 18:39

iq1_m: make it work for QK_K = 64 (WIP)

e1939bc

iq1_m: make it work for QK_K = 64 (scalar and AVX2)

5c953a1

iq1_m: QK_K = 64 seems to work on Metal and ARM_NEON

b0d0bdd

ggerganov approved these changes Mar 27, 2024

View reviewed changes

ikawrakow merged commit cbc8343 into master Mar 27, 2024
57 of 58 checks passed

ikawrakow deleted the ik/iq1m_64 branch March 27, 2024 07:44

SomeoneSerge mentioned this pull request Mar 27, 2024

nix: ci: fit into the new limits #6346

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Make IQ1_M work for QK_K = 64 #6327

Make IQ1_M work for QK_K = 64 #6327

ikawrakow commented Mar 26, 2024

ikawrakow commented Mar 27, 2024

ggerganov commented Mar 27, 2024

SomeoneSerge commented Mar 27, 2024

mscheong01 commented Mar 28, 2024 •

edited

Make IQ1_M work for QK_K = 64 #6327

Make IQ1_M work for QK_K = 64 #6327

Conversation

ikawrakow commented Mar 26, 2024

ikawrakow commented Mar 27, 2024

ggerganov commented Mar 27, 2024

SomeoneSerge commented Mar 27, 2024

mscheong01 commented Mar 28, 2024 • edited

mscheong01 commented Mar 28, 2024 •

edited