Replies: 2 comments 2 replies
-
I have searched Issue and found #418, but it is outdated. |
Beta Was this translation helpful? Give feedback.
1 reply
-
Did you ever find a solution to this? I'm currently having the same issue where its just cpu being used not gpu. |
Beta Was this translation helpful? Give feedback.
1 reply
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
I'm using khoj on WSL where both
nvidia_smi
andtorch.cuda.is_available()
works.I installed khoj using
CMAKE_ARGS="DLLAMA_CUDA=on" FORCE_CMAKE=1 python -m pip install khoj
and no error occurred.However, when I try to ask something, the CPU Utilization will be 100% and it takes over 100s to generate the answer while GPU Utilization stays low.
Is there anything I can do or is there any option I should enable to use GPU?
Beta Was this translation helpful? Give feedback.
All reactions