CodeQwen returns extra white space for code completion #1947
Comments
Thanks for reporting the issue - I have also observed it and am looking into debugging it. It seems that the tokenizer treats
Can confirm this is present in upstream llama.cpp as well; cross-posted at ggerganov/llama.cpp#7050
Yeah, the extra white space also appears when serving with vLLM, so it is a model issue rather than a serving-framework issue. The only workaround I can think of is to shift the prompt boundary left by a few characters and then check the completion result against the overlapping substring, as in the sketch below.
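A minimal sketch of that overlap-based workaround (the function name and shift size are made up for illustration, not part of Tabby's API): cut the prompt a few characters before the real cursor, then strip the longest prefix of the completion that repeats text already in the buffer.

```python
def dedup_completion(prefix: str, completion: str, shift: int = 8) -> str:
    """Drop the part of `completion` that repeats the tail of `prefix`.

    Assumes the prompt sent to the model was cut `shift` characters before
    the real cursor, so the model re-generates those characters and we
    remove the longest overlapping prefix before inserting the rest.
    """
    tail = prefix[-shift:]
    # Try the longest possible overlap first, down to a single character.
    for k in range(min(len(tail), len(completion)), 0, -1):
        if completion.startswith(tail[-k:]):
            return completion[k:]
    return completion


# Hypothetical usage: the editor buffer ends with "    retu" and the model,
# prompted up to 8 characters before the cursor, returns "    return x".
print(dedup_completion("def f(x):\n    retu", "    return x"))  # -> "rn x"
```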
Observed that CodeQwen does not always return extra white space, and sometimes the leading white space is meaningful. Therefore, simply trimming the leading white space may not be a viable approach; see the example below.
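For example (a made-up snippet, not taken from actual model output): when the cursor sits at the start of an empty line inside an indented block, the completion's leading spaces are the indentation itself, so stripping them unconditionally breaks the code.

```python
prefix = "def relu(x):\n    if x < 0:\n        return 0\n"
completion = "    return x"           # the leading spaces ARE the indentation
print(prefix + completion.lstrip())   # wrong: "return x" loses its indent
print(prefix + completion)            # correct
```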
Other models, such as DeepSeek, work without this problem.