-
Notifications
You must be signed in to change notification settings - Fork 234
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Any changes on repo level concatenation? #22
Comments
V2 switched from topological sorting concatenation to random concatenation mainly because random concatenation is more language-friendly. |
Thanks for answer. Can I ask few more questions?
|
|
Thanks a lot! |
V1 mention, just 4 lang deal by repo-level dependence deal, why V2 not keep the 4 lang in V1? |
can you give a explantion for me? am i right with the thought |
Dependencies between files are important. When randomly concatenating files, some cases may satisfy these dependencies and can help improve the performance in very long codes. Other cases may contribute to the robustness of the model because not all application scenarios provide complete dependencies. |
so remove file topological graph is not benificial inall? is this confirmed by experiments?
|
In our experiment, we did not observe significant differences between random ordering and topological sorting on the benchmarks. |
Why deepdeek coder. V1. Mention. Toplogical. Graph. As. A. Good. Point? |
The Topological Graph is a good way to organize repo-level data, as you believe that "the biggest advantage of dependent files is alleviating hallucination and reasoning." However, based on our current experiments, the improvement of organizing repo-level data using a topological graph compared to random ordering on benchmarks is marginal, with no significant differences. Given the resources required to use topological sorting to parse hundreds of programming languages, random ordering is more efficient. |
thanks,i see |
thanks |
In deepseek coder v1, I was able to find a detailed explanation of repo level concat in an issue.
Has anything changed from the method used in v1 to v2?
The text was updated successfully, but these errors were encountered: