
An error occurred while analyzing the dataset (out-of-memory) #1328

Closed
Youssef1313 opened this issue Nov 23, 2023 · 4 comments
Youssef1313 commented Nov 23, 2023

I'm analyzing a simple 1.5 MB zip file consisting of Java files, but I'm getting an out-of-memory error.

rien (Member) commented Nov 23, 2023

Hi @Youssef1313, how many files are you analyzing? How big are they?

Do you get this problem with the web server? Or using the CLI?

Youssef1313 (Author)

@rien Using the web server. 1461 Java files.

rien (Member) commented Nov 23, 2023

I've taken a closer look at the logs, and it seems the submissions you want to analyze contain multiple files per submission.

We currently don't support grouping files together, so each file is compared with every other file instead of with other groups; with 1461 files, that's over a million pairwise comparisons. This requires a lot of memory and would give sub-par results.

In the meantime (until #1121 is implemented), you could try concatenating all files that belong to the same submission into one file (if you're using Linux: `cat **/*.java > combined.java`).
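
A minimal sketch of doing this per submission, assuming a hypothetical layout with one subdirectory per submission under `submissions/`:

```bash
#!/usr/bin/env bash
# Hypothetical layout: submissions/<name>/ holds each submission's files.
shopt -s globstar nullglob   # ** matches recursively; empty globs expand to nothing
mkdir -p combined
for dir in submissions/*/; do
  name=$(basename "$dir")
  # Merge every .java file in this submission into a single file,
  # so the analyzer sees one document per submission.
  cat "$dir"**/*.java > "combined/$name.java"
done
```

Note that `**` needs bash's globstar option (enabled above); on other shells, `find "$dir" -name '*.java' -exec cat {} +` achieves the same.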

Let me know if you need more help with this.

Youssef1313 (Author)

Great. Thanks!
