Semsimian calculation taking huge amount of time and resources #115
Comments
One week, oof!
@souzadevinicius let's try a 0.4 Jaccard threshold and remove the labelling options, and see if that at least makes it possible to run HP-ZP.
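For anyone following along, here is a minimal sketch of what a Jaccard cutoff does in principle: pairs whose ancestor sets overlap too little are skipped before any more expensive scoring or output writing. This is an illustration only, not the semsimian or OAK implementation; the function names, the `min_jaccard` parameter, and the toy term IDs are all hypothetical.

```python
from itertools import product


def jaccard(a: set, b: set) -> float:
    """Jaccard similarity |a & b| / |a | b|; taken as 0.0 when both sets are empty."""
    union = a | b
    if not union:
        return 0.0
    return len(a & b) / len(union)


def pairs_above_threshold(subjects, objects, ancestors, min_jaccard=0.4):
    """Yield (subject, object, score) only for pairs that meet the cutoff."""
    for s, o in product(subjects, objects):
        score = jaccard(ancestors[s], ancestors[o])
        if score >= min_jaccard:
            yield s, o, score


# Toy ancestor sets with made-up IDs (assumption: not real HP/ZP terms).
ancestors = {
    "HP:A": {"HP:A", "UPHENO:1", "UPHENO:2", "UPHENO:3"},
    "ZP:B": {"ZP:B", "UPHENO:1", "UPHENO:2", "UPHENO:3"},
    "ZP:C": {"ZP:C", "UPHENO:9"},
}

kept = list(pairs_above_threshold(["HP:A"], ["ZP:B", "ZP:C"], ancestors))
```

In this toy case only the HP:A / ZP:B pair survives (3 shared ancestors out of 5 in the union), so a higher cutoff shrinks the output roughly in proportion to how many weakly related pairs the two term sets contain.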
Ok
What's the status of this?
Discussing this in the MWF hackathon now. We were thinking we would deploy semsimian/oak on our build server and run it on a regular cadence. This way we have an objective measure of how much memory and time we are talking about here, and we can also emit a new artifact with a PURL so people can use it downstream. @caufieldjh perhaps we already have a repo to do this?
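One way to get that kind of objective measurement is to wrap the job in a small timing script. The sketch below assumes a Unix-like build server (Python's `resource` module is not available on Windows); the `measure` helper is hypothetical and the command it runs here is only a placeholder, not the actual similarity command.

```python
import resource
import subprocess
import sys
import time


def measure(cmd):
    """Run cmd and return (exit_code, wall_seconds, peak_rss_of_children)."""
    start = time.perf_counter()
    proc = subprocess.run(cmd, check=False)
    elapsed = time.perf_counter() - start
    # ru_maxrss is the largest resident set size among waited-for child
    # processes; it is reported in kilobytes on Linux and in bytes on macOS.
    peak = resource.getrusage(resource.RUSAGE_CHILDREN).ru_maxrss
    return proc.returncode, elapsed, peak


# Placeholder job (assumption): substitute the real build command here.
code, seconds, peak_rss = measure([sys.executable, "-c", "print('placeholder job')"])
print(f"exit={code} wall={seconds:.2f}s peak_rss={peak_rss}")
```

Logging these three numbers on every scheduled run would give a simple history of how time and memory change between builds.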
Ah okay, Harry has already made a repo for this here
Sorry I'm a little late to the party, but @souzadevinicius, did you run this without
The last build in Aug '23 took 1h and 18m.
Yep, good question @hrshdhgd. Harry says a previous build with auto-label turned on took 15h, so this might be at least one thing that is slowing down Vinicius's run.
Note that the Jenkins build performed by that repo takes a bit over 1 hr without autolabel and ~15 hrs with autolabel.
For reasons not entirely clear to me, this build took 3 hours. Here's the command:
That's with:
The product:
I'm trying to calculate semantic similarity profiles using the Phenio ontology, comparing different term sets.
Ontology used: Phenio
Library versions
Command line execution example:
I tried to run these experiments locally (on machines with 32 and 64 GB of RAM) and on an HPC cluster, where the output-writing process ran for more than a week and was then killed.