Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Doc suggestion #180

Open
wsnoble opened this issue May 17, 2024 · 6 comments
Open

Doc suggestion #180

wsnoble opened this issue May 17, 2024 · 6 comments

Comments

@wsnoble
Copy link

wsnoble commented May 17, 2024

I'm reading the quickstart, and one minor thing that I think would be very helpful is if this documentation mentioned, for each option, what the default value is if you don't specify it.

@wsnoble
Copy link
Author

wsnoble commented May 17, 2024

Also, FYI, this sentence is hard to understand because split-sequences is not mentioned elsewhere in the quick-start: " Therefore, it is best to combine –minibatch-fraction with –split-sequences. "

@wsnoble
Copy link
Author

wsnoble commented May 17, 2024

After further searching the docs, I can't find any explanation of what split-sequences does or what value it takes. Can someone please explain this?

@EricR86
Copy link
Member

EricR86 commented May 22, 2024

--split-sequences puts an upper limit on the size of the windows used for training and inference in base-pairs. The default is 2000000 bp.

For more details see:
https://segway.readthedocs.io/en/latest/technical.html#memory-usage

The defaults for all options are both in the --help command line output as well in the command-line usage summary portion of the docs.

@ChelseyLin3
Copy link

Does the original resolution of genomedata have any impact on the --split-sequences function?

@EricR86
Copy link
Member

EricR86 commented May 22, 2024

No there is no inherent "resolution" for Genomedata. --split-sequences effectively will split at base-pair boundaries regardless of whatever underlying resolution the dataset originates from.

@ChelseyLin3
Copy link

Thank you!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants