Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

.ld file format issue. two additional columns #45

Open
jmhallas opened this issue May 9, 2023 · 5 comments
Open

.ld file format issue. two additional columns #45

jmhallas opened this issue May 9, 2023 · 5 comments

Comments

@jmhallas
Copy link

jmhallas commented May 9, 2023

Hello,

I ran ngsLD on a beagle file

ngsLD \
--geno allmdSub.beagle.gz \
--posH allmdSub.pos.gz \
--probs \
--n_ind $NIND \
--n_sites $NSITES \
--max_kb_dist 10 \
--min_maf 0.05 \
--n_threads 16 \
--outH ./allmdSub.ld


==> Input Arguments:
        geno: /share/cdfwwildlife/hallas_dedicated/lc_deer/angsd/allmdSub/allmdSub.beagle.gz
        probs: true
        log_scale: false
        n_ind: 204
        n_sites: 83611
        pos: /share/cdfwwildlife/hallas_dedicated/lc_deer/angsd/allmdSub/allmdSub.pos.gz (WITH header)
        max_kb_dist (kb): 10
        max_snp_dist: 0
        min_maf: 0.050000
        ignore_miss_data: false
        call_geno: false
        N_thresh: 0.000000
        call_thresh: 0.000000
        rnd_sample: 1.000000
        seed: 1681186263
        extend_out: false
        out: /share/cdfwwildlife/hallas_dedicated/lc_deer/ngsld/nallmdSub.ld (WITH header)
        n_threads: 16
        verbose: 1
        version: 1.1.1 (Mar 28 2023 @ 10:46:02)

==> GZIP input file (not BINARY)
> Reading data from file...
> Header found! Skipping line...
==> Calculating MAF for all sites...
==> Getting sites coordinates
==> Launching threads...
==> Waiting for all threads to finish...
==> Freeing memory...
Done!

my output has two additional columns

#site1	site2	dist	r^2_ExpG	D	D'	r^2
HiCscaffold1pilon:548766	390	HiCscaffold1pilon:552772	378	4006	0.008618	0.010649	0.204312	0.026252
HiCscaffold1pilon:544311	396	HiCscaffold1pilon:548766	390	4455	0.024345	0.047307	0.999452	0.318523
HiCscaffold1pilon:552807	356	HiCscaffold1pilon:552811	382	4	0.458927	0.055914	0.999964	0.431553
HiCscaffold1pilon:544239	387	HiCscaffold1pilon:544270	363	31	0.019952	-0.000612	0.033265	0.000030
HiCscaffold1pilon:138487	362	HiCscaffold1pilon:138500	371	13	0.000093	-0.010224	0.433973	0.008593
HiCscaffold1pilon:544311	396	HiCscaffold1pilon:552772	378	8461	0.010669	0.012214	0.163652	0.013607
HiCscaffold1pilon:153019	375	HiCscaffold1pilon:153035	387	16	0.823266	0.244860	0.999997	0.999810

I'm not sure what the second and fourth column represent. Any thoughts?

Thank you for your time,

--josh

@fgvieira
Copy link
Owner

what is the content of the file allmdSub.pos.gz?

@jmhallas
Copy link
Author

jmhallas commented May 10, 2023 via email

@fgvieira
Copy link
Owner

I think 2nd and 4th columns are the 3rd column in your pos file. Can you try removing the third column and run again?

@jmhallas
Copy link
Author

jmhallas commented May 10, 2023 via email

@fgvieira
Copy link
Owner

fgvieira commented Aug 7, 2023

Did it work?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants