update hitaggregator function by ccrobertson · Pull Request #211 · databio/pepatac

ccrobertson · 2022-01-18T22:04:20Z

Here is a modified version of the hit_aggregator() function that does the following:

(1) takes all peak regions from all narrowPeak files in the project and defines a set of k non-overlapping regions using GenomicRanges “reduce”
(2) assigns each observed peak across narrowPeak files to one of the k non-overlapping regions defined in (1)
(3) for each of the k non-overlapping regions, finds the observed peak with the highest score

The function returns a list of k regions whose boundaries are those of the highest-scoring peaks found in step (3).
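To make the three steps concrete, here is a minimal, language-agnostic sketch in plain Python. It is not the R code from this PR: the `Peak` tuple and the `reduce_regions` and `consensus_peaks` names are hypothetical, coordinates are assumed to be 0-based half-open as in narrowPeak/BED, and the merge step only imitates what GenomicRanges `reduce` does (merging overlapping and touching ranges).

```python
from typing import NamedTuple


class Peak(NamedTuple):
    chrom: str
    start: int    # assumed 0-based, half-open, as in narrowPeak/BED
    end: int
    score: float


def reduce_regions(peaks):
    """Step 1: merge overlapping or touching peaks into disjoint regions."""
    regions = []
    for p in sorted(peaks, key=lambda p: (p.chrom, p.start, p.end)):
        if regions and regions[-1][0] == p.chrom and p.start <= regions[-1][2]:
            chrom, start, end = regions[-1]
            regions[-1] = (chrom, start, max(end, p.end))
        else:
            regions.append((p.chrom, p.start, p.end))
    return regions


def consensus_peaks(peaks):
    """Steps 2 and 3: keep the highest-scoring observed peak per region."""
    regions = reduce_regions(peaks)
    best = {}
    for p in peaks:
        # Step 2: every observed peak lies inside exactly one merged region.
        region = next(
            r for r in regions
            if r[0] == p.chrom and r[1] <= p.start and p.end <= r[2]
        )
        # Step 3: remember the best-scoring peak seen for that region.
        if region not in best or p.score > best[region].score:
            best[region] = p
    return [best[r] for r in regions]


peaks = [
    Peak("chr1", 100, 200, 5.0),
    Peak("chr1", 150, 260, 9.0),
    Peak("chr1", 400, 450, 3.0),
    Peak("chr2", 100, 200, 7.0),
]
result = consensus_peaks(peaks)
```

Because each returned peak lies inside its own disjoint region, the returned set cannot contain overlaps. The linear scan in step 2 is kept for readability; an overlap query such as `findOverlaps` would be the natural choice in R.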

ccrobertson · 2022-01-18T22:21:06Z

I'm not sure my modification is actually what you want "hit_aggregator" to do. This was my attempt to ensure that the resulting consensus peak set is non-overlapping (currently I find that peaks sometimes overlap). An alternative approach would be to apply "reduce" to the results of the original implementation at the end, instead of defining a disjoint set up front.
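For comparison, the alternative of reducing at the end can be sketched the same way. This is again plain Python rather than the R implementation: `reduce_intervals` is a hypothetical helper, and the `consensus` list is made-up input standing in for an overlapping result from the original function.

```python
def reduce_intervals(intervals):
    """Merge overlapping or touching (chrom, start, end) intervals."""
    merged = []
    for chrom, start, end in sorted(intervals):
        if merged and merged[-1][0] == chrom and start <= merged[-1][2]:
            # Extend the previous interval to cover this one.
            merged[-1] = (chrom, merged[-1][1], max(merged[-1][2], end))
        else:
            merged.append((chrom, start, end))
    return merged


# Hypothetical consensus set in which the first two peaks overlap.
consensus = [("chr1", 100, 200), ("chr1", 180, 300), ("chr1", 500, 600)]
reduced = reduce_intervals(consensus)
```

One difference worth weighing: a merged interval spans the union of the peaks it absorbed, so its boundaries no longer match any single observed peak, whereas the up-front approach keeps the boundaries of the highest-scoring peak.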

update hitaggregator function

cdac324

nsheff requested a review from jpsmith5 July 7, 2022 19:05

ccrobertson wants to merge 1 commit into databio:master from ccrobertson:hitaggregator
