Potential Bug: GDC Prepare does not work for breast cancer data #619

Open
@fabianjkrueger

Hello!

There seems to be an issue with preparing certain data sets for analysis. It's weird, since it works for some of the projects but not for others. One of the projects causing issues is breast cancer ("BRCA"). I queried and downloaded the data for the different projects with a script like the one shown below.

mutationQueryBRCA <- GDCquery(project = "TCGA-BRCA",
                              data.category = "Simple Nucleotide Variation",
                              data.type = "Masked Somatic Mutation")

# download the files for this query into dl_path
GDCdownload(mutationQueryBRCA, directory = dl_path)

# this is the step that just won't work for breast cancer...
mutationDataBRCA <- GDCprepare(mutationQueryBRCA, # specify which query to use
                               save = TRUE, # save the output as a file
                               save.filename = file.path(prepared_path, "BRCA_SNVMSM.RData"),
                               directory = dl_path, # directory where downloaded files are stored
                               remove.files.prepared = FALSE)

All paths are stored in variables, so this is not the issue. This code works for almost all the other cancer types, for example colon adenocarcinoma (project "COAD").

This is the error message I get:

Error in `dplyr::bind_rows()`:
! Can't combine `..151$Tumor_Seq_Allele2` <character> and `..152$Tumor_Seq_Allele2` <logical>.
Backtrace:
 1. TCGAbiolinks::GDCprepare(...)
 2. TCGAbiolinks:::readSimpleNucleotideVariationMaf(files)
 3. purrr::map_dfr(...)
 4. dplyr::bind_rows(res, .id = .id)

To me, it looks like there is a problem with data types, but I don't know how to fix it.
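
To illustrate the kind of type mismatch the error message describes, here is a minimal sketch using made-up data. It assumes (this is not confirmed for the actual BRCA files) that one file's Tumor_Seq_Allele2 column contains only the value "T", which a type-guessing reader can interpret as the logical TRUE. It uses base R's read.delim only for demonstration and is not the reader that TCGAbiolinks uses internally. The gene names and alleles are placeholders.

```r
library(dplyr)

# Made-up MAF-like snippets; the second one has only "T" in the allele column.
file_a <- "Hugo_Symbol\tTumor_Seq_Allele2\nTP53\tA\nKRAS\tG\n"
file_b <- "Hugo_Symbol\tTumor_Seq_Allele2\nPIK3CA\tT\n"
files  <- list(file_a, file_b)

# With type guessing, a column holding only "T" is read as logical TRUE.
guessed <- lapply(files, function(txt) {
  read.delim(text = txt, stringsAsFactors = FALSE)
})
guessed_classes <- sapply(guessed, function(df) class(df$Tumor_Seq_Allele2))

# Binding the two tables then fails: character and logical cannot be combined.
combine_error <- tryCatch(
  {
    bind_rows(guessed)
    NA_character_
  },
  error = function(e) conditionMessage(e)
)

# Reading every column as character avoids the guess, so the bind succeeds.
as_text <- lapply(files, function(txt) {
  read.delim(text = txt, colClasses = "character")
})
combined <- bind_rows(as_text)
```

If this is what happens with the BRCA files, the mismatch would depend on the content of individual files, which might explain why other projects such as COAD are not affected.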

Is there anything else I might be missing? Are there temporary files that depend on loading a specific library for reading them? If not, there might be a bug.
