Skip to content

mrrlab/fish-variability-across-organs

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

33 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Inter-individual gene expression variability implies stable regulation of brain-biased genes across organs

Christabel F. Bucao1,2*, Consolée Aletti1, Sébastien Moretti1,2, Andrew W. Thompson3,4,5, Brett L. Racicot4, Catherine A. Wilson6, Julien Bobe7, Ingo Braasch4,5, Yann Guiguen7, John H. Postlethwait6, Marc Robinson-Rechavi1,2

1 Department of Ecology and Evolution, University of Lausanne, Lausanne, Switzerland
2 SIB Swiss Institute of Bioinformatics, Lausanne, Switzerland
3 Department of Biological Sciences, Western Michigan University, Kalamazoo, Michigan, USA
4 Department of Integrative Biology, Michigan State University, East Lansing, Michigan, USA
5 Ecology, Evolution, and Behavior Program, Michigan State University, East Lansing, Michigan, USA
6 Institute of Neuroscience, University of Oregon, Eugene, Oregon, USA
7 INRAE, LPGP, Rennes 35000, France

* Corresponding author: [email protected]

Abstract

Phenotypic variation among individuals plays a key role in evolution, since variation provides the material on which natural selection can act. One important link between genetic and phenotypic variation is gene expression. As for other phenotypes, the range of accessible expression variation is limited and biased by different evolutionary and developmental constraints. Gene expression variability broadly refers to the tendency of a gene to vary in expression (i.e., between individuals or cells) due to stochastic fluctuations or differences in genetic, epigenetic, or environmental factors, separately from the differences between e.g. organs. Variability due to biomolecular stochasticity (transcriptional ‘noise’) and cell-to-cell heterogeneity has been well-studied in isogenic populations of unicellular organisms such as bacteria and yeasts. However, for more complex organisms with multiple cells, tissues, and organs sharing the same genetic background, the interplay between inter-individual expression variability, gene and organ function, and gene regulation remains an open question. In this study, we used highly multiplexed 3’-end Bulk RNA Barcoding and sequencing (BRB-seq) to generate transcriptome profiles spanning at least nine organs in outbred individuals of three ray-finned fish species: zebrafish, Northern pike, and spotted gar. For each condition, we measured expression variation per gene independent of mean expression level. We observed that lowly variable genes are enriched in cellular housekeeping functions whereas highly variable genes are enriched in stimulus-response functions. Furthermore, genes with highly variable expression between individuals evolve under weaker purifying selection at the coding sequence level, indicating that intra-species gene expression variability predicts inter-species protein sequence divergence. Genes that are broadly expressed across organs tend to be both highly expressed and lowly variable between individuals, whereas organ-biased genes are typically highly variable within their top organ of expression. For genes with organ-biased expression profiles, we inferred differences in selective pressure on gene regulation depending on their top organ. We found that genes with peak expression in the brain have low inter-individual expression variability across non-nervous organs, suggesting stabilizing selection on regulatory evolution of brain-biased genes. Conversely, liver-biased genes have highly variable expression across organs, implying weaker regulatory constraints. These patterns show that gene regulatory mechanisms evolved differently based on constraints on the primary organ.

Directory Structure

  • config/: Contains YAML file indicating package versions for conda environment

  • data/: Contains input data

    • counts/: Contains counts and UMI-deduplicated counts. Currently under embargo and will be made available upon acceptance for publication.
    • gene_metadata/: Contains gene biotype information from Ensembl
    • sample_metadata/: Contains sample metadata files for each species
    • selectome/: Contains selection statistics from the Selectome database
  • results/: Contains output files sorted by subfolders labeled after each step of the analysis pipeline. Only R notebook HTML files are available on the Git repository, please check Zenodo for R data files.

    • run_pipeline.Rdata: Contains all parameters used for each step of the analysis pipeline
  • workflow/: Contains scripts used for the analysis pipeline

    • analysis/: Contains all steps of the analysis pipeline, available as .Rmd files
    • functions/: Contains all functions used for analysis/
    • renv/: Used for package management in R
    • run_pipeline.R: Runs all the steps under analysis/
    • run_go_figure.sh: Runs GO-Figure! 1.0.0 (downloaded separately)
    • demultiplex_brbseq_fastq.sh: Used for demultiplexing BRB-seq fastq files using BRB-seqTools 1.6.1 (downloaded separately) for uploading to NCBI SRA
    • rename_fastq_files.sh: Used for renaming demultiplexed fastq files by mapping each barcode to their corresponding sample name
    • renv.lock: Lockfile for managing R package versions. Run renv::restore() to set up the R environment based on packages specified in the lockfile. All package versions used are also specified in the output HTML files under results/.

Species Codes

  • LOC: Lepisosteus oculatus (spotted gar)
  • ELU: Esox lucius (Northern pike)
  • DRE: Danio rerio (zebrafish)

Links

2024-11-12: Check out the preprint!

DOI

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • R 96.2%
  • Shell 3.8%