Skip to content

🌳Tool to fetch seqs metadata from NCBI to annotate phylogenetic trees

License

Notifications You must be signed in to change notification settings

iliapopov17/phyloki

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

45 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Python3 Biopython Pandas OS License

Phyloki simplifies phylogenetic tree annotation in microbiology and virology by fetching metadata from NCBI GenBank using accession numbers. It also reinstates organism names in trees constructed with IQ-TREE, retrieves host information about microorganisms, and prepares annotation datasets for further visualization in iTOL.

Table of contents

The Good 😎 The Bad 😒 The Ugly 🚮

Phyloki allows easy and simple annotation of phylogenetic trees. See the examples above:

  • The best tree contains information about the hosts from which the virus was isolated and the full names of the viruses.
  • The so-so tree contains the same information, but is colour annotated with randomly generated colours.
  • The worst tree contains only accession numbers on its leaves.

Features

Sequence Downloading

  • Facilitates the retrieval of sequences from NCBI GenBank using specified accession numbers.

Metadata Fetching

  • Downloads metadata from NCBI GenBank (Accession Number; Organism Name; Country; Year; Host)

Organism Name Reintegration

  • Enhances IQ-TREE constructed trees by replacing accession numbers with the corresponding organism names for clarity and context.

Host Information Retrieval

  • Gathers host data for each microorganism, including the host's taxonomic order.

Annotation Dataset Preparation for iTOL

  • Utilizes the collected host information to prepare detailed annotation datasets, optimizing visualization in iTOL.

Installation

pip install phyloki

Usage Guide

Demonstrational data is based on the recent paper about identifying novel hantavirus in bats

🔗 Visit Phyloki wiki page

Contributing

Contributions are welcome! If you have any ideas, bug fixes, or enhancements, feel free to open an issue or submit a pull request.

Contact

For any inquiries or support, feel free to contact me via email

Happy tree annotating! 🌳

About

🌳Tool to fetch seqs metadata from NCBI to annotate phylogenetic trees

Topics

Resources

License

Stars

Watchers

Forks