Assignees: Nelly and Collins
The 16S ribosomal RNA is necessary for the synthesis of all prokaryotic proteins.
Reproducibility is very vital in research as it ensures that results are the same no matter how many times a computational pipeline is executed. Workflow languages such as Nexflow and containers such as Docker or Singularity are key tools in ensuring reproducibility in research.
This mini-project aims to review existing microbiome workflows, identify great ones and extend the workflows where there are gaps, especially to make them useful in insect and pathogen data. Review the workflows using the following criteria:
- How easy are they to set up and use? Do they provide accessible documentation and tutorials?
- Are they fast and easily scalable based on available compute resources?
- Can they scale to the cloud?
- Can they be used on a variety of data, including insects and pathogen microbiome
- Are they implemented in the latest specifications and versions of the tools? For example, Whether the pipeline implements Nextflow DSL2 syntax and docker or singularity containers
- Are they well and regularly maintained? When were they updated last?
- https://github.com/nf-core/ampliseq
- https://h3abionet.github.io/H3ABionet-SOPs/16s-rRNA-1-0.html
- https://github.com/h3abionet/TADA
- https://github.com/mbbu/16S_Accreditation
- https://github.com/nanoporetech/pipeline-transcriptome-de/tree/paired_dge_dtu
- https://github.com/biocorecrg/master_of_pores
- https://github.com/h3abionet/h3abionet16S
- Create a Roadmap for the miniproject.
- Test existing 16srRNA pipelines.
- Address the identified gap.
- Document your work clearly on GitHub using wikis and GitHub pages.
- Document the papers you are reading, a link to the paper, and a sentence or two on why you included them.
N/B:
- Use test dataset from either of the listed pipelines.
- Demonstrate collaborative research skills, informative visualization, and report writing