I've stopped using QIIME2 entirely, so be warned if anything below doesn't work for newer (2019/06) QIIME2 versions.
QIIME2 16S wrapper Implemented for QIIME2 2018.8
Runs a complete OTU analysis on your sequence data based on a range of optimizable parameters, such as trimming and truncate length, annotation identity, and cluster parameters.
This is a side project on my Masters Degree, and i will be implementing features and user-friendliness as i go. Any questions or suggestions, you can easily reach me on the e-mail [email protected], with the subject "q2pipe: your subject"
- Conda package manager package manager that installs QIIME2 easily;
- QIIME2 installed via conda, the main analysis package;
- BIOM format for data wrangling;
- PICRUSt metagenome predictor;
- Analysis parameters, such as name, trimming and truncation values, max. expected errors, which algorithm to use for annotation, and such.
- A manifest text file for data;
- A sample metadata text file;
- Your paired-end sequences.
- Imports your data as a QIIME2 artifact;
- Denoise it using the DADA2 pipeline;
- Classifies your data using either BLAST or VSEARCH, against the latest Greengenes and SILVA database;
- Run core diversity and basic phylogenetic analysis based on QIIME2
- Predict the metagenomes with PICRUSt;
- Reports all results as HTML, plain text and a usable BIOM-taxonomy file.
- (WIP) Run our custom RScript with the main diversity indexes.
- Turn on your qiime2 (usually with
source activate qiime2-2018.8
) - Create a manifest file, with the name manifest.csv, then put it inside the intel directory.
- Run q2manifest to import your data as a QIIME2 artifact.
- Open the demux-summary.html in a web browser, and pick parameters suitable to your sequences.
- Write these parameters on intel/parameters.txt
- Run q2pipe and then q2picrust.
And that's it.
- Add user friendly description and help messages
- Error control for core wrapper (manifest, preproc, pipe and picrust)
- Missing parameters;
- Missing files;
- Missing databases;
- (done) Annotation with VSEARCH/BLAST over SILVA132 and the latest Greengenes.
- (done) Core diversity analysis with QIIME2.
- (WIP) Update to QIIME-2011-11.
- Implement shi7 to test over parameters.
- Merge-Denoise-Deblur optimized preprocess pipeline.
- Automatic q2-feature-classifier.
- Generate numerical ecology analysis based on metadata columns, on a custom RScript.
- Implement for ITS regions.
- QIIME2.
- I can't thank enough Edgar at Drive5 (https://drive5.com/about.html) for many, many insights over the years.
- DADA2 sequence denoise pipeline: Callahan, B. J., McMurdie, P. J., Rosen, M. J., Han, A. W., Johnson, A. J. A., & Holmes, S. P. (2016). DADA2: high-resolution sample inference from Illumina amplicon data. Nature methods, 13(7), 581.
- VSEARCH for sequence clustering and feature classification: Rognes, T., Flouri, T., Nichols, B., Quince, C., & Mahé, F. (2016). VSEARCH: a versatile open source tool for metagenomics. PeerJ, 4, e2584.
- BLAST also used for feature classification: Camacho, C., Coulouris, G., Avagyan, V., Ma, N., Papadopoulos, J., Bealer, K., & Madden, T. L. (2009). BLAST+: architecture and applications. BMC bioinformatics, 10(1), 421.
- PICRUSt for metagenome prediction: Langille, M. G., Zaneveld, J., Caporaso, J. G., McDonald, D., Knights, D., Reyes, J. A., ... & Beiko, R. G. (2013). Predictive functional profiling of microbial communities using 16S rRNA marker gene sequences. Nature biotechnology, 31(9), 814.