pereiramemo / big-mex Goto Github PK
View Code? Open in Web Editor NEWBiG-MEx implementation as Docker images and R packages
License: GNU General Public License v3.0
BiG-MEx implementation as Docker images and R packages
License: GNU General Public License v3.0
Create repository dedicated to the phylogenetic placement of domain seqs. onto reference trees
Scripts should test number of arguments
In the wiki:
for i in $( echo "${SAMPLES}" ); do
sudo "${BIN}/run_bgc_dom_annot.bash" \
"${OUTPUT_DIR}/OSD${i}_ME_n_SE_shotgun_workable_merged.fastq.gz" \
"${OUTPUT_DIR}/out_dom_annot_osd${i}" \
--intype dna \
--nslots 2 \
--sample "osd${i}" \
--verbose t
echo "sample osd${i}"
done
and it returns:
WARN: Unknown option (ignored): --verbose
If a folder already exists give the option to overwrite as a parameter (useful when processing many files). Now it just exits without a clear message.
Hi,
I recently read your Biorxiv manuscript on BiG-MEx and really liked the methods you developed and found the subsequent analyses super interesting as well!
I am attempting to follow the workflows detailed in the wiki and perform ORF calling on my metagenome samples; however, I am unsure which FragGeneScan code base to use.
The official FragGeneScanPlus appears to have an unresolved issue related to translating of reverse frames, as is well described in the following ticket: hallamlab/FragGeneScanPlus#19 . A more up to date fork of this repo, called FragGeneScanPlusPlus [https://github.com/unipept/FragGeneScanPlusPlus], maintained by a different group, appears to resolve this issue and matches the output of the original FragGeneScan (by Rho et al 2010) for a test case (the one from the fore mentioned git issue ticket*).
Can you please confirm if FragGeneScanPlusPlus [https://github.com/unipept/FragGeneScanPlusPlus] is the program you recommend to users for metagenomic ORF calling?
Kind regards,
Rauf
Unable to find image 'epereira/ufbgctoolbox:bgc_dom_merge_div' locally
docker: Error response from daemon: manifest for epereira/ufbgctoolbox:bgc_dom_merge_div not found.
See 'docker run --help'.
it should be
epereira/ufbgctoolbox:bgc_dom_merged_div
instead of
epereira/ufbgctoolbox:bgc_dom_merge_div
check warning:
"In if (class(model_c) == "xgb.Booster") { ... :
the condition has length > 1 and only the first element will be used"
Hello Emiliano.
run_bgc_dom_div.bash meta fails with following error messages,
Is there any solution ?
java -ea -Xmx198617m -cp /bioinfo/software/bbmap/current/ driver.FilterReadsByName in=/input/sample1-1.fq.gz in2=/input/sample1-2.fq.gz out=/scratch//out_dom_meta_div/redu_r1.fasta out2=/scratch//out_dom_meta_div/redu_r2.fasta names=/scratch//out_dom_meta_div/all_headers.list include=t overwrite=t
Executing driver.FilterReadsByName [in=/input/sample1-1.fq.gz, in2=/input/sample1-2.fq.gz, out=/scratch//out_dom_meta_div/redu_r1.fasta, out2=/scratch//out_dom_meta_div/redu_r2.fasta, names=/scratch//out_dom_meta_div/all_headers.list, include=t, overwrite=t]
Set INTERLEAVED to false
Exception in thread "main" java.lang.RuntimeException: Can't find file /input/sample1-1.fq.gz
at fileIO.ReadWrite.getRawInputStream(ReadWrite.java:880)
at fileIO.ReadWrite.getGZipInputStream(ReadWrite.java:973)
at fileIO.ReadWrite.getInputStream(ReadWrite.java:813)
at fileIO.FileFormat.getFirstOctet(FileFormat.java:407)
at stream.FASTQ.isInterleaved(FASTQ.java:126)
at stream.FastqReadInputStream.(FastqReadInputStream.java:60)
at stream.ConcurrentReadInputStream.getReadInputStream(ConcurrentReadInputStream.java:119)
at stream.ConcurrentReadInputStream.getReadInputStream(ConcurrentReadInputStream.java:55)
at driver.FilterReadsByName.process(FilterReadsByName.java:256)
at driver.FilterReadsByName.main(FilterReadsByName.java:41)
filterbyname R1 and R2 failed
Hello,
we try to deploy BiG-MEx on our HPC-Cluster via Singularity, but the Docker images mentioned in the README and wiki aren't partially not available. It's also not very clear in the documentation which container is actually relevant or not, as the container names in the wrapper scripts and the README aren't always the same.
Not available (or not public?) images mentioned in README/Wiki
Not available (or not public?) images mentioned in the corresponding scripts
Can you update the README with the newest information needed to deploy BiG-MEx?
Thanks in advance and best regards,
Felix
Add an option to process amplicon data, without generating a targeted assembly.
Hi,
Thanks for a nice tool. Is it possible to make it available via conda?
-Susheel
Hello,
I try to go through the Getting Started in your Wiki, but it seems that there are files missing in the metagenomic_samples.tar.gz.
The Wiki says:
However, for this tutorial, we included the predicted ORFs (i.e., OSD*_orfs.faa) in metagenomic_samples.tar.gz.
But there are no OSD*_orfs.faa files in the archive download from: https://owncloud.mpi-bremen.de/index.php/s/fibkaNcmGifhLv9/download
.
Best Regards,
Felix
Remove choice or add "other" modes from CV shiny apps
check negative values, convert these to 0 might be better.
Add best practices to preprocess reads in the wiki
Add a check point for old fastq headers: read numbers separated with "/".
add uproc-orfs option in bgc_dom_annot
Hi,
I am trying to run BiG-MEx in the HPC of my university, which instead of docker uses singularity. I was wondering if you could add to the wiki an example of running run_bgc_dom_annot.bash by calling docker from singularity.
Thanks!
A declarative, efficient, and flexible JavaScript library for building user interfaces.
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. ๐๐๐
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google โค๏ธ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.