Giter Site home page Giter Site logo

easymetagenome's People

Contributors

tong-chen avatar yongxinliu avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

easymetagenome's Issues

Issue regarding output of lefse-plot_features.py from LEfse

Hi,
I'm new here and found your project interesting and helpful!

But I'm kinda confused about the lefse output for the single feature produced by lefse-plot_features.py. Cuz there is no subclass(SampleID) annotation under the Fig. yield, like your own example result showed:
Screenshot 2023-06-15 at 11 39 23 AM
only group classification above the bars.

Is it because the input file doesn't include the subclass(SampleID) information? If so, then how should I insert that info into input file? by format2lefse? I failed to modify format2lefse Rscript to make it. Can you help me with this?

humann3多线程求助

您好,在运行单样本使用humann3时可以正常运行,使用rush多任务时报错,请问如何解决

如果服务器性能好,请设置--threads值为8/16/32

tail -n+2 result/metadata.txt | cut -f1 | rush -j 2
"humann --input temp/concat/{1}.fq
--output temp/humann3/ --threads 3 --metaphlan-options '--bowtie2db /db/metaphlan4 --index mpa_vOct22_CHOCOPhlAnSGB_202212 --offline'"
报错如下:
(humann3) xumingyuan@cn5:meta$tail -n+2 result/metadata.txt|cut -f1|rush -j 2 \

"humann --input temp/concat/${i}.fq --output temp/humann3/ --threads 32 -- metaphlan-options '--bowtie2db /home/xumingyuan/data_HD/database/metaphlan4 --index mpa_vOct22_CHOCOPhlAnSGB_202212 --offline'"
/bin/bash: line 1: humann: command not found
/bin/bash: line 1: humann: command not found
[ERRO]
我修改补全了humann3的地址,tail -n+2 result/metadata.txt|cut -f1|rush -j 2
"/home/xumingyuan/data_HD/micromamba/envs/humann3/bin/humann --input temp/concat/${i}.fq --output temp/humann3/ --threads 32 -- metaphlan-options '--bowtie2db /home/xumingyuan/data_HD/database/metaphlan4 --index mpa_vOct22_CHOCOPhlAnSGB_202212 --offline'"
humann: error: unrecognized arguments: -- metaphlan-options --bowtie2db /home/xumingyuan/data_HD/database/metaphlan4 --index mpa_vOct22_CHOCOPhlAnSGB_202212 --offline
[ERRO] 。
非常感谢。

NCBI核酸数据库解压问题

您好!本人生信小白一枚,最近在按照0install.sh安装软件配置环境,在运行NCBI核酸和物种信息(可选)部分时遇见了解压数据库的问题。由于最近国家微生物科学数据中心网页维护,我从网盘中下载了NCBI/nt中的所有*.tar.gz安装包,并成功上传到我的服务器。运行cd ${db}/NCBI/nt; for i in *.tar.gz; do tar xzf $i; done对安装包逐个解压时,压缩包01 08 09 25 26 35 48 59 61 65 70出现了报错。有以下两个报错:

tar: 跳转到下一个头
gzip: stdin: invalid compressed data--crc error
gzip: stdin: invalid compressed data--length error
tar: Child returned status 1
tar: Error is not recoverable: exiting now

gzip: stdin: invalid compressed data--format violated
tar: 归档文件中异常的EOF
tar: Error is not recoverable: exiting now

按照脚本中提示和网络检索的帮助,我认为文件可能有损坏,于是重新在云盘上下载。重新上传后问题1解决,但问题2无法解决,想要寻求帮助。

另外,看到教程中NCBI核酸和物种信息为可选,我是否可以直接跳过这一部分?

感谢感谢

megahit使用问题

老师您好!
您的代码中吗,使用megahit组装的时候,都是基于所有样本进行拼接的,但是若实操的时候,面临大样本数据,计算机内存不足以进行混合组装,只能使用单样本组装后,如何进行后续去冗余,基因定量,功能注释等步骤呢。是单样本组装后,将所有样本分别组装后的contigs,cat汇总成一个大contig文件,在进行后续去冗余等步骤吗?希望老师能解答我的问题,十分感谢!

Script目录在哪

刘老师,流程中提到了Script目录,但是在项目文件里面我没找到,我应该如何获取?这个Script目录是保密的?我应该如何获取?谢谢!

db文件的位置

作者您好!我想请问一下*EasyMicrobiome/dbcan2/CAZyDB.08062022数据库在哪里呢?我没有找到。
如果使用以下方法下载的数据库是否是一致的呢?

访问CAZy官网:[https://www.cazy.org/;](https://www.cazy.org/%EF%BC%9B)
点击页面上方的“Download”按钮,进入下载页面;
在下载页面中,找到CAZy Database一栏,点击右侧的Download链接;
进入CAZy Database下载页面后,选择与您的系统版本相对应的文件进行下载,例如Linux系统可以下载cazy_2022-08-06.tar.gz文件;
解压文件:tar zxvf cazy_2022-08-06.tar.gz;
将解压后的文件放到您需要使用的位置。

snakemake

老师好,我们能提供在集群上运行的snakemanke版本吗?

How to select reads to a fixed level

Hi Ph.D. Liu,

I am puzzled with the part that how to select a fixed number of reads across all samples in the metagenomic analysis. Is there any software or script that is fittable to humann2 pipeline?

Thank you!

Bracken估计丰度,不同样本的结果文件行数不同

按照 1.Pipeline 里的代码用Bracken重新估算 C1和C2的丰度,但是汇总结果文件时发现,结果文件的行数并不相同。但是Kraken2的结果行数是相同的 ,按照1.Pipeline里的代码可以直接进行合并。下面是bracken的参数-r 100 -l S。
bracken -d ${kraken2_db} -i $work_dir/02_Kraken2/${Sample}.report
-r ${len} -l ${tax} -o $work_dir/02_Kraken2/${Sample}.bracken
-w $work_dir/02_Kraken2/${Sample}.bracken.report
请问造成这种结果的原因是什么?这样的结果是否有问题?

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.