Comments (10)
Hello,
The purpose of the bash expression echo ${out_path##*/}
is to cut out all preceding path elements.
If the file_path here is 1_starting_files/control_1_TINY.merged
, for example, this bash expression will return control_1_TINY.merged
.
I'm not sure why you're having issues identifying the functional and taxonomic information of the genes. Is this a failure in the master script? Is this occurring on Step 5, when generating the functional results from the RefSeq database? Are you using the default database (RefSeq_bac)?
Hopefully I can help you fix these issues.
from samsa2.
I am making some updates to the tool, but this shouldn't be causing issues such as these you're reporting.
from samsa2.
I run the program, and got the following problem,
Calculating DESeq results for hierarchy level 1
Error in row.names<-.data.frame
(*tmp*
, value = value) :
duplicate 'row.names' are not allowed
Calls: rownames<- ... rownames<- -> row.names<- -> row.names<-.data.frame
In addition: Warning message:
non-unique values when setting 'row.names': ‘’, ‘1,2-dihydroxy-3-keto-5-methylthiopentene dioxygenase (EC 1.13.11.54)’, ‘1,2-dihydroxycyclohexa-3,5-diene-1-carboxylate dehydrogenase (EC 1.3.1.25)’, ‘1,2-dihydroxynaphthalene dioxygenase’, ‘12-oxophytodienoate reductase (OPR3)(DDE1)’, ‘1,4-alpha-glucan branching enzyme (EC 2.4.1.18)’, ‘1,4-alpha-glucan (glycogen) branching enzyme, GH-13-type (EC 2.4.1.18)’, ‘1,4-dihydroxy-2-naphthoate octaprenyltransferase (EC 2.5.1.74)’, ‘16 kDa heat shock protein A’, ‘16 kDa heat shock protein B’, ‘16S rRNA m(5)C 967 methyltransferase (EC 2.1.1.-)’, ‘16S rRNA processing protein RimM’, ‘1-acyl-sn-glycerol-3-phosphate acyltransferase (EC 2.3.1.51)’, ‘1-deoxy-D-xylulose 5-phosphate reductoisomerase (EC 1.1.1.267)’, ‘1-deoxy-D-xylulose 5-phosphate synthase (EC 2.2.1.7)’, ‘1-hydroxy-2-methyl-2-(E)-butenyl 4-diphosphate synthase (EC 1.17.7.1)’, ‘1-phosphofructokinase (EC 2.7.1.56)’, ‘1-pyr [... truncated]
Execution halted
can you let me know how to sort the problem
from samsa2.
Hi utpalh,
Did this error happen while running the master_script.sh? Or were you running one of the R scripts separately?
Just from the error that you've pasted in, it looks like you're trying to run the Subsystems_DESeq_stats.R script, but it's having issues with the raw counts table. Could you upload the raw counts table that you received to this issue? This should be located in your Step 2 folder.
from samsa2.
raw counts
control_1_Healthy 21092913
control_2_Healthy 20298464
control_3_Healthy 20078549
experimental_4_HLB 19632654
experimental_5_HLB 20712965
experimental_6_HLB 22204048
I run DESeq for REfseq, no problem but for the subsystems_results i have the error
I have run the master_script.sh, the error came while running the following lines in it.
Rscript $R_programs/Subsystems_DESeq_stats.R -I $starting_location/step_5_output/Subsystems_results/receipts/ -O Subsystems_level-1_DESeq_results.tab -L 1 -R $starting_location/step_2_output/raw_counts.txt
I tried both the file location with subsystem result files.
- step_5_output/Subsystems_results/
- step_5_output/Subsystems_results/receipts/
Output
[1] "USAGE: $ run_DESeq_stats.R -I working_directory/ -O save.filename -L level (1,2,3,4)"
Working directory is /home/utpal/Documents/samsa2/step_5_output/Subsystems_results/receipts/
Saving results as Subsystems_level-1_DESeq_results.tab
Calculating DESeq results for hierarchy level 1
Error in row.names<-.data.frame
(*tmp*
, value = value) :
duplicate 'row.names' are not allowed
Calls: rownames<- ... rownames<- -> row.names<- -> row.names<-.data.frame
In addition: Warning message:
from samsa2.
Hmm, the counts file contents you posted look fine. So that's not the issue.
You shouldn't be using the receipts location; this should be calling the files in $STEP_5/Subsystems_results
. Is there a chance you could attach a couple of the results files to this issue?
(You can attach files to a Github comment by dragging them into the text box or using the link at the bottom to select them.)
If you cannot do this, I can try making some more explicit merge commands in the R script and uploading that for you to try in place of yours, but I'm not sure why it's encountering this issue. I am not seeing this in my tests.
from samsa2.
I uploaded 4 files, Check if there is anything you find that are not correct
4 files.zip
from samsa2.
Ah, that's resolved it!
It looks like the R script was properly converting blank hierarchy entries into "NO HIERARCHY", but was missing when an entry is instead classified as "N/A". One line fix, and the script now runs without any problems.
I've pushed an update, but I'm also attaching the R script here. If you replace your "Subsystems_DESeq_stats.R" file with this one, things should work. (You'll have to gunzip the file, because Github won't let me share a script file directly, but will if it's gzipped. Go figure.)
from samsa2.
Thanks Sam its working fine now
from samsa2.
I came upon the following error while processing for Level 2 subsystem and level 4 subsystem
[1] "USAGE: $ run_DESeq_stats.R -I working_directory/ -O save.filename -L level (1,2,3,4)"
Working directory is /home/utpal/Documents/samsa2/step_5_output/Subsystems_results/
Saving results as Subsystems_level-2_DESeq_results.tab
Calculating DESeq results for hierarchy level 2
Error in setnames(x, value) :
Can't assign 4 names to a 3 column data.table
Calls: colnames<- ... colnames<- -> names<- -> names<-.data.table -> setnames
Execution halted
[1] "USAGE: $ run_DESeq_stats.R -I working_directory/ -O save.filename -L level (1,2,3,4)"
Working directory is /home/utpal/Documents/samsa2/step_5_output/Subsystems_results/
Saving results as Subsystems_level-1_DESeq_results.tab
Calculating DESeq results for hierarchy level 4
Error in row.names<-.data.frame
(*tmp*
, value = value) :
duplicate 'row.names' are not allowed
Calls: rownames<- ... rownames<- -> row.names<- -> row.names<-.data.frame
In addition: Warning message:
non-unique value when setting 'row.names': ‘NO HIERARCHY’
Execution halted
from samsa2.
Related Issues (20)
- Working with Archaea HOT 3
- Header parsing using blast format 6 with salltitles HOT 1
- AttributeError: module 'time' has no attribute 'clock' HOT 5
- Help with script
- Error running DIAMOND_subsystems_analysis_counter.py HOT 9
- About full_database_download.bash HOT 8
- full_database_download.bash which versions? HOT 1
- Error with R script: DESeq_stats. HOT 11
- Combine step 5 outputs from different samples HOT 1
- Issue with colors in 'make_combined_graphs.R HOT 1
- Sequencing depth differs between experimental and control HOT 1
- Error in DIAMOND_analysis_counter.py HOT 1
- issue with get_normalized_count_table.R
- Error in R-analysis step HOT 2
- error standardized_DIAMOND_analysis_counter.py HOT 4
- test_of_master_script.py HOT 2
- update HOT 1
- error while running run_DESeq_stats.R HOT 5
- Information regarding New available databases - Refseq and SEED database HOT 2
- Loosing too many reads after Pear step HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from samsa2.