Comments (17)
How much RAM does your computer have?
from hifiasm.
I'm running in a linux environment with 1000G RAM.
Based on our experiments, a 200Gb fasta needs about 200G RAM, so 1000G RAM should be enough. Could you have a look at the Linux system log? It is usually in a file like /var/log/messages. That may tell you why the run was killed.
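As a rough illustration of what to look for in that log, the sketch below scans log text for the kernel's out-of-memory kill message. The function name `find_oom_kills` and the sample log lines are hypothetical and written only for this example; the exact wording of the kernel message varies between kernel versions, so the pattern is an assumption that may need adjusting.

```python
import re

# Assumed message shape: "Out of memory: Killed process <pid> (<name>) ..."
# Older kernels print "Kill process" instead, so both forms are accepted.
OOM_PATTERN = re.compile(r"Out of memory: Kill(?:ed)? process (\d+) \(([^)]+)\)")


def find_oom_kills(log_text, program=None):
    """Return (pid, name) pairs for OOM kills found in log_text.

    If program is given, only kills of processes with that name are returned.
    """
    hits = []
    for line in log_text.splitlines():
        match = OOM_PATTERN.search(line)
        if match and (program is None or match.group(2) == program):
            hits.append((int(match.group(1)), match.group(2)))
    return hits


# Illustrative log text, NOT taken from the reporter's machine.
sample_log = (
    "kernel: Out of memory: Killed process 12345 (hifiasm) total-vm:1000000kB\n"
    "kernel: usb 1-1: new high-speed USB device\n"
)

print(find_oom_kills(sample_log, program="hifiasm"))
```

In practice the text would come from /var/log/messages or from the output of dmesg; reading either may require elevated permissions on some systems.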
I'm sorry, I don't have permission to view the files under this path. Why is there no log file output? I also ran the assembly on another Linux server. It wasn't killed there, but it has been running for a week without any file output. Is that normal?
Could you please check the status of the running thread on the other server? I worry that hifiasm didn't run successfully on that server. If there is no log, that means hifiasm stopped at the k-mer counting step, which is abnormal...
It's weird. I reran it and there is still no file output.
Sorry, what do you mean by no file output? For a 200G fasta, k-mer counting may need several hours, so hifiasm shouldn't write anything to stderr in the first several hours.
@DongnaMa What is the exact command line you are using? How many threads are you specifying?
At first, my run command was: hifiasm -z 50 -o THS.asm -t 20 ../1_data/THS_pacbio.fasta and it ran for a week without any file output. On another server I ran: hifiasm -z 50 -o THS.asm -t 60 THS_pacbio.fasta and tried different thread counts, including 10, 30, and 100. Each run lasted a few hours and was then killed.
My fasta sequences total 61 GByte, and hifiasm used at most 179 GByte of RAM:
User time (seconds): 1420965.92
System time (seconds): 11862.12
Percent of CPU this job got: 13456%
Elapsed (wall clock) time (h:mm:ss or m:ss): 2:57:27
Average shared text size (kbytes): 0
Average unshared data size (kbytes): 0
Average stack size (kbytes): 0
Average total size (kbytes): 0
Maximum resident set size (kbytes): 179872840
Average resident set size (kbytes): 0
Major (requiring I/O) page faults: 1
Minor (reclaiming a frame) page faults: 615657759
Voluntary context switches: 133891585
Involuntary context switches: 150357957
Swaps: 0
File system inputs: 72
File system outputs: 68024272
Socket messages sent: 0
Socket messages received: 0
Signals delivered: 0
Page size (bytes): 4096
Exit status: 0
Which means peak RAM is about fastaSize*3. Nonetheless, 200*3 is less than 1 TByte. I would check dmesg to see if the OOM killer killed your program.
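The factor-of-three estimate can be reproduced from the report above. The following is a minimal sketch that reads the "Maximum resident set size" line, converts it to GByte, and scales it to a 200 GByte input. The helper name `peak_rss_gb` is hypothetical; the conversion treats 1 GByte as 10^6 kbytes to match the rounding used in this comment; and linear scaling with input size is only an assumption drawn from this single run.

```python
import re


def peak_rss_gb(time_output):
    """Extract peak resident set size from a `time -v` style report, in GByte."""
    match = re.search(r"Maximum resident set size \(kbytes\):\s*(\d+)", time_output)
    if match is None:
        raise ValueError("no 'Maximum resident set size' line found")
    # Assumption: 1 GByte = 1e6 kbytes, matching the figures quoted in the thread.
    return int(match.group(1)) / 1e6


# The relevant line copied from the report above.
report = "Maximum resident set size (kbytes): 179872840\n"

fasta_gb = 61                   # input size stated in the comment
peak = peak_rss_gb(report)      # about 179.9 GByte
ratio = peak / fasta_gb         # peak RAM per GByte of FASTA, close to 3
projected = ratio * 200         # naive linear projection for a 200 GByte input

print(f"peak {peak:.1f} GByte, ratio {ratio:.2f}, projected {projected:.0f} GByte")
```

Under these assumptions the projection for a 200 GByte input stays below 1 TByte, which is consistent with the point made here that memory alone should not be the limit.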
Just as a side note, it would be handy in this case that hifiasm starts to write to the log what it is doing already from the start. If you have no log file at all, you start to suspect that something else went completely wrong. I had a similar experience once.
PacBio assembled the 26Gb redwood genome at ~33X using 710GB memory at the peak, so 1TB should be ok. @DongnaMa are you using a job scheduler such as Slurm/LSF/SGE? Also, are you using hifi data?
"it would be handy in this case that hifiasm starts to write to the log what it is doing already from the start."
That makes sense. Thanks for the suggestion.
My server uses SGE, but I'm running the program directly on the largest node without a job scheduler. I only used PacBio data and didn't use any other data.
I'm guessing that a limit may have been set on my 1TB of memory, so the run got killed. Is there any way to limit the maximum memory used by the program?
By "PacBio data", do you mean CLR (noisy reads) or HiFi/CCS (accurate reads)? Note that hifiasm only works with HiFi data, not CLR.
It's a CCS type.
Sorry, I double-checked: it's CLR data, so it isn't suitable. Thanks, guys.
np, have a good night.