Giter Site home page Giter Site logo

Out of Memory about craq HOT 4 CLOSED

jiaolaboratory avatar jiaolaboratory commented on May 25, 2024
Out of Memory

from craq.

Comments (4)

JiaoLaboratory avatar JiaoLaboratory commented on May 25, 2024

This situation is possible, but based on our tests, it occurs almost exclusively in the case of exceptionally long single chromosomes (e.g., when the length of a single chromosome exceeds 300 Mb). Generally, CRAQ consumes more memory when processing high-noise ONT data compared to HiFi data. Even although the memory usage still remains within manageable limits. By the way, could you please provide information on how your 'lgs.sort.bam' was generated? Additionally, if the memory issue cannot be resolved, based on our experience, reducing the usage of long reads (e.g., using only 50X coverage ) is almost no impact on the final results.

from craq.

yangfangyuan0102 avatar yangfangyuan0102 commented on May 25, 2024

Hi, @JiaoLaboratory
Thank you for your reply. I mapped the long reads using minimap2 with typical options "-ax map-pb | samtools view -F1796 -q 20 | samtools sort ....". I tried to reduce datasize in half, 75X. Memory is still insufficient.
One thing I realized was that my genome was polished by long- and short-reads mutiple time, so it could be very different from the raw long reads. Will this cause the raw long reads to be too noisy?

from craq.

JiaoLaboratory avatar JiaoLaboratory commented on May 25, 2024

It's highly likely that this is the cause. Currently, CRAQ's handling of high-noise alignments consumes significant resources. Would you mind trying the previous version of CRAQ-v1.0.8 (https://github.com/JiaoLaboratory/CRAQ/releases/download/v1.0.8/CRAQ-v1.0.8.zip)? This version lacks optimizations for high-accuracy data processing (which consumes resources in that step), but there haven't been significant changes in handling high-noise ONT/CRL data with CRAQ-v1.0.9.

from craq.

yangfangyuan0102 avatar yangfangyuan0102 commented on May 25, 2024

It's highly likely that this is the cause. Currently, CRAQ's handling of high-noise alignments consumes significant resources. Would you mind trying the previous version of CRAQ-v1.0.8 (https://github.com/JiaoLaboratory/CRAQ/releases/download/v1.0.8/CRAQ-v1.0.8.zip)? This version lacks optimizations for high-accuracy data processing (which consumes resources in that step), but there haven't been significant changes in handling high-noise ONT/CRL data with CRAQ-v1.0.9.

Great, The version 1.0.8 works. Thank you!

from craq.

Related Issues (17)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    πŸ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. πŸ“ŠπŸ“ˆπŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❀️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.