Giter Site home page Giter Site logo

Comments (8)

fpusan avatar fpusan commented on August 23, 2024

Hi!
Can you share the input genomes and the command you used so I can try to reproduce the error?

from superpang.

yi1873 avatar yi1873 commented on August 23, 2024

Species: Mycobacterium tuberculosis
Taxid: 1773
Genome:316 complete genomes in GenBank (https://ftp.ncbi.nlm.nih.gov/genomes/genbank/assembly_summary_genbank.txt)
Command: SuperPang.py --fasta genome/*.fa --output-dir 1773/pangenome --force-overwrite -t 20 --assume-complete -b 0.95 -i 0.95 -k 301

from superpang.

yi1873 avatar yi1873 commented on August 23, 2024

You can also test E.coli with all complete genomes in GenBank, get the same error.

from superpang.

fpusan avatar fpusan commented on August 23, 2024

Can reproduce with the Mycobacterium tuberculosis dataset

from superpang.

yi1873 avatar yi1873 commented on August 23, 2024

Full errors:

multiprocessing.pool.RemoteTraceback:
"""
Traceback (most recent call last):
File ".../anaconda3/envs/SuperPang/lib/python3.8/multiprocessing/pool.py", line 125, in worker
result = (True, func(*args, **kwds))
File ".../anaconda3/envs/SuperPang/lib/python3.8/multiprocessing/pool.py", line 48, in mapstar
return list(map(*args))
File ".../anaconda3/envs/SuperPang/lib/python3.8/site-packages/SuperPang/lib/Assembler.py", line 846, in reconstruct_sequence
if k[:-1] == kmers[-1][1:]:
IndexError: list index out of range
"""

The above exception was the direct cause of the following exception:

Traceback (most recent call last):
File ".../anaconda3/envs/SuperPang/bin/SuperPang.py", line 291, in
main(parse_args())
File ".../anaconda3/envs/SuperPang/bin/SuperPang.py", line 144, in main
contigs = Assembler(input_minimap2, args.ksize, args.threads).run(args.minlen, args.mincov, args.bubble_identity_threshold, args.genome_assignment_threshold, args.threads)
File ".../anaconda3/envs/SuperPang/lib/python3.8/site-packages/SuperPang/lib/Assembler.py", line 259, in run
NBP2seq = dict(zip(NBPs, self.multimap(self.reconstruct_sequence, threads, NBPs, self.vertex2hash, self.compressor)))
File ".../anaconda3/envs/SuperPang/lib/python3.8/site-packages/SuperPang/lib/Assembler.py", line 55, in multimap
res = pool.map(fun, range(len(iterable)))
File ".../anaconda3/envs/SuperPang/lib/python3.8/multiprocessing/pool.py", line 364, in map
return self._map_async(func, iterable, mapstar, chunksize).get()
File ".../anaconda3/envs/SuperPang/lib/python3.8/multiprocessing/pool.py", line 771, in get
raise self._value
IndexError: list index out of range

from superpang.

fpusan avatar fpusan commented on August 23, 2024

Thanks! I'm running it right now in my computers, adding some traces to see what could be happening.
Also, wild guess here, but does this also happen if you call it like
SuperPang.py --fasta genome/*.fa --output-dir 1773/pangenome --force-overwrite -t 20 --assume-complete -b 0.95 -i 0.95 -k 301 -n 20 ?

from superpang.

fpusan avatar fpusan commented on August 23, 2024

Nevermind, I now think that the problem is due to the presence of ambiguous bases in the input fasta files
(e.g. "M" to represent either "A" or "C"). This happens for these genomes and would definitely break my code in a few places. Will think of a fix

from superpang.

fpusan avatar fpusan commented on August 23, 2024

Hi again! This issue is hopefully fixed in the new version 0.9.3

from superpang.

Related Issues (15)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.