Giter Site home page Giter Site logo

Comments (4)

Juke34 avatar Juke34 commented on July 25, 2024

Potentially yes. Do you have any other error / trace before this ?
If you can send me your files I can have a look more in detail.

from emblmygff3.

Iseez avatar Iseez commented on July 25, 2024

There is no trace before this one.
The files I was trying to use are attached to this reply.

gff&fasta.zip

from emblmygff3.

Juke34 avatar Juke34 commented on July 25, 2024

Thank you for having reported this problem. This is indeed a bug.
Line 194 of EMBLmyGFF3.py
while end:
has to be replaced by
while end < len(seq):

It will be fixed in a future release.
Until then, you can fix it yourself that way:

Uninstall EMBLmyGFF3:

pip unistall EMBLmyGFF3

clone the repo in a nice place:

mkdir ~/git
cd ~/git
git clone https://github.com/NBISweden/EMBLmyGFF3.git
cd EMBLmyGFF3

replace line 194 of EMBLmyGFF3.py as indicated before (here using the nano text editor but you can use what ever you want):
nano EMBLmyGFF3/modules/feature.py

install:

python setup.py install

or if you do not have administartive rights on your machine:

python setup.py install --user


Except this bug I can point 2 other problems:

  1. The fasta headers are different from those reported in column 1 of the gff file. Be sure the annotation has been done using this fasta sequences. If it is the case fix the name they must be similar in both files otherwise EMBLmyGFF3 will not be able to match the features (gene,cds,etc) to the proper sequence. Currently no feature at all will be attached to the sequences.
  2. if you plan to submit the EMBL file created like that, you will have a rejection from ENA. Indeed they do not accept sequences that start or/and end with N (gaps). You have plenty of cases like that. This is also related to the current bug. If you shrink the N from extremities you will passthrough the current bug in EMBLmyGFF3 and do not encounter any rejection due to trailing Ns during ENA submission.
    To do so first you must fix names between your gff and fasta files as indicated point 1 and then you can use the script gff3_sp_clipN_seqExtremities_and_fixCoordinates.pl from the GAAS repository.

from emblmygff3.

Iseez avatar Iseez commented on July 25, 2024

I realised about the headers after having a problem related with that with another species, the hedears will be changed, the EMBL sequences are not going to be submitted, I needed them to make some analysis.
Thank you, I really apreciate your help.

from emblmygff3.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.