I happened to stumble over this while parsing a large dataset: spaCy throws an Asserti

AssertionError when parsing empty string

explosion commented on May 1, 2024

AssertionError when parsing empty string

Comments (3)

NSchrading commented on May 1, 2024

I've also run into this problem. My machine is running Linux Mint 16, 64-bit. I feel like this should be a priority to fix, since it prevents analyzing large datasets (e.g. data from the internet) that presumably contain weird formatting that results in empty string tokens. In my case, one sample was exactly the empty string, so it was equivalent to doing this:

from spacy.en import English
nlp = English()
nlp(u"")
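
For versions that still carry the assertion, one possible workaround is to drop empty samples before they reach the parser. This is only a minimal sketch: `parse_nonempty` is a hypothetical helper, not part of spaCy, and it assumes `nlp` is any callable that accepts a string, such as the `English()` instance above.

```python
def parse_nonempty(nlp, texts):
    """Yield (index, parsed) pairs, skipping samples that are the empty string.

    `nlp` is assumed to be any callable that takes a string. This helper
    is hypothetical and only illustrates the workaround; it is not a
    spaCy API.
    """
    for i, text in enumerate(texts):
        if not text:
            # The empty string is the input reported to raise the AssertionError.
            continue
        yield i, nlp(text)


if __name__ == "__main__":
    # Stand-in parser so the sketch runs without spaCy installed.
    def fake_nlp(s):
        return s.split()

    samples = [u"Hello world", u"", u"Another sample"]
    for idx, doc in parse_nonempty(fake_nlp, samples):
        print(idx, doc)
```

Yielding the original index alongside each result keeps the output aligned with the input dataset, so skipped samples can still be identified afterwards.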

honnibal commented on May 1, 2024

Thanks for the report, and sorry it took me a while to get to it. The fix should be published in master and on pip.

I'd intended this to be the behaviour for empty strings, but left a temporary assertion in the parser.

lock commented on May 1, 2024

This thread has been automatically locked since there has not been any recent activity after it was closed. Please open a new issue for related bugs.
