
Comments (4)

ravikkr18 commented on August 17, 2024

Hi harakiri406,

I had exactly the same issue. I just changed ELASTICSEARCH_TYPE from "items" to something else, and now it works perfectly.

from scrapy-elasticsearch.

harakiri406 commented on August 17, 2024

I already tried several ELASTICSEARCH_TYPE values (data, link, url), and none of them works. I suspected that ES was rejecting certain data types, but in that case my tcpdump would at least show some packets.

ravikkr18 commented on August 17, 2024

In my case it first collects about 100-150 items and then sends them all to ES one by one. You can see the output returned by ES in the terminal.
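The buffering behaviour described above can be sketched roughly as follows. This is a minimal illustration, not the pipeline's actual code: the class `BufferedSender`, its `send_batch` callback, and the `buffer_length` parameter are hypothetical names. Only `items_buffer`, `send_items`, and `close_spider` mirror names that appear in the traceback later in this thread.

```python
class BufferedSender:
    """Hypothetical stand-in for a buffering pipeline (not the scrapy-elasticsearch code)."""

    def __init__(self, send_batch, buffer_length=100):
        self.send_batch = send_batch        # callable that receives a list of items
        self.buffer_length = buffer_length  # flush once this many items are buffered
        self.items_buffer = []

    def process_item(self, item):
        # Collect the item; nothing is sent until the buffer is full.
        self.items_buffer.append(item)
        if len(self.items_buffer) >= self.buffer_length:
            self.send_items()
        return item

    def send_items(self):
        # Hand the buffered items over in one batch and start a fresh buffer.
        if self.items_buffer:
            self.send_batch(list(self.items_buffer))
            self.items_buffer = []

    def close_spider(self):
        # Flush whatever is left when the crawl finishes.
        self.send_items()


sent_batches = []
sender = BufferedSender(sent_batches.append, buffer_length=3)
for n in range(7):
    sender.process_item({"n": n})
sender.close_spider()
```

With a design like this, no network traffic appears until the buffer fills or the spider closes, so a short crawl may send nothing at all until the very end.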

harakiri406 commented on August 17, 2024

I found the cause of my problem: my data. The error was hard to find in the logs because they were so large. This issue can be closed.

2017-08-03 09:03:42 [scrapy.core.engine] INFO: Closing spider (finished)
2017-08-03 09:03:42 [scrapy.core.engine] ERROR: Scraper close failure
Traceback (most recent call last):
  File "/usr/local/lib/python2.7/dist-packages/twisted/internet/defer.py", line 653, in _runCallbacks
    current.result = callback(current.result, *args, **kw)
  File "/usr/local/lib/python2.7/dist-packages/scrapyelasticsearch/scrapyelasticsearch.py", line 99, in close_spider
    self.send_items()
  File "/usr/local/lib/python2.7/dist-packages/scrapyelasticsearch/scrapyelasticsearch.py", line 86, in send_items
    helpers.bulk(self.es, self.items_buffer)
  File "/usr/local/lib/python2.7/dist-packages/elasticsearch/helpers/__init__.py", line 195, in bulk
    for ok, item in streaming_bulk(client, actions, **kwargs):
  File "/usr/local/lib/python2.7/dist-packages/elasticsearch/helpers/__init__.py", line 162, in streaming_bulk
    for bulk_actions in _chunk_actions(actions, chunk_size, max_chunk_bytes, client.transport.serializer):
  File "/usr/local/lib/python2.7/dist-packages/elasticsearch/helpers/__init__.py", line 61, in _chunk_actions
    data = serializer.dumps(data)
  File "/usr/local/lib/python2.7/dist-packages/elasticsearch/serializer.py", line 50, in dumps
    raise SerializationError(data, e)
SerializationError: 
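A SerializationError raised while the bulk helper serializes the buffer suggests that at least one item holds a value that cannot be turned into JSON. One way to locate such a value before sending is sketched below. The helper `find_unserializable` and the sample items are hypothetical, and the check uses the standard `json` module as an approximation: the Elasticsearch client's own serializer may accept a few types that plain `json.dumps` rejects, such as dates.

```python
import json


def find_unserializable(items):
    """Return (item index, field name, error message) for each value json.dumps rejects."""
    bad = []
    for index, item in enumerate(items):
        for key, value in item.items():
            try:
                json.dumps(value)
            except (TypeError, ValueError) as exc:
                bad.append((index, key, str(exc)))
    return bad


# Hypothetical sample data: the second item carries a set, which is not valid JSON.
items = [
    {"url": "http://example.com/1", "title": "ok"},
    {"url": "http://example.com/2", "tags": {"a", "b"}},
]
problems = find_unserializable(items)
for index, key, message in problems:
    print("item %d, field %r: %s" % (index, key, message))
```

Running a check like this on a small sample of scraped items is usually quicker than searching a large crawl log for the offending record.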
