
Comments (4)

shikhar commented on July 1, 2024

Good point: with key.ignore=true, using the offset as the ES document version is not useful, since the topic/partition/offset is already encoded in the key.
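To illustrate why the version is redundant in that mode, here is a minimal sketch. It assumes the generated document id is the topic, partition and offset joined with `+`, and that the offset is reused as the document version. `SinkRecord`, `document_id` and `document_version` are hypothetical names for illustration, not the connector's actual code.

```python
from typing import NamedTuple


class SinkRecord(NamedTuple):
    """Hypothetical stand-in for a Kafka Connect sink record."""
    topic: str
    partition: int
    offset: int


def document_id(record: SinkRecord) -> str:
    # Assumed id format for key.ignore=true: topic, partition and offset joined by '+'.
    return f"{record.topic}+{record.partition}+{record.offset}"


def document_version(record: SinkRecord) -> int:
    # Assumption: the Kafka offset is reused as the document version.
    return record.offset


records = [
    SinkRecord("logs", 0, 41),
    SinkRecord("logs", 0, 42),
    SinkRecord("logs", 1, 42),
]
ids = [document_id(r) for r in records]

for record, doc_id in zip(records, ids):
    # The version can always be read back from the id, so it adds no information.
    print(doc_id, document_version(record))
```

Because every id maps to exactly one offset, a given document can only ever be written with one version, so version checks cannot distinguish an older write from a newer one.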

Can you elaborate on 'lots of warnings'? I would expect some warnings when the connector starts after an unclean shutdown, but they should not persist. The warnings should only appear when the connector has to perform recovery and re-send documents that were already indexed.

If it is more frequent than that, it would be great to see a complete Connect worker log if possible.

from kafka-connect-elasticsearch.

shikhar commented on July 1, 2024

INFO or DEBUG for this log line may be more appropriate in retrospect.

UPDATE: done in b929b1d


simplesteph commented on July 1, 2024

Seeing the same issue in a similar setup.
The error that triggers it is:

[2017-01-06 04:59:58,458] ERROR Commit of WorkerSinkTask{id=elasticsearch-topic-0} offsets threw an unexpected exception:  (org.apache.kafka.connect.runtime.WorkerSinkTask)
org.apache.kafka.connect.errors.ConnectException: Flush timeout expired with unflushed records: 560
	at io.confluent.connect.elasticsearch.bulk.BulkProcessor.flush(BulkProcessor.java:302)
	at io.confluent.connect.elasticsearch.ElasticsearchWriter.flush(ElasticsearchWriter.java:217)
	at io.confluent.connect.elasticsearch.ElasticsearchSinkTask.flush(ElasticsearchSinkTask.java:125)
	at org.apache.kafka.connect.runtime.WorkerSinkTask.commitOffsets(WorkerSinkTask.java:287)
	at org.apache.kafka.connect.runtime.WorkerSinkTask.iteration(WorkerSinkTask.java:157)
	at org.apache.kafka.connect.runtime.WorkerSinkTask.execute(WorkerSinkTask.java:143)
	at org.apache.kafka.connect.runtime.WorkerTask.doRun(WorkerTask.java:140)
	at org.apache.kafka.connect.runtime.WorkerTask.run(WorkerTask.java:175)
	at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511)
	at java.util.concurrent.FutureTask.run(FutureTask.java:266)
	at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)
	at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)
	at java.lang.Thread.run(Thread.java:745)

Which is very odd...
Then I'm getting all these warnings.

PS: How can I benefit from the Elasticsearch connector updates if I'm using the cp-docker-images?


 avatar commented on July 1, 2024

@shikhar Sorry for the very long delay, I didn't notice the notification...

By 'lots of warnings' I meant thousands of warnings, not just a few when the connector starts after an unclean shutdown.
Right now we no longer use the Kafka Elasticsearch connector, so I am not sure I can help move this issue forward.

My current hypothesis is that the connector was reading too many items from Kafka and could not index them into Elasticsearch before session.timeout.ms expired. The commit then fails (because it is too late) and the items are given to another thread. However, some of them are already indexed.
(Not 100% sure, but this is the problem we hit with Logstash using the Kafka input plugin and the Elasticsearch output plugin.)
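The hypothesis above can be sketched with a toy simulation. It assumes external-version semantics, where a write is accepted only if its version is strictly higher than the stored one, and it assumes each rejected write shows up as one warning. `VersionedIndex` and `deliver` are hypothetical helpers, not the Elasticsearch client or the connector.

```python
from typing import Dict, List, Tuple


class VersionedIndex:
    """Toy in-memory index with external-version semantics (an assumption)."""

    def __init__(self) -> None:
        self.versions: Dict[str, int] = {}
        self.conflicts: List[str] = []

    def index(self, doc_id: str, version: int) -> bool:
        current = self.versions.get(doc_id)
        if current is not None and version <= current:
            # The document is already stored at this version or newer: conflict.
            self.conflicts.append(doc_id)
            return False
        self.versions[doc_id] = version
        return True


def deliver(index: VersionedIndex, batch: List[Tuple[str, int]]) -> None:
    # Send every (document id, version) pair in the batch to the index.
    for doc_id, version in batch:
        index.index(doc_id, version)


batch = [(f"logs+0+{offset}", offset) for offset in range(5)]
index = VersionedIndex()

deliver(index, batch[:3])  # part of the batch is indexed, then the offset commit fails
deliver(index, batch)      # the whole batch is delivered again after the failed commit

print(len(index.conflicts), "conflicts for already-indexed documents")
```

In this sketch only the documents indexed before the failed commit conflict on redelivery. If commits kept failing, each redelivered batch would repeat those conflicts, which would be consistent with thousands of warnings rather than a few at startup.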

Because I will not be able to help test any solution right now, from my point of view you can close this issue.

