Giter Site home page Giter Site logo

Comments (9)

hamadodene avatar hamadodene commented on September 28, 2024 1

@lhotari Yes, we have pulsar 3.0.3, bk 4.16.4
And for E, Qw, Qa we use ensembleSize=2, writeQuorumSize=2, ackQuorumSize=2

from bookkeeper.

eolivelli avatar eolivelli commented on September 28, 2024 1

zk+null is the safest default because it automatically adatps to the existing layout.

I suggest to use that and let the clients automatically discover.
In case it is a new cluster when you format it using zk+null the layout will be hierarchical

from bookkeeper.

lhotari avatar lhotari commented on September 28, 2024

@hamadodene It would be helpful to share Pulsar version & Bookkeeper version & possible customized Ensemble size (E), write quorum (Qw) and ack quorum (Qa) size.

from bookkeeper.

lhotari avatar lhotari commented on September 28, 2024

@hamadodene noticed this in the output that you shared:

jute.maxbuffer value is 1048575 Bytes

in Pulsar, the default is -Djute.maxbuffer=10485760.

When you run Bookkeeper, do you use bin/pulsar bookie to start it?

This might not be relevant in this context, but just just wondering if large ZNodes with low jute.maxbuffer value could result in inconsistencies.

from bookkeeper.

lhotari avatar lhotari commented on September 28, 2024

When running Bookkeeper with Pulsar's bin/pulsar bookie script, one of the main differences is that Bookkeeper will use org.apache.pulsar.metadata.bookkeeper.PulsarMetadataBookieDriver and org.apache.pulsar.metadata.bookkeeper.PulsarMetadataClientDriver from the Pulsar code base for metadata operations.

from bookkeeper.

lhotari avatar lhotari commented on September 28, 2024

@hamadodene do you use offloading? I found issue apache/pulsar#21737 which could be related in that case.

from bookkeeper.

lhotari avatar lhotari commented on September 28, 2024

also apache/pulsar#15464

from bookkeeper.

hamadodene avatar hamadodene commented on September 28, 2024

@lhotari
We don't use offload. We have our own service that wraps Bookkeeper (we create an org.apache.bookkeeper.server.EmbeddedServer). We don't use the two classes you mentioned earlier, but we configure the metadataServiceUri of Bookkeeper as zk+hierarchical and the ZNode /ledgers/LAYOUT indicates hierarchical.

We recently forced the metadataServiceUri to be hierarchical; previously, we were using zk+null, which then used the Bookkeeper default. Therefore, the layout on the ZNode was Flat, probably due to defaults from older versions.

This caused problems because during the update, the ledger Pulsar ZNodes were written with hierarchical layout, while other nodes were written with flat layout. Perhaps this caused the inconsistencies.

However, Bookkeeper seemed to write without errors (at least it wrote the ZNodes); perhaps the missing ledgers in the logs are those written before we fixed the layout?

The update was made from Pulsar 2.9.5 to 3.0.3 and Bookkeeper 4.14.4 to 4.16.4.

from bookkeeper.

dmercuriali avatar dmercuriali commented on September 28, 2024

@eolivelli our system is pretty old. In the znode /ledgers/LAYOUT we had Flat.
We use the same BK cluster for pulsar and for some other parts of our application. Our code defaulted to zk+null.

After the pulsar upgrade we noticed that the ledgers for pulsar topics were created with the hierarchical layout (while the ledgers created directly by us were still created with the flat layout). This might be a problem with pulsar, maybe It forces the layout instead of reliyng on the cluster-default.

But the strange thing @hamadodene is reporting, is that pulsar was (apparently) able to publish messages on the topics, but could not read the messages because bk was throwing BKException$BKNoSuchLedgerExistsException: No such ledger exists on Bookies.
We then forced the hierarchical layout on the bk cluster, but bk still could not read the pulsar ledgers. Looking in the bk logs, we found no entries for the ledgers "created" before the layout switch.

Is it possible that bk was creating the znode for the ledger (with hierarchical layout), and then silently failed to actually write because of the conflicting layout?

from bookkeeper.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.