Priam never marks a node as dead (CLOSED)

netflix commented on September 27, 2024
Priam never marks a node as dead

from priam.

Comments (7)

matthewbogner commented on September 27, 2024

Not trying to debate the test you ran or that you didn't see what you saw :)

But here is where Priam marks an instance in SimpleDB with the "-dead" suffix.
https://github.com/Netflix/Priam/blob/master/priam/src/main/java/com/netflix/priam/identity/InstanceIdentity.java#L119
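As a rough illustration of the idea (not Priam's actual code), marking an instance as dead can be pictured as re-filing its registry record under a name that carries the "-dead" suffix, which frees its token for a replacement. In this sketch the dict stands in for SimpleDB, and `mark_dead`, the record fields, the instance IDs, and the choice to put the suffix on the application name are all assumptions made for the example.

```python
DEAD_SUFFIX = "-dead"


def mark_dead(registry, app, instance_id):
    """Move an instance's record into the '<app>-dead' group (hypothetical layout).

    Returns the token that the dead instance was holding.
    """
    record = registry[app].pop(instance_id)
    registry.setdefault(app + DEAD_SUFFIX, {})[instance_id] = record
    return record["token"]


# In-memory stand-in for the SimpleDB domain; all values are placeholders.
registry = {"cass_app": {"i-aaa": {"token": "0"}, "i-bbb": {"token": "100"}}}
freed_token = mark_dead(registry, "cass_app", "i-bbb")
```

After the call, `i-bbb` is listed only under `cass_app-dead`, and `freed_token` is the token a replacement instance could pick up.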

sagarl commented on September 27, 2024

Looks like the issue is resolved, hence closing it.

aryanet commented on September 27, 2024

I think I should rephrase this. I figured Priam marks the node as dead in SimpleDB, but the problem is that it does not automatically remove the dead token from the ring. I killed a node to see what would happen. The auto scaling group brought up a new node, and Priam picked the dead node's token from SimpleDB, but it couldn't gracefully start the Cassandra service: Cassandra complained that the token still exists and that decommission or removetoken must be run. I had to manually run removetoken from a healthy node and restart Priam on the new node so that it would join the ring.
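The manual workaround described above could be scripted roughly as follows. This is a minimal sketch that only builds the `nodetool removetoken` command line; the helper name `removetoken_command`, the host address, and the token value are placeholders invented for the example, not values from this thread.

```python
import shlex


def removetoken_command(healthy_host, dead_token):
    """Build the nodetool call that asks a healthy node to drop a dead token."""
    return ["nodetool", "-h", healthy_host, "removetoken", str(dead_token)]


# Placeholder host and token; substitute the values shown by "nodetool ring".
cmd = removetoken_command("10.0.0.12", "56713727820156410577229101238628035242")
print(shlex.join(cmd))
```

On a machine where `nodetool` is installed, the list can be passed to `subprocess.run(cmd, check=True)`. Restarting Priam on the replacement node is a separate step that this sketch does not cover.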

sagarl commented on September 27, 2024

Hi Arya,

I ran the test in the following order:

  1. Created a new 9-node cluster (3 nodes in each zone)

  2. Terminated one of the 9 running instances
    a. Ran the "ring" command, which displayed the terminated node as Down

  3. The ASG brought up another instance
    a. After the node was up and running, ran the "ring" command again, which displayed the new node using the same token as the dead node

Let me know if this works for you.
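The check in step 3 can be expressed as a comparison of two ring snapshots taken before and after the replacement. This is only a sketch: the snapshot layout (token mapped to an address and status pair), the function name `reused_tokens`, and the sample addresses are assumptions for illustration and are not parsed from real "ring" output.

```python
def reused_tokens(before, after):
    """Return tokens present in both snapshots whose owning address changed."""
    return sorted(
        token
        for token, (address, _status) in after.items()
        if token in before and before[token][0] != address
    )


# Made-up snapshots: the node at 10.0.0.2 was terminated and then replaced.
before = {"0": ("10.0.0.1", "Up"), "100": ("10.0.0.2", "Down")}
after = {"0": ("10.0.0.1", "Up"), "100": ("10.0.0.9", "Up")}
print(reused_tokens(before, after))  # prints ['100']
```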

sagarl commented on September 27, 2024

Hopefully the above explanation has resolved the issue, hence closing it.

dehora commented on September 27, 2024

Created #161 instead of commenting on a closed ticket.

aryanet commented on September 27, 2024

Sagar, this is still an issue. I think Netflix tooling may be hiding the nodetool removetoken step from you. I cannot find where this is done in Priam, and as you can see, dehora found the same issue. The replacement mechanism does not follow the docs, and it has hit me in the past. Please take a look at #159. I think a replacement node should never claim the same token.
