Giter Site home page Giter Site logo

qumomf's People

Contributors

poolnam avatar pparshin avatar shmel1k avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar

qumomf's Issues

Anti-flapping protection

Avoid flapping (cascading failures causing continuous outage and elimination of resources) by introducing a block period, where on any given replica set, qumomf will not kick in automated recovery on an interval smaller than said period.

Implement auto discovery of cluster

Using vshard.storage.info or vshard.router.info we might auto discover the cluster topology. User would just provide credentials and initial endpoint to connect.

Each cluster should have a unique name to identify them. Names and other meta information we can store to a database backend like SQLite.

API

What should be implemented:

  • Building a cluster topology (Cluster structure and basic data on nodes).
  • Viewing information on a separate node
  • Getting a list of recoveries
  • Getting information on individual recovery
  • Getting a list of active problems

Filter out replicas with lag more than user specified value

We should provide an options to filter out replicas which are too lagging from master election:

  1. Max idle value, in seconds
  2. Max LSN behind the master.

It will allow users to control the master election behaviour and fail the promotion if chosen replica is too far from master.

Make a decision to run failover using consensus

The idea is similar to the logic realized in MySQL Orchestrator.

Current implementation: qumomf decides to switch replica role to master if it lost the connection to old master. It might lead to a false failover.

I suggest to analyze replica alerts and if they notify that replication is broken too then qumomf should start a failover process.

Add readonly option

For testing purpose it would be useful to have a readonly option. If that option is enabled, qumomf will not change cluster topology in any way.

Add health check

To monitor the qumomf status and getting its info like version, built time, etc.

Improve logging

  • lower the log level for non-essential logs to the debug level
  • log only in case of a change in the topology or state of the cluster
  • log instance URI

Notify about concrete dead follower

Subj.

{"level":"warn","cluster":"order_control","replica_set":"5065fb5f-5f40-498e-af79-43887ba3d1ec","master_uri":"some_master_uri", ,"message":"Master is reachable but some of its replicas are not replicating. No actions will be applied."}

Can you please add some additional information about replicas that are failing to replicate?

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.