
parallel log ingestions (drain3, closed)

logpai commented on July 21, 2024
parallel log ingestions

from drain3.

Comments (4)

davidohana commented on July 21, 2024

Hello and thank you,

Please read my answer to a similar question here: #16 for possible solutions that do not require code changes in Drain3 itself.

Adding multithreading/multiprocessing support to Drain3 would be a very welcome contribution. However, I am not sure that multithreading will provide much value here because of the Python GIL, so we will need to do a PoC and measure the performance improvement. Multiprocessing with child processes and shared memory might be a better option performance-wise, but it's not trivial to implement either.

A possible direction to start with: since the vast majority of logs should match an existing template, and a new or changed template is pretty rare, it's possible to process almost all logs concurrently. Only when one Drain3 instance detects that a log requires a change in the parse tree does it skip that log and hand it over to the main Drain3 instance, which processes it and updates the tree, then instructs the child instances to sync their state.
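To make the hand-off idea concrete, here is a minimal single-process simulation of the routing logic only. `ToyMiner`, `learn()` and `process()` are hypothetical names made up for this sketch and are not part of Drain3; the "sync" step is simply copying a set, whereas real child processes would need some form of state transfer.

```python
# Toy stand-in for a template miner. This is NOT Drain3's API; it only
# mimics "match against known templates" vs. "learn a new template".
class ToyMiner:
    def __init__(self, templates=None):
        self.templates = set(templates or [])

    @staticmethod
    def _template_of(line):
        # Crude masking: replace every token that contains a digit with <*>.
        return " ".join(
            "<*>" if any(c.isdigit() for c in tok) else tok
            for tok in line.split()
        )

    def match(self, line):
        # Read-only lookup: returns the template or None, never mutates state.
        template = self._template_of(line)
        return template if template in self.templates else None

    def learn(self, line):
        # Mutating path: in this sketch only the main instance calls it.
        template = self._template_of(line)
        self.templates.add(template)
        return template


def process(lines, main, children):
    """Route each line to a child; on a miss, hand over to main, then sync."""
    results = []
    handoffs = 0
    for i, line in enumerate(lines):
        child = children[i % len(children)]  # round-robin assignment
        template = child.match(line)
        if template is None:
            # The child would need to change the tree, so it hands the line over.
            template = main.learn(line)
            handoffs += 1
            # Sync: every child receives a copy of the main instance's state.
            for c in children:
                c.templates = set(main.templates)
        results.append(template)
    return results, handoffs


main = ToyMiner()
children = [ToyMiner(), ToyMiner()]
lines = [
    "connected to 10.0.0.1",
    "connected to 10.0.0.2",
    "user 42 logged in",
    "connected to 10.0.0.3",
    "user 7 logged in",
]
results, handoffs = process(lines, main, children)
print(handoffs)  # 2: only the first line of each new shape reaches main
```

The loop runs sequentially, so it shows who handles which line rather than any speedup; whether a real multiprocess version pays off would still need the PoC and measurements mentioned above.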


kwokon0ng commented on July 21, 2024

Thanks David for your response.

its possible to process almost all logs in concurrency, and only when one Drain3 instance detects
that it requires a change in the parse tree, it will avoid this log, and hand it over to the main

I think this suggestion makes sense.
Does Drain3 have a built-in mechanism to support a main Drain3 instance that ingests template changes from child instances and updates the tree?


kwokon0ng commented on July 21, 2024

I think I have figured it out. The docs say to use inference mode: https://github.com/IBM/Drain3#training-vs-inference-modes

thanks


davidohana commented on July 21, 2024

Correct, the match() function can be used to determine if a log already matches an existing template. However, you will have to implement the synchronization between main and child Drain instances.
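As a rough illustration of that check, the helper below partitions a batch of lines using only a `match()` call that returns a cluster or `None`. The helper name `split_by_match` is hypothetical, and the synchronization itself is deliberately left out, since it has to be implemented separately.

```python
def split_by_match(miner, lines):
    """Partition lines into (matched, deferred) using a read-only match() call.

    `miner` is any object with a match(line) method that returns a cluster
    for a known template and None otherwise.
    """
    matched, deferred = [], []
    for line in lines:
        cluster = miner.match(line)
        if cluster is None:
            # No existing template fits: leave this line for the main instance.
            deferred.append(line)
        else:
            matched.append((line, cluster))
    return matched, deferred
```

With Drain3 installed, `miner` would presumably be a child `TemplateMiner` used in inference mode, and the `deferred` lines would be passed to the main instance's `add_log_message()`, which is the call that can update the tree.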
