Comments (4)
Hello and thank you,
Please read my answer to a similar question in #16 for possible solutions that do not require code changes in Drain3 itself.
Adding multithreading/multiprocessing support to Drain3 would be a very welcome contribution. However, I am not sure that multithreading will provide much value here because of the Python GIL, so we will need to do a PoC and measure the performance improvement. Multiprocessing with child processes and shared memory might be a better option performance-wise, but it's not trivial to implement either.
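A rough sketch of what such a PoC measurement could look like is below. It does not use Drain3 at all: `busy_parse` is a made-up stand-in for CPU-bound, pure-Python parsing work, and `compare` is a hypothetical helper that times the same workload sequentially and on a thread pool. Treat the numbers it prints as specific to your machine and interpreter.

```python
import time
from concurrent.futures import ThreadPoolExecutor


def busy_parse(line: str) -> int:
    """Stand-in for CPU-bound parsing work (NOT Drain3 code)."""
    total = 0
    for _ in range(200):
        for ch in line:
            total = (total * 31 + ord(ch)) % 1_000_003
    return total


def timed(label, run):
    """Run a zero-argument callable, print and return its wall-clock time."""
    start = time.perf_counter()
    result = run()
    elapsed = time.perf_counter() - start
    print(f"{label}: {elapsed:.3f}s")
    return result, elapsed


def compare(lines, workers=4):
    """Time the same workload sequentially and on a thread pool."""
    seq, t_seq = timed("sequential", lambda: [busy_parse(l) for l in lines])

    def threaded():
        with ThreadPoolExecutor(max_workers=workers) as pool:
            return list(pool.map(busy_parse, lines))

    thr, t_thr = timed(f"{workers} threads", threaded)
    assert seq == thr  # both strategies must produce identical results
    return t_seq, t_thr
```

Calling `compare(["some log line"] * 5000)` prints both timings. To try processes instead, `ProcessPoolExecutor` from the same module can be swapped in, but the call then has to sit under an `if __name__ == "__main__":` guard on platforms that spawn child processes.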
A possible direction to start with: since the vast majority of logs should match an existing template, and a new or changed template is pretty rare, it's possible to process almost all logs concurrently. Only when one Drain3 instance detects that a log requires a change in the parse tree will it skip this log and hand it over to the main Drain3 instance, which will process it and update the tree, then instruct the child instances to sync their state.
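The routing part of this idea could be sketched roughly as follows. This is not an existing Drain3 feature. `split_batch` and `apply_handoff` are hypothetical helper names, and the sketch only assumes that the miner objects expose `match(line)` (returning `None` when no existing template fits) and `add_log_message(line)` (returning a dict with a `"change_type"` key), which is how `TemplateMiner` behaves in the Drain3 versions I am aware of.

```python
from typing import Any, Iterable, List, Tuple


def split_batch(child: Any, lines: Iterable[str]) -> Tuple[List[Tuple[str, Any]], List[str]]:
    """Fast path, safe to run in many children: match only, never modify the tree."""
    matched, handoff = [], []
    for line in lines:
        cluster = child.match(line)  # assumed: None when no template fits
        if cluster is None:
            handoff.append(line)     # needs a tree change, defer to main
        else:
            matched.append((line, cluster))
    return matched, handoff


def apply_handoff(main: Any, handoff: Iterable[str]) -> bool:
    """Slow path, main instance only: train on deferred lines.

    Returns True if any line created or changed a template, meaning the
    children should be told to sync their state.
    """
    changed = False
    for line in handoff:
        result = main.add_log_message(line)
        if result["change_type"] != "none":
            changed = True
    return changed
```

Each child would call `split_batch` on its share of the logs and send the `handoff` list to the main process, which calls `apply_handoff` and triggers a sync when it returns `True`. Note that in this sketch lines handled on the fast path are never seen by the main instance, so its cluster sizes would undercount.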
from drain3.
Thanks David for your response.
"it's possible to process almost all logs concurrently, and only when one Drain3 instance detects that it requires a change in the parse tree, it will avoid this log, and hand it over to the main ..."
I think this suggestion makes sense.
Does Drain3 have a built-in mechanism to support a main Drain3 instance that ingests template changes from child instances and updates the tree?
I think I have figured it out. The docs say to use inference mode: https://github.com/IBM/Drain3#training-vs-inference-modes
Thanks
Correct, the match() function can be used to determine whether a log already matches an existing template. However, you will have to implement the synchronization between the main and child Drain3 instances yourself.
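One possible way to approach that synchronization is to ship the main instance's serialized state to each child and reload it there. The sketch below is only an illustration under several assumptions: `export_state` and `import_state` are hypothetical helpers, both miners are assumed to have been created with an in-memory persistence handler (such as Drain3's `MemoryBufferPersistence`) that keeps the snapshot in a `state` attribute, and `save_state(reason)` / `load_state()` are assumed to behave as they do on `TemplateMiner` in the versions I am aware of. Please verify against the Drain3 version you use.

```python
from typing import Any


def export_state(main: Any, reason: str = "sync") -> bytes:
    """Snapshot the main miner and return the serialized state.

    Assumes main.persistence_handler stores the snapshot in `.state`.
    """
    main.save_state(reason)
    return main.persistence_handler.state


def import_state(child: Any, state: bytes) -> None:
    """Replace the child's state with a snapshot taken from the main miner.

    Assumes the child uses the same kind of in-memory persistence handler.
    """
    child.persistence_handler.state = state
    child.load_state()
```

The returned bytes can be sent to child processes over a queue or pipe. Reloading the full state on every change is heavier than sending a delta, but that may be acceptable if template changes are as rare as expected.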