lsds / seep Goto Github PK
View Code? Open in Web Editor NEWCluster computing using Stateful Dataflow Graphs
Cluster computing using Stateful Dataflow Graphs
The scale out mechanism is chosen from the checkpoint mechanism. This means that only one can be used at a given time for a given query, which forces all the stateful operators of the query to use the same checkpointing mechanism.
Initial work is done, check and validate.
Hello, SEEP Team.
I tried to build you project. I am running into the problem, that the following dependency was not available via the default public Maven repositories. Is there a repository declaration missing, or is it necessary to manually install the dependency into the local repository?
<dependency>
<groupId>soot</groupId>
<artifactId>soot-framework</artifactId>
<version>2.5.0</version>
</dependency>
Thank you very much.
Include Kryo and Metrics as external dependencies. So far it's been good enough with Ant, but now I will include some java collections external implementation, so it makes sense to migrate the project to Maven.
In particular, the problem is how to understand how many of the total number of upstream operators have to synchronise in the barrier. (These are those that have scaled out from the same original operator).
On branch master
Your branch is up-to-date with 'origin/master'.
Changes not staged for commit:
modified: seep-java2sdg/.settings/org.eclipse.jdt.core.prefs
modified: seep-system/.settings/org.eclipse.jdt.core.prefs
Untracked files:
seep-java2sdg/.settings/org.eclipse.core.resources.prefs
seep-java2sdg/.settings/org.eclipse.m2e.core.prefs
seep-streamsql/.settings/
seep-streamsql/target/
seep-system/.settings/org.eclipse.core.resources.prefs
seep-system/.settings/org.eclipse.m2e.core.prefs
seep-system/examples/stateless-simple-query/dist/
seep-system/libs/
These could be deleted or git ignored.
There are different layers in this package. It has to be consistent with the rest of the codebase.
Implications on hop-by-hop semantics.
Correct but not sure impact on performance, in particular when one of the upstream have scaled out and N threads compete for a queue, while only 1 (potentially) competes for its unique queue.
Any optimisation regarding which "logical" operators should be placed in the same node is a high-level optimisation. Therefore, when deploying operators in the cluster, each "deployable" operator may implement the body of N "logical" operators, but it is always only one operator being deployed. This reduces complexity at deployment time.
With this done, it is necessary to figure out how to include output buffers in the backup.
For example, if there is a failure due to lack of sources or sinks, indicate this. Or if there is a declared stateful operator without state. This would already increase the usability of the current interface
It's highly improbable that you want to leverage multicore when you are synchronising in a barrier with a dynamic number of threads, and still has another number N of threads to support more ingestion modes.
However, it would be nice to at least provide these support for modes that are one-at-a-time for example.
I put both seep-system-0.0.1-SNAPSHOT.jar and stateless-simple-query.jar in the dist folder and run:
java -jar seep-system-0.0.1-SNAPSHOT.jar Master stateless-simple-query.jar Base
It gives me the following errors:
java.lang.ClassNotFoundException: Base
at java.net.URLClassLoader$1.run(URLClassLoader.java:366)
at java.net.URLClassLoader$1.run(URLClassLoader.java:355)
at java.security.AccessController.doPrivileged(Native Method)
at java.net.URLClassLoader.findClass(URLClassLoader.java:354)
at java.lang.ClassLoader.loadClass(ClassLoader.java:425)
at java.lang.ClassLoader.loadClass(ClassLoader.java:358)
at uk.ac.imperial.lsds.seep.infrastructure.master.MasterController.executeComposeFromQuery(MasterController.java:153)
at uk.ac.imperial.lsds.seep.Main.executeMaster(Main.java:69)
at uk.ac.imperial.lsds.seep.Main.main(Main.java:44)
I know it can not locate the Base class. Could you please tell me where is the problem? Thanks.
Make both workers and master to poll until system is ready to deploy. This is in contrast to enforce an ordering when deployint the system. (master -> workers)
This will avoid an annoying but not harmful exception thrown by backupHandler when deploying in local mode.
Right before and after receiving state chunks in all the receivers of the star topology. It does not break the thread or the program, but they are always thrown.
Provide a convenient interface for serialisation/deserialisation that accepts different mechanisms.
When manually scaling out, some upstream operators are not registered down. Operators cannot forward ACKs upstream, and thus upstream output buffers fill.
It may not make sense to use it if there is only one stream. However, a meaningful message should warn the user that the call will fall back to the default send in such case.
A declarative, efficient, and flexible JavaScript library for building user interfaces.
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. ๐๐๐
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google โค๏ธ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.