Giter Site home page Giter Site logo

dstl / baleen3 Goto Github PK

View Code? Open in Web Editor NEW
20.0 6.0 5.0 1.89 MB

Baleen 3 is a data processing tool based on the Annot8 framework

License: Apache License 2.0

Shell 0.01% JavaScript 0.71% Handlebars 0.09% HTML 0.19% TypeScript 66.31% Java 32.70%
java baleen annot8 data-processing

baleen3's People

Contributors

jbaker-dstl avatar stuarthendren avatar timcroydon avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar

baleen3's Issues

Support for Maps (Objects) in UI

Currently, you can't create maps (objects) via the UI when configuring components. This is an issue for some components, such as the Machine Translation setting, which requires a map of settings (which could change so can't be specified in advance).

There is some support for this in JSON Schema and React Json Schema Forms, but no support in the theme we're currently using: rjsf-team/react-jsonschema-form#1927

When the above issue is addressed, we should be able to update our UI dependencies and resolve this issue.

Error when saving the pipeline as described in help

I'm trying to create my first pipeline in Baleen.

I've followed the tutorial in the help to create my first pipeline (REST API as source, Email and Print as processors).

When I go to save the pipeline I get the following error:

ERROR: http://localhost:6413/api/v3/pipelines/Empty%20Pipeline2 404

I get the following in the console running Baleen:

2020-11-10 17:45:07.542 ERROR 21269 --- [   scheduling-1] u.g.d.baleen.services.PipelineService    : Unable to create pipeline Empty Pipeline2 from file pipelines/182bf282-ab13-4a67-addd-3079c4e38dd2.json

java.lang.NullPointerException: null
	at java.base/java.io.FileInputStream.<init>(FileInputStream.java:147) ~[na:na]
	at opennlp.tools.util.model.BaseModel.<init>(BaseModel.java:182) ~[annot8-components-opennlp-1.0.0-plugin.jar:na]
	at opennlp.tools.namefind.TokenNameFinderModel.<init>(TokenNameFinderModel.java:108) ~[annot8-components-opennlp-1.0.0-plugin.jar:na]
	at io.annot8.components.opennlp.processors.NER$Processor.<init>(NER.java:58) ~[annot8-components-opennlp-1.0.0-plugin.jar:na]
	at io.annot8.components.opennlp.processors.NER.createComponent(NER.java:38) ~[annot8-components-opennlp-1.0.0-plugin.jar:na]
	at io.annot8.components.opennlp.processors.NER.createComponent(NER.java:30) ~[annot8-components-opennlp-1.0.0-plugin.jar:na]
	at io.annot8.common.components.AbstractComponentDescriptor.create(AbstractComponentDescriptor.java:38) ~[annot8-components-base-text-1.0.0-plugin.jar:na]
	at io.annot8.common.components.AbstractComponentDescriptor.create(AbstractComponentDescriptor.java:10) ~[annot8-components-base-text-1.0.0-plugin.jar:na]
	at io.annot8.implementations.pipeline.SimplePipeline$Builder.lambda$build$1(SimplePipeline.java:319) ~[annot8-pipeline-implementation-1.0.1.jar!/:na]
	at java.base/java.util.stream.ReferencePipeline$3$1.accept(ReferencePipeline.java:195) ~[na:na]
	at java.base/java.util.ArrayList$ArrayListSpliterator.forEachRemaining(ArrayList.java:1654) ~[na:na]
	at java.base/java.util.stream.AbstractPipeline.copyInto(AbstractPipeline.java:484) ~[na:na]
	at java.base/java.util.stream.AbstractPipeline.wrapAndCopyInto(AbstractPipeline.java:474) ~[na:na]
	at java.base/java.util.stream.ForEachOps$ForEachOp.evaluateSequential(ForEachOps.java:150) ~[na:na]
	at java.base/java.util.stream.ForEachOps$ForEachOp$OfRef.evaluateSequential(ForEachOps.java:173) ~[na:na]
	at java.base/java.util.stream.AbstractPipeline.evaluate(AbstractPipeline.java:234) ~[na:na]
	at java.base/java.util.stream.ReferencePipeline.forEach(ReferencePipeline.java:497) ~[na:na]
	at io.annot8.implementations.pipeline.SimplePipeline$Builder.build(SimplePipeline.java:322) ~[annot8-pipeline-implementation-1.0.1.jar!/:na]
	at io.annot8.implementations.pipeline.InMemoryPipelineRunner.<init>(InMemoryPipelineRunner.java:76) ~[annot8-pipeline-implementation-1.0.1.jar!/:na]
	at uk.gov.dstl.baleen.services.PipelineService.createPipeline(PipelineService.java:263) ~[classes!/:3.0.1]
	at uk.gov.dstl.baleen.services.PipelineService.createPipelineFromFile(PipelineService.java:163) ~[classes!/:3.0.1]
	at uk.gov.dstl.baleen.services.PipelineService.detectChanges(PipelineService.java:202) ~[classes!/:3.0.1]
	at jdk.internal.reflect.GeneratedMethodAccessor87.invoke(Unknown Source) ~[na:na]
	at java.base/jdk.internal.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43) ~[na:na]
	at java.base/java.lang.reflect.Method.invoke(Method.java:567) ~[na:na]
	at org.springframework.scheduling.support.ScheduledMethodRunnable.run(ScheduledMethodRunnable.java:84) ~[spring-context-5.2.9.RELEASE.jar!/:5.2.9.RELEASE]
	at org.springframework.scheduling.support.DelegatingErrorHandlingRunnable.run(DelegatingErrorHandlingRunnable.java:54) ~[spring-context-5.2.9.RELEASE.jar!/:5.2.9.RELEASE]
	at java.base/java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:515) ~[na:na]
	at java.base/java.util.concurrent.FutureTask.runAndReset(FutureTask.java:305) ~[na:na]
	at java.base/java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.run(ScheduledThreadPoolExecutor.java:305) ~[na:na]
	at java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1128) ~[na:na]
	at java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:628) ~[na:na]
	at java.base/java.lang.Thread.run(Thread.java:835) ~[na:na]

I'm excited to try this tool - any help would be appreciated!

Failed to mvn clean package (could not resolve dependencies)

After having done a clean clone of the repository, and runnin mvn clean package I get the following:

[INFO] -------------------------< uk.gov.dstl:baleen >------------------------- [INFO] Building Baleen 3 3.0.2 [INFO] --------------------------------[ jar ]--------------------------------- [WARNING] The POM for io.annot8:annot8-conventions:jar:1.1.0 is missing, no dependency information available [WARNING] The POM for io.annot8:annot8-components-geo:jar:1.1.0 is missing, no dependency information available [WARNING] The POM for io.annot8:annot8-components-print:jar:1.1.0 is missing, no dependency information available
Then following this:
Failed to execute goal on project baleen: Could not resolve dependencies for project uk.gov.dstl:baleen:jar:3.0.2: The following artifacts could not be resolved: io.annot8:annot8-conventions:jar:1.1.0, io.annot8:annot8-components-geo:jar:1.1.0, io.annot8:annot8-components-print:jar:1.1.0: Failure to find io.annot8:annot8-conventions:jar:1.1.0 in https://repo.maven.apache.org/maven2 was cached in the local repository, resolution will not be reattempted until the update interval of central has elapsed or updates are forced

The expected result would have been a build success.

Current build environment is Ubuntu 20.04 but have also tried it as part of a dockerfile (which was from ubuntu:latest - which is probably also Ubuntu 20.04)

Does anyone know what's causing this? Does anyone else get a build success using this build or even commit a1739ed (which I thik is the last release version)?

Display loading page whilst pipeline is initialising

For pipelines that take longer to initialise, users get a 404 error after pipeline creation as Baleen tries to redirect them to the pipeline page before the pipeline is ready.

We need to look at ways to have the page available sooner, or at least to show a loading page during this period.

See #2 for some additional details.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.