inseefr / arc
Workbench for acquisition and normalization of data sets
License: MIT License
There is a lot of French in the application (documentation and the names of fields, variables, methods and classes) that needs to be translated into English.
The documentation carries a prominent warning: ARC depends on an INSEE-internal Java component. This dependency makes the whole application unusable.
When will a solution be provided?
The main documentation (provided through the README.md) is useful but perhaps a bit dense for a first look at what ARC is about.
As open-source projects generally do (1), we could provide a first README that would:
(1) See for example the TensorFlow GitHub repo or the one for Airflow.
SonarCloud will stop supporting Java 8 in October 2020.
This is related to the JDK version declared in .travis.yml (openjdk8).
The existing Docker image for ARC only handles the deployment of a pre-built WAR archive into a Tomcat application server.
Good practice nowadays is to include the build step in the image (a.k.a. a build container) so that there is a single, overarching build tool.
Ref : https://docs.docker.com/develop/develop-images/dockerfile_best-practices/
Example : https://github.com/I3S-ESSnet/is2/blob/master/app.Dockerfile
Some unit tests currently need a real connection to a running database (see for example this one), and no such database exists in a CI context.
For the moment we have disabled those tests, but we want to be able to run them in the near future. For that we need a DB test strategy:
We are asked to generate test reports. Any idea which solution to implement, Romain or HealerMikado?
Also, is it possible to implement the GitLab CI integration on GitHub the same way we do in our internal repository?
My IDE tells me that BiUntokenizer is never used.
Is it dead code or work in progress? In the first case, we should remove the interface, shouldn't we?
Make ARC able to retrieve data from several executor nodes
Check whether the user is a non-admin user and make the database initialization script handle a non-superuser
Dear ARC Team,
I am trying to install ARC on a Windows machine. I execute the 'docker-compose up' command, but I get the following error message:
Step 8/15 : ARG LOG_PATH=logs/log-arc.log
 ---> Running in d175228b6d89
Removing intermediate container d175228b6d89
 ---> 3e420a404f9b
Step 9/15 : ARG LOG_LEVEL=ERROR
 ---> Running in 67e1109380ac
Removing intermediate container 67e1109380ac
 ---> fa0955e40c84
Step 10/15 : ARG LOG_SETTINGS=fr/insee/config/log4j.xml
 ---> Running in 1911904ebd8f
Removing intermediate container 1911904ebd8f
 ---> ed1df1c30c6a
Step 11/15 : COPY . /usr/src/app/
 ---> c27f84c77339
Step 12/15 : RUN chmod +x usr/src/app/script.sh && usr/src/app/script.sh
 ---> Running in 1f9f3cc346c4
/bin/sh: usr/src/app/script.sh: not found
ERROR: Service 'app' failed to build: The command '/bin/sh -c chmod +x usr/src/app/script.sh && usr/src/app/script.sh' returned a non-zero code: 127
I'm currently translating the classes in utils.sqlengine and I sadly noticed the lack of tests, for example tests related to the table naming scheme.
Nowadays it is nearly mandatory to provide a code base with an extensive test suite, at least to build confidence. Tests also help in understanding the inner workings of the software.
Romain, could you please take care of the following modules in the package ARC/arc-utils/src/main/java/fr/insee/arc/utils/ ?
ressourceUtils
sqlengine
structure
textUtils
utils
webutils
Thank you
Originally posted by @Nolife999 in #7 (comment)
Switching to Log4j2 as it was already declared in project dependencies
Franck, could you please take care of the following modules in the package ARC/arc-utils/src/main/java/fr/insee/arc/utils/ ?
batch
bdd
dao
files
format
Thank you
Originally posted by @Nolife999 in #7 (comment)
I have just read your very interesting documentation. Since it carries a warning that it is incomplete, I am taking the liberty of opening this issue because I am unsure about the features offered.
Are flat files with positional records supported by the loaders?
Like this one?
123456789012345678901234567890123456789012345678901234567890123456789012345
US Alice Smith 123 Maple Street Mill Valley CA 90952
I don't think so. I could only find information about key-value files, XML and CSV.
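To make the question concrete: a positional record is sliced by column offsets rather than split on a separator. Below is a minimal sketch in Java, assuming a hypothetical layout. The class name, field names and offsets are all illustrative and are not part of ARC's loader API.

```java
import java.util.LinkedHashMap;
import java.util.Map;

/** Hypothetical fixed-width record parser; illustrative only, not ARC's loader API. */
final class PositionalRecordSketch {

    /** One field of the layout: a name and its [start, end) column offsets (0-based). */
    static final class Field {
        final String name;
        final int start;
        final int end;

        Field(String name, int start, int end) {
            this.name = name;
            this.start = start;
            this.end = end;
        }
    }

    /** Slices a line by column offsets and trims the padding around each value. */
    static Map<String, String> parse(String line, Field... layout) {
        Map<String, String> values = new LinkedHashMap<>();
        for (Field field : layout) {
            // Clamp offsets so that a short line yields empty values instead of an exception.
            int start = Math.min(field.start, line.length());
            int end = Math.min(field.end, line.length());
            values.put(field.name, line.substring(start, end).trim());
        }
        return values;
    }

    public static void main(String[] args) {
        // Assumed layout: country in columns 0-1, name in 2-16, zip code in 17-21.
        String line = String.format("%-2s%-15s%-5s", "US", "Alice Smith", "90952");
        Map<String, String> record = parse(line,
                new Field("country", 0, 2),
                new Field("name", 2, 17),
                new Field("zip", 17, 22));
        System.out.println(record);
    }
}
```

The layout is passed in as data, so a loader built along these lines could read the offsets from configuration instead of hard-coding them.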
Simply suggesting here the use of a dedicated REST API framework (Jersey is a good start) instead of implementing it from scratch with Servlets.
This way we will benefit from related technology, such as JSON parsing and serialization/deserialization.
In model family management, digits (0, 1, 2, ..., 9) cannot be used in the table name, although they should be allowed.
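If digits are meant to be accepted, the check could look like the sketch below. It assumes a rule of "a letter or underscore first, then letters, digits or underscores". ARC's actual validation rule is not shown in this issue, so the class name and the pattern are hypothetical.

```java
import java.util.regex.Pattern;

/** Hypothetical table-name check; ARC's actual validation rule is not shown in this issue. */
final class TableNameCheckSketch {

    // Assumption: a letter or underscore first, then letters, digits or underscores.
    private static final Pattern VALID_NAME = Pattern.compile("[A-Za-z_][A-Za-z0-9_]*");

    /** Returns true when the whole name matches the assumed rule. */
    static boolean isValid(String tableName) {
        return tableName != null && VALID_NAME.matcher(tableName).matches();
    }

    public static void main(String[] args) {
        System.out.println(isValid("model_2020")); // digits after the first character
        System.out.println(isValid("2020_model")); // digit in first position
    }
}
```

Rejecting a leading digit keeps the name usable as an unquoted SQL identifier in most databases, while digits elsewhere in the name pass.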
The package naming is a bit curious here: underscores ( _ ) could be replaced by dashes ( - ), and I also think that the base name should be "arc" instead of "arc_essnet".
The initial version of the application relied on an Insee-internal solution for authentication.
We should now provide a generic solution:
A French user guide exists (50 pages of great quality); we should translate it into English.
To increase the transparency of the project, I suggest adding at least:
As experienced in the last few PRs (see https://travis-ci.org/InseeFr/ARC/pull_requests), CI is broken for pull requests because Sonar tries to send its results to localhost instead of SonarCloud. This seems to be caused by GH-32.
The Sonar call in .travis.yml uses the profile "coverage":
mvn clean verify sonar:sonar -Pcoverage
This profile is absent, resulting in the following warning :
[WARNING] The requested profile "coverage" could not be activated because it does not exist.
This might or might not be related to Sonar not picking up test coverage reports.
Here is a working example of a pom with a coverage profile for reference: https://github.com/SonarSource/sq-com_example_java-maven-travis/blob/master/pom.xml
Hi,
I'm trying to deploy the arc-web.war on Tomcat 8, but I get this error:
01-Oct-2020 08:51:13.287 SEVERE [localhost-startStop-1] org.apache.catalina.core.StandardContext.listenerStart Exception sending context initialized event to listener instance of class org.springframework.web.context.ContextLoaderListener
org.springframework.beans.factory.UnsatisfiedDependencyException: Error creating bean with name 'gererFamilleNormeAction': Unsatisfied dependency expressed through field 'properties'; nested exception is org.springframework.beans.factory.BeanCreationException: Error creating bean with name 'properties' defined in class path resource [applicationContext.xml]: Error setting property values; nested exception is org.springframework.beans.PropertyBatchUpdateException; nested PropertyAccessExceptions (1) are:
PropertyAccessException 1: org.springframework.beans.MethodInvocationException: Property 'logConfiguration' threw exception; nested exception is java.lang.NullPointerException
at org.springframework.beans.factory.annotation.AutowiredAnnotationBeanPostProcessor$AutowiredFieldElement.inject(AutowiredAnnotationBeanPostProcessor.java:643)
at org.springframework.beans.factory.annotation.InjectionMetadata.inject(InjectionMetadata.java:130)
Regards,
Francesco
I just had a few issues setting up the project, mostly linked to Maven fetching an RC version of Spring 5. This is consistent with the range currently defined in the pom, [4.3.17,) (any version greater than or equal to 4.3.17).
My take on this is that, at the very least, the version range should have an upper bound to prevent any major version change (as here, from 4 to 5). Major version changes, by definition, do not guarantee backward compatibility.
That would be for instance [4.3.17,4.9999)
(cf https://medium.com/@MichaKutz/legit-but-useless-maven-version-ranges-explained-d4ba66ac654)
More generally, I feel that any version range still has the major drawback of reducing reproducibility and portability.
In the textUtils package, three interfaces hold various constants (see for example IConstanteCaractere).
Why don't we simply use classes for that? Or better, enums (see that post for reference)!
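As an illustration of the enum option, here is a minimal sketch. The enum name, its members and their values are hypothetical and are not taken from IConstanteCaractere.

```java
/** Hypothetical enum replacing a constants interface; names and values are illustrative. */
enum SeparatorSketch {
    COMMA(","),
    SEMICOLON(";"),
    TAB("\t");

    private final String value;

    SeparatorSketch(String value) {
        this.value = value;
    }

    /** The literal character sequence this constant stands for. */
    String value() {
        return value;
    }
}
```

Call sites would then read SeparatorSketch.COMMA.value(). Unlike a constants interface, the enum cannot be implemented by other classes just to inherit its names, and its members can be enumerated with values().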