Giter Site home page Giter Site logo

Comments (6)

greenwoodma avatar greenwoodma commented on August 22, 2024

I think the main issue is that this is badly named. If I recall correctly the underlying code dates back to a time where we really only handled XML and HTML documents. The intention was to allow people to save their documents back into the original format but with extra XML/HTML elements. In this situation there is no need for a root element as part of the export as it will always be in the original document (which annotation set is a good question depending on use of annotation set transfer etc.).

The problem is that we have moved well beyond just HTML and XML files so this become "inline XML" instead of "save preserving format" (which I believe was it's original name). I think the default settings make perfect sense given the expected use case. It just falls apart when used on random document formats that aren't HTML/XML style.

I think the best solution would be to document it properly but move it into a plugin so it isn't a default output option.

from gate-core.

johann-petrak avatar johann-petrak commented on August 22, 2024

Documenting it would be the number 1 priority of course. But I think the default should never generate something that is invalid XML. The documentation could point all what you said out and describe how to change the parameters from the default so that one can avoid having a root element, if necessary, but I am, in general, against anything where we create a document by default which cannot even be read back into GATE without an exception!

from gate-core.

greenwoodma avatar greenwoodma commented on August 22, 2024

In principal I agree that producing files we can't open seems odd, but I think breaking the behaviour on something that has been around so long is actually worse. Actually I don't think the root element issue is the worst thing about this format; the fact that it silently throws away annotations that partially overlap leading to loss of information is far more worrying. At least the missing root element is trivial to fix with 30 seconds and a text editor.

from gate-core.

johann-petrak avatar johann-petrak commented on August 22, 2024

This may or may not be related to what to do about the "Flexible Exporter" PR in the Tools plugin.

from gate-core.

greenwoodma avatar greenwoodma commented on August 22, 2024

What were you planning on doing with the flexible exporter? Personally I've never liked the way that is part of a pipeline, I think exporting results should be a separate step

from gate-core.

greenwoodma avatar greenwoodma commented on August 22, 2024

Closed by GateNLP/userguide@9f01ead

from gate-core.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.