independent highlight (tei-style-dta-context, OPEN)

juh2 commented on July 18, 2024
independent highlight

from tei-style-dta-context.

Comments (4)

juh2 commented on July 18, 2024

Here is an example.

<cit>
<quote>
<hi rendition="#et #aq">Si c&#x2019;est la raison, qui fait l&#x2019;homme,<lb/>
c&#x2019;est le sentiment, qui le conduit.
</hi>
</quote><lb/>
<bibl rendition="#right #g #k #aq">Rousseau</bibl>
</cit>

The first rendition has two values, the second four. The easiest thing would be if this worked:

\xmlsetsetup{#1}{contains(@rendition, '\letterhash right', '\letterhash g')]}{xml:right:g}

That is: if this and that are both in rendition, then name it foo. I haven't tried it this way.

Another way would be to have one-to-one relations between the xml attribute values and the ConTeXt constructs in a way that it is possible to concatenate them in the ConTeXt source. This would mean that we only have to xmlsetsetup them once and use them all together if needed.

Yes, it seems that they mix semantic and visual markup, at least in this example. But their goal is to mark up the text so that the digital version matches the appearance of the scanned edition (mostly the first edition). If you want to do a statistical analysis of the average line length, you need the <lb/>; if you want to pinpoint when people stopped emphasizing a passage with extended letters and began to use italic letters instead, you need these renditions in your source.

They document their TEI format for the text body here:
http://www.deutschestextarchiv.de/doku/basisformat_table

Parent is:
http://www.deutschestextarchiv.de/doku/

It is a huge TEI specification. I don't look at it, because I don't want to lose my enthusiasm. ;-)

ousia commented on July 18, 2024

> The first rendition has two values, the second four. The easiest thing would be if this worked:
>
> \xmlsetsetup{#1}
>     {contains(@rendition, '\letterhash right', '\letterhash g')]}
>     {xml:right:g}
>
> That is: if this and that are both in rendition, then name it foo. I haven't tried it this way.

Not sure I understand the whole thing, but something like this should work:

\xmlsetsetup{#1}
    {[contains(@rendition,'\letterhash right') and
      contains(@rendition,'\letterhash g')]}
    {xml:right:g}

Wouldn’t a modular matching approach be better? I mean, each rendition value gets an xmlsetsetup of its own.

I think modular matching makes mixing easier. If I’m not mistaken, with your approach above you’ll have to code every possible combination.
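To make the modular idea concrete, here is a minimal sketch in Python rather than ConTeXt. It is only an illustration of the lookup logic, not a tested ConTeXt solution. It splits each rendition attribute into its values and looks every value up in a one-to-one table. The table and the setup names in it (xml:rendition:...) are made up for this example; only the rendition values come from the sample above.

```python
import xml.etree.ElementTree as ET

# Hypothetical one-to-one table: one entry per rendition value.
# The setup names on the right are invented for this sketch.
SETUP_FOR_TOKEN = {
    "#et": "xml:rendition:et",
    "#aq": "xml:rendition:aq",
    "#right": "xml:rendition:right",
    "#g": "xml:rendition:g",
    "#k": "xml:rendition:k",
}


def setups_for(element):
    """Return one setup name per whitespace-separated rendition value."""
    tokens = element.get("rendition", "").split()
    return [SETUP_FOR_TOKEN[t] for t in tokens if t in SETUP_FOR_TOKEN]


# The sample from the first comment, slightly condensed.
SAMPLE = """<cit>
<quote><hi rendition="#et #aq">Si c'est la raison, qui fait l'homme,<lb/>
c'est le sentiment, qui le conduit.</hi></quote><lb/>
<bibl rendition="#right #g #k #aq">Rousseau</bibl>
</cit>"""

root = ET.fromstring(SAMPLE)
hi_setups = setups_for(root.find("quote/hi"))
bibl_setups = setups_for(root.find("bibl"))

print(hi_setups)
print(bibl_setups)
```

With five rendition values the table has five entries, and any combination of them is covered without extra code. Because the attribute is split on whitespace, whole values are compared, so "#g" cannot accidentally match inside a longer value.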

As for the DTA's text encoding style, I must confess that I dislike it. I don’t think it is totally illegitimate. But I’m afraid they’re missing a good opportunity to encode texts with higher quality (I mean, something that could replace this edition).

If their TEI usage specification is huge, please never consult the P5 Guidelines themselves. They are really huge 😃

juh2 commented on July 18, 2024

Cool. Good to know that AND (and probably OR) work in setups. You are right, a modular approach would be better. I called it a one-to-one solution, which is a confusing name. But I am not sure whether we can concatenate attributes in ConTeXt the way they do in the XML attributes.

I am not a TEI expert, so I cannot judge their usage. But I know that they have problems providing ebooks. I recently converted some texts to EPUB via Pandoc. After problems with the XHTML version I used the plain text version, because it was the easiest option.

ousia commented on July 18, 2024

> Cool. Good to know that AND (and probably OR) work in setups. You are right, a modular approach would be better. I called it a one-to-one solution, which is a confusing name. But I am not sure whether we can concatenate attributes in ConTeXt the way they do in the XML attributes.

or is already used in pandoc-xhtml.tex.

> I am not a TEI expert, so I cannot judge their usage. But I know that they have problems providing ebooks. I recently converted some texts to EPUB via Pandoc. After problems with the XHTML version I used the plain text version, because it was the easiest option.

pandoc has no specific reader for TEI (only a writer is planned).

To generate an ePub document from TEI sources, I’d rather use the tools from TEI, not pandoc.
