Giter Site home page Giter Site logo

dramaanalysis's People

Contributors

gitter-badger avatar nilsreiter avatar pagelj avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar

dramaanalysis's Issues

Read directly from XMI files

It should be possible to load data directly from a directory containing XMI files. Options:

  • Integrate a Glassfish server that runs locally, call it via URL
  • Call java code directly from R

The latter is probably easier to maintain and distribute, but needs a lot more new code.

Reporting is broken

The file Report.Rmd can not be found, because it is not installed in the correct directory.

Handle multiple document collections

It should be possible to work with texts from multiple collections side by side

  • Add corpus column
  • Switch figure to cast figure
  • Map CastFigure to Figure

Identify graph core

Identifying the independent set based on all drama characters would be cool.

Idea from Solomon Marcus:

Für die Szenenführung ist der Begriff des Kerns von besonderer Bedeutung. Die szenische Anlage eines Theaterstücks impliziert die Wahl eines Kerns von Charakteren, d.h. des Kerns eines Graphen, auf den sich die gesamte Aufführung stützt. Meistens läßt ein (selbst endlicher) Graph mehrere Kerne zu; es kann aber auch vorkommen, daß er überhaupt keinen Kern zuläßt.
(Marcus, 1973)

warning message for n750.0 and rhzq.0

Warning messages:
1: In scan(file = file, what = what, sep = sep, quote = quote, dec = dec,  :
  invalid input found on input connection 'http://zwergdrossel.ims.uni-stuttgart.de:8080/drama.web/annotations/n750.0/de.unistuttgart.ims.drama.api.Utterance/de.tudarmstadt.ukp.dkpro.core.api.segmentation.type.Token'
2: In scan(file = file, what = what, sep = sep, quote = quote, dec = dec,  :
  invalid input found on input connection 'http://zwergdrossel.ims.uni-stuttgart.de:8080/drama.web/annotations/rhzq.0/de.unistuttgart.ims.drama.api.Utterance/de.tudarmstadt.ukp.dkpro.core.api.segmentation.type.Token'

Personnel exchange

Integrate code to calculate

  • Drama change rate
  • Scenic difference
  • Hamming distance

into the package.

Improve meta data export

multiple authors or dates

  • Dates are now a feature of the Drama FS and thus accessible directly
  • Author or Translator annotations can be extracted using getAnnotations() (verification needed)

Export as CSV is broken in some cases

When utterances don't have tokens in them, we get a line with less columns than expected. They should be either removed entirely or filled with null-values.

Drama length

When quadrama/DramaNLP#10 is done, we can now longer count the tokens in the table to get the length of the drama, as some tokens might be present twice (if uttered by multiple figures)

Meta data loading

There is currently no way to load meta data (e.g., publication date or title).

Handling of sets

Sets are not corpora (like TextGrid), but sub-corpora, like "Bürgerliches Trauerspiel".

Sets are defined in plain text files that list ids. Currently, they are stored in /metadata/collections.

  • Installing data locally

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.