markusleupold / bibtex-xml Goto Github PK

0.0 1.0 0.0 279 KB

Exam project in the subject "Dokumentbeschreibungssprachen" of my study

Haskell 61.56% Makefile 0.81% XSLT 37.63%

bibtex-xml's Introduction

BibTeX-XML

This project started as an exam project of my studies in the subject "Dokumentbeschreibungssprachen" (en. markup languages). The task was to develop an XML language for describing BibTeX databases, convert a given large database into that language and display it via a webbrowser using CSS and/or XSL Transformations.

Now, after the project has been submitted, some parts of it are serving me for my own software development training. The original database from the exam task has been left in the repository for being used as test data, but the original repository structure may have changed when you're reading this because there are no requirements to build a submission folder anymore.

Because this is only for my own training, this repository will implement things others may have done before (e.g. there is a bibtex package available for Haskell which is most likely much futher developed than my version).

bibtex-xml's People

Contributors

Watchers

bibtex-xml's Issues

Add XSD insertion feature

Scope: bibtoxml

Currently, bibtoxml only outputs BibTeX-XML documents without any DTD or XML-Schema. Command line options should be
provided to add a customizable XSD to the result document.

.gitignores shadow the second directory level

In a few gitignores, negated paths are only specified one level deep by using one asterisk. This only unignores the first level. To ignore all levels, we have to use a double asterisk !the/path/**

Check if @string elements in BibTeX databases can really contain multiple string definitions

Simplify the value parser

Scope: File bibtoxml/BibTeX/Parser.hs

Currently, the values are parsed using elements of type ValueParser, which are tuples of

a predicate function which determines if the parser can be applied to an input stream
the actual parser function which returns the raw parsed value as a String
a constructor function to turn the raw String value into something of type Value

We can simplify this by specifying a single parser which combines the functions of elements one and two from above. This parser has type String -> Maybe (String, String). It will evaluate to Just the raw value and the remaining input stream if it can be applied to the input stream and otherwise to Nothing. Lazy evaluation will stop each parser as soon as it has been determined, that this parser ist not applicable.

Typo in bibtoxml/src/BibTeX/Parser.hs (chaper instead of chapter)

In the field name parsing functions there is a typo in the "chapter" field name: It is written as "chaper"

Implement TeX-like token parsing in values to interpret TeX control sequences

BibTeX's natural environment is TeX, and because of that, the values of a BibTeX database are very likely to contain TeX control sequences. Simple examples would be:

\"o for the o umlaut
\bf for bold font text
,, for german opening quote marks (U+201E)

It's clear, that these control sequences or active characters should be expanded when the user sees the end result. The question is, when the expansion should actually be done. There are three possibilities:

During parsing of the database to the internal data structure of the BibTeX library. This means, that the BibTeX library must implement formatting information inside the values.
During output of the database as XML. Then the XML document type (DTT or XSD) must define a format to describe formatting information.
During XSL Transformation of the BibXML file to the HTML website. The XSLT then has to do some string parsing.

Add DTD insertion feature

Scope: bibtoxml

Currently, bibtoxml only outputs BibTeX-XML documents without any DTD or XML-Schema. Command line options should be
provided to add a customizable DTD to the result document.

Typo in bibtoxml/src/BibTeX: masterthesis vs. mastersthesis

The mastersthesis entry type contains a typo: The 's' in the middle is missing in the BibTeX library. Took a bit of time to find this out :D

data Value should be a Monoid

Scope: bibtoxml/src/BibTeX/Types.hs

In BibTeX, values can be concatenated to form a new value. BibTeX.Types supports this feature, but the implementation can be more precise: If you look closely, you will see that the set of all possible values forms a monoid together with the contatenation operation defined on it. Here's the proof:

Value is the set of all possible values. Each element v of Value has the data type Value and is constructed using one of the three constructors LiteralValue, ReferencedValue, and ComposedValue.

Also, we define the expansion operation e of a value:

e :: Value -> String

where e v is the expanded string representation of v (i.e. the meaning of v as ordinary text, with variable references replaced by their definition recursively)

Third, we define the concatenation operation <+> like following:

<+>: V x V --> V

where r = v1 <+> v2 is defined such that e r == e v1 ++ e v2.

The Value type in BibTeX.Types is not the actual value of a field. Semantically, a field's value is equal to the expansion of its Value element. The internal representation of a field's value therefore has to simulate the semantics of its expansion. This means, that concatenating Value elements is semantically equivalent to concatenating their expansions.
Because of that, Values inherit all characteristics from Strings which are based on the String concatenation (++). Strings form a Monoid with their concatenation, and therefore also Values do, q.e.d.

It would be a good idea to adapt the Value type according to Monoid laws and create a corresponding instance of Monoid. This will make the properties of Value easier to see and understand and therefore improve code quality.

markusleupold / bibtex-xml Goto Github PK

bibtex-xml's Introduction

BibTeX-XML

bibtex-xml's People

Contributors

Watchers

bibtex-xml's Issues

Add XSD insertion feature

.gitignores shadow the second directory level

Check if @string elements in BibTeX databases can really contain multiple string definitions

Simplify the value parser

Typo in bibtoxml/src/BibTeX/Parser.hs (chaper instead of chapter)

Implement TeX-like token parsing in values to interpret TeX control sequences

Add DTD insertion feature

Typo in bibtoxml/src/BibTeX: masterthesis vs. mastersthesis

data Value should be a Monoid

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent