Giter Site home page Giter Site logo

eric-waldman / ieml-language Goto Github PK

View Code? Open in Web Editor NEW

This project forked from iemldev/ieml-language

0.0 0.0 0.0 17.88 MB

The IEML language database. A git database containing the translations for IEML expressions: USL (Uniform Semantic Locator)

Home Page: https://iemldev.github.io/ieml-language

ieml-language's Introduction

ieml-language

Overview

This is the repository for the IEML (information economy metalanguage) language database.

In this repository, you will find structures files, that defines the semantic relationship between the basic elements of the language - the morphemes, and syntagmatic relationship between the morphemes to form words :

  • structure/dictionary : the structure of IEML basic semantics units, the morphemes. It express each morpheme in a set of basic opposition with others morphemes (good/bad, start/ongoing/end, tall/small etc...). Each opposition system is called a paradigm, and the positions that the morphemes have into theirs paradigms defines theirs meaning. The structure made by all the paradigms is called the IEML dictionary. This file is a space-separated csv file with the following columns : ['root', 'paradigms', 'inhibitions']. (root = the main paradigms, paradigms = sub-paradigm in the root, inhibitions = semantics relations to ignore)
  • structure/lexicons/{domain} : These are the paradigms that structure a specific domain lexicon. The lexicons paradigms are build from the morphemes defined in the structure/dictionary file. These files defines paradigms that have been made using intlekt. These files are also space separated csv with the following columns : ['paradigm', 'domain'].

The descriptors files are contained into the descriptors folder, these files contains the translations and definition of the ieml items in natural languages. In the folder, there is a file per syntatic level:

  • morpheme : contains the translations and comments of morphemes and theirs paradigms. This file contains the translations for all the dictionary and definition for the main paradigms.
  • trait : contains the translations and comments of traits and theirs paradigms. The trait are sequence of morpheme and trait paradigms are list of sequence of morphemes.
  • character : contains the translations and comments of characters and theirs paradigms. The character is a hierarchy of trait, the character is a hierarchy of trait where at least one of the trait is a paradigm.

All descriptor files are space separated csv and have the same columns interface ['ieml', 'language', 'descriptor', 'values'] :

  • ieml : the ieml string of the object
  • language : 2 letters language code, for the moment, only 'fr' and 'en' exists.
  • descriptor : a value taken from ['translations', 'comments']
  • values : a list of strings, for comments 'descriptor', the string can be in the Markdown format.

The IEML database aims to provide a memory of past interpretations to statistically ground the language. Asymptotically, this database could be also used to automate the translation of IEML traits, characters and words toward natural languages.

Versions

  • 0.1 : add dictionary and lexicon folder, add README and version file, add docs folder with database visualisation
  • 0.2 : refractor the database structure, remove yaml usage and use csv instead, separate the descriptors from the structure
  • 0.3 : refractor the database structure, separate the lexicons into domains but keep an unique descriptors file for the lexicons

ieml-language's People

Contributors

plevyieml avatar ogrergo avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.