Comments (4)
What we might need to do is the following:
- decide what to do with multiple entries (which might have multiple 'meanings' themselves)
- whether or not to preprocess the Marble data (like transforming
{L:Dead Sea<SDBH:יָם־הַמֶּלַח>};
intoDead Sea (יָם־הַמֶּלַח)
) - find out what else we want to extract and use from the marble lexicon. There are a lot of interesting things:
- synonyms
- contextual definitions/references
- core/sub domains
- base form/part of speech
- references to other occurences in Scripture
from macula-hebrew.
In the instance data, I think we want:
- semantic domain
- gloss
Perhaps like this:
sdbh="semdom gloss", using the English gloss. If there are multiple entries, concatenate them with a delimiter.
from macula-hebrew.
A lot of the rest of this is at the lexicon level. We need to figure out what we are doing with Greek and Hebrew semantic lexicons. I need to talk to Reinier about what we have rights to.
from macula-hebrew.
There were no issues when adding the semantic domain and glosses from Marble, so I'll close the issue.
The Marble data is structured as follows:
:::<gloss;gloss;gloss>
Multiple glosses and domains within one entry are separated by ";"
Multiple entries are separated by "|"
xml
<m USFMId="GEN 6:16!1" n="010060160011" lang="H" after="׀" lemma="6672 b" morph="Ncfsa" pos="noun" type="common" gender="feminine" number="singular" state="absolute" cherith-english="roof" cherith-chinese="窗户" marble-sense="צֹהַר:001005001:Constructions:covering;roof|צֹהַר:001005001:Constructions:skylight">צֹ֣הַר</m>
from macula-hebrew.
Related Issues (20)
- Add lemmas to Hebrew nodes trees HOT 4
- There are missing `m/@xml:id`s in our current lowfat trees HOT 1
- Marble Domains (`Domain`, `ContextualDomain`, `CoreDomain`) HOT 6
- 5. Repopulate Hebrew lowfat with the latest updates:
- transcription and gloss attributes from SIL are still missing, at least from Genesis 1.
- Problems in `morpheme-mappings.xml` HOT 1
- Word Sense (from macula-greek) HOT 1
- Greek beta-to-unicode in Genesis 1:1 HOT 1
- Incorrect closing </w> tag
- Implicit article stealing attributes from following sibling
- Split node at GEN 50:10!4
- Replace `c` node with merged `m` in PSA 102:4
- After in Gen 1:12 HOT 2
- Incorrect mapping to lowfat HOT 1
- _ki_ missing in Lev 5:21. HOT 2
- Low-fat word parts missing HOT 5
- Lowfat 'c' fields have no glosses HOT 1
- include Ketiv into Macula-Hebrew ? HOT 2
- Misnumbered nodes in 1 Chronicles 20 HOT 1
- Macula Contextual Domains
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from macula-hebrew.