Giter Site home page Giter Site logo

kfcd / yyzd Goto Github PK

View Code? Open in Web Editor NEW
38.0 2.0 7.0 5.64 MB

開放粵語字典 - 現代粵語字音數據庫

Home Page: http://kaifangcidian.com/han/yue

License: Other

cantonese cantonese-language cantonese-dictionary chinese chinese-characters chinese-language chinese-dictionary dictionary-data

yyzd's Introduction

開放粵語字典

說明

本項目提供開放詞典網粵語字典的完整數據,並以繁簡兩種字體、耶魯、粵拼等11種粵語拼音方案、以及TSV、CSV、MD等多種文檔格式發佈,以便於人與機器皆能讀取、利用數據創造出衍生作品。

特色

  • 按照描寫語言學的原則編寫的粵語字典
  • 以粵語母語者實際講的粵音為標準
  • 著重於現代粵語而非古音(即現代語言為主,古代語言為副)
  • 主要收錄書面語的字音(口語詞可參考開放詞典的粵語詞典資料庫)
  • 完全開源/開放授權的語料數據

數據格式

繁體 簡體 拼音 詞例 定義 又作
daan5 撢子,撢土,撢衣服,雞毛撢子 拂塵用具,以雞毛或不條作成;用撢子掃去灰塵
jeun3 浚河,浚渠,浚井,疏浚 疏通,挖深

支援拼音方案

原數據採用耶魯拼音標音,此外一共11種拼音版本(如粵拼)分別在dist/tsv等子目錄裡可找到。

  • 耶魯
    • (數字)如:jeun3、yeun3、seung3、cha4、chaam4、chaang4
    • (調符)如:jeun, yeun, seung, chàh, chàahm, chàahng
  • 粵拼
    • 如:zeon3, jeon3, soeng3, caa4, caam4, caang4
  • 教院
    • 如:dzoen3, joen3, soeng3, tsaa4, tsaam4, tsaang4
  • 黃錫凌
    • (數字)如:dzœn³, jœn³, sœŋ³, tsa⁴, tsam⁴, tsaŋ⁴
    • (調符)如:¯dzœn, ¯jœn, ¯sœŋ, ˌtsa, ˌtsam, ˌtsaŋ
  • 劉錫祥
    • 如:jun³, yun³, seung³, cha⁴, chaam⁴, chaang⁴
  • 國際音標
    • 如:tsɵn˧, jɵn˧, sœːŋ˧, tsʰaː˨˩, tsʰaːm˨˩, tsʰaːŋ˨˩
  • 廣州拼音
    • 如:zên3, yên3, sêng3, ca4, cam4, cang4
  • 粵語拼音字
    • (數字)如:jont3, yont3, seong3, ca4, cam4, cang4
    • (調符)如:jônt, yônt, seông, ca, cam, cang

實現示例

  • 國粵消歧義字譜
  • 國粵字音對照表

另見

版權

© 2009-2020 開放詞典

本倉庫所含數據皆依照共享創意(創用CC/知識共享)姓名標示(署名)協議發佈。

創用 CC 授權條款
本著作係採用創用 CC 姓名標示 3.0 未本地化 授權條款授權。

yyzd's People

Contributors

dohliam avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar

yyzd's Issues

Please Make Jyutping Displayed as Default.

Hi admin(s) of 開放詞典, please make Jyutping as the default displayed spelling system of the website.

With the growing use of Cantonese Jyutping, most online resources have standardized around Jyutping rather than others, and now even the Cantonese input methods offered in Apple QuickType and Google Gboard, which also use Jyutping as the primary spelling for Cantonese. It is clear that Jyutping has become the dominant standard for Cantonese spelling system.

Some cases show that people did question the fact that the Yale scheme currently used in 開放詞典 is not the standard Jyutping spelling system they are familiar with, which led to inconsistencies in their use of both 開放詞典 and the input method and other online resources.

Therefore, please consider making Jyutping as the primary default displayed spelling systems for the website to meet the needs of most users.

Lastly, I would like to say thanks to 開放詞典's site owner and contributors for their efforts on the Cantonese language. It provides a great source of data for those who want to learn Cantonese and for those who want to build applications related to Cantonese learning.

Now, thanks in advance to those who will help with this issue!

Please Consider Converting All Yale Spelling in Original Data to the Jyutping Standard.

I just realized that there is a problem that the romanization spelling used in the Cantonese section of the 開放辭典 is the Yale. Since the data is originally in the Yale scheme, some of the pronunciation elements (e.g., eo, eu, ep, and oe) that exist in Jyutping cannot be converted correctly (perfect correspondingly), and as a result, some words and phrases may be distorted after being converted.

Share some references from other research:

  1. 耶魯拼音中,/a/ 喺開音節用 a 表示,閉音節用 aa。/ɐ/ 喺閉音節中用 a 表示,無法表示開音節 /ɐ/。
  2. 耶魯拼音用 j ch y 嚟代表 /ts/、/tsʰ/、/j/ 三個音。因此元音 /yː/ 需要用 yu 嚟表示。
  3. 耶魯方案唔區分 /ɵ/ 同埋 /œ/,兩者都用 eu 嚟表示。亦都因爲噉耶魯拼音無法表示 /ɛːu/ 呢個音。
  4. 耶魯用附加符號 ˉˊ 同埋字母 h 嚟表示聲調(新版本加入咗數字標調)。
  睡 /sɵi˨/ 掉 /tɛːu˨/ 𠰲 /œt˨/
耶魯 seuih 無法表示 無法表示
粵拼 seoi6 deu6 oet6

To avoid the continuation of this issue, it is recommended that the 開放詞典 convert all Yale in original data to standard Jyutping, and from then on use Jyutping for new words and phrases, so as to avoid the distortion of conversion between spelling systems by adding new words and phrases with continuing use of Yale.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.