Giter Site home page Giter Site logo

tatoeba / tatoeba2 Goto Github PK

View Code? Open in Web Editor NEW
671.0 671.0 131.0 187.92 MB

Tatoeba is a platform whose purpose is to create a collaborative and open dataset of sentences and their translations.

Home Page: https://tatoeba.org

License: GNU Affero General Public License v3.0

PHP 81.61% HTML 8.19% CSS 4.25% JavaScript 5.40% Shell 0.37% Batchfile 0.03% Python 0.15%
languages linguistics

tatoeba2's People

Contributors

agrodet avatar alanfgh avatar alexandremagueresse avatar allan-simon avatar andipersti avatar biptaste avatar dj-saidez avatar fadhelafif avatar gioele-santi avatar githubshrek avatar halfdan avatar heliostrike avatar jiru avatar kwikadi avatar lbeaudoux avatar linkmauve avatar loolmeh avatar pallavshah avatar phtan avatar ppjet6 avatar ryckrichards avatar shashank19gaurav avatar theblindspring avatar tommy-3 avatar trang avatar ventice avatar vinkaks avatar wallflower1 avatar yorwba avatar zachleigh avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

tatoeba2's Issues

Selected language for search or random sentence is not retained in a new window

The languages selected for the "From" and "To" drop-down lists for a phrase search are not retained when a new window is opened. Instead, the selection always flips back to "Any". However, switching the interface language does preserve the selected languages in the same window. In addition, when opening a new window, the selected language for adding new sentences is retained, and the interface language is also retained.Reported by Griza Leono and marafon on the Wall in January 2014, after the December 2013 crash.Additional note: pne points out that the same is true for random sentence::::Example: http://tatoeba.org/eng/sentences/show is supposed to show an English sentence, but it shows any sentence.Choose a language from the drop-down at the top (e.g. Czech) and click "Random", and you get a Czech sentence... but the drop-down has turned back into "Any language".

Remove "__()" wrapper from two empty strings

There were two instances of __() applied where they shouldn't be. Here are the changes I've made and am about to check in:app/views/helpers/comments.php:292:--- app/views/helpers/comments.php (revision 2080)+++ app/views/helpers/comments.php (working copy)@@ -289,10 +289,7 @@$this->Html->image(IMG_PATH . 'edit.png',array(- "title" => __(- $tooltip,- true- ),+ "title" => $tooltip,"width" => 24,"height" => 24)app/views/pages/help.ctp:248--- app/views/pages/help.ctp (revision 2080)+++ app/views/pages/help.ctp (working copy)@@ -245,7 +245,7 @@),$html->image(IMG_PATH . 'translate.png'));- __('');+ '';?>

Provide function to search for near-/duplicates of sentence to be submitted

Provide a function that lets you search for existing duplicates or near-duplicates of a sentence that you're about to submit, ideally without taking you to another screen and disturbing your workflow. There are various ways that this function could be executed:

  1. automatically upon submission of any sentence
  2. automatically upon submission of any sentence unless you are an advanced contributor or higher
  3. automatically upon submission of any sentence unless you opt out in your profile
  4. upon pressing a button that is visible from the screen from which you submit a sentence

AlanF_US, Wall
anders, Wall


Original ticket: https://www.assembla.com/spaces/tatoeba2/tickets/396

Going to "Wall" on VM gives error

Going to "Wall" on the virtual machine gives this error (possibly because the Wall is empty):Warning (512): SQL Error: 1054: Unknown column 'hidden' in 'where clause' [CORE/cake/libs/model/datasources/dbo_source.php, line 527]Query: SELECT Wall.id, Wall.owner, Wall.parent_id, Wall.date, Wall.title, Wall.content, Wall.lft, Wall.rght, User.username, User.id FROM wall AS Wall LEFT JOIN users AS User ON (Wall.owner = User.id) WHERE hidden = 0 ORDER BY Wall.date DESC LIMIT 10

Error for "Contribute/Discuss sentences" on VM

When I go to Contribute/Discuss sentences on my virtual machine, I get this (possibly because the results returned are empty):Warning (512): SQL Error: 1054: Unknown column 'SentenceComment.hidden' in 'field list' [CORE/cake/libs/model/datasources/dbo_source.php, line 527]Query: SELECT SentenceComment.id, SentenceComment.user_id, SentenceComment.text, SentenceComment.created, SentenceComment.modified, SentenceComment.sentence_id, SentenceComment.hidden, Sentence.id, Sentence.text, Sentence.lang, Sentence.user_id, User.id, User.username, User.image FROM sentence_comments AS SentenceComment LEFT JOIN sentences AS Sentence ON (SentenceComment.sentence_id = Sentence.id) LEFT JOIN users AS User ON (SentenceComment.user_id = User.id) WHERE 1 = 1 ORDER BY created DESC LIMIT 50Warning (2): Invalid argument supplied for foreach() [APP/controllers/components/permissions.php, line 144]

delete a tag once no more sentences are using it

al_ex_an_der posted on the Wall:From time to time tags are created by mistake. It would be helpful if tags would be deleted automatically (without a necessity of human interference) as soon as there's no longer any sentence connected with them.

Remove links from "Contribute" page and replace them with links to the wiki

Exchange between Trang and me:

I wrote:

(Phrase 4) Role of moderators
app/views/pages/contribute.ctp:66
We should probably change this to something like:
Role of advanced contributors-> Role of corpus maintainers

Trang responded:

The status of "Corpus maintainer" used to be called "Moderator" but we changed the label because people were expecting the moderators to moderate the community, while we were only expecting them to moderate the corpus (cf. http://blog.tatoeba.org/2011/05/new-users-status.html).
Anyway I think we should remove these links from the "Contribute" page (http://tatoeba.org/eng/contribute, in "Important to read") and replace them with links to the wiki instead.

Arabic (Syrian) (= North Levantine Arabic?)

quote of email from dpsilver7

Ar / ara (Syrian Accent) (it's not classic Arabic, no one speaks Classic Arabic anymore!! :D, it's vernacular, all Arabs and most foreigners I know (that were learning Arabic) wanted to learn Syrian Accent.

Make brackets part of the clickable link for comments

CK writes:

The suggestion:

Make the brackets [ ] part of the clickable part of the link for the "shorthand" used on comments.

In other words, instead of showing this on the page.

3030038 誰か鉛筆持ってない?...

show this.

[#3030038] 誰か鉛筆持ってない?

The reasons:

  1. Even new members can easily see how such links are made. There is no need to read the documentation (assuming you can find it), to learn how to do it.
  2. It makes copying and pasting without losing the link possible.
    For example, this would help, when someone copies part of a comment.Here's something tommy_san recently did, which highlights why this small improvement in the code would be useful.
    COPIED FROM: http://tatoeba.org/eng/sentences/show/1510023#comment-328491
    COPIED TO: (notice the links get lost) http://tatoeba.org/eng/sentences/show/3030014#comment-328502

Request for Gujarati/English IPA converter

Initially requested asked by "ken p" via e-mail to Team Tatoeba. He included the following example:
what is your name?...............English
तुम्हारा नाम क्या है?......................Translation into Hindi
Tumhārā nāma kyā hai?........Transliteration of Hindi
તમારું નામ શું છે?...........................Translation into Gujarati
Tamāruṁ nāma śuṁ chē?....Transliteration of Gujarati

JSONRPC API Component

This is the ticket for the API component as it is in CakePHPA few things:1) The methods throw their own exceptions, containing at least a code and a message, which then the component catches. These errors are mainly to indicated to the user which parameters the method accepts. So, for example, if the client provides the wrong arguments for a method, the client will get an error message saying "method_name only accepts the follow arguments: arg1, arg2, arg3".2) Each method manages its own version3) Still need to create routing rule for API and figure out permissions schemeDetailed information can be found at the wiki:https://github.com/trang/tatoeba-api/wiki/Tatoeba-API-specification-2

Correct remaining issues with English interface strings

See attached file. Two examples:Adopting a sentence is also part the "quality process". You can find more information about it here: app/views/pages/help.ctp:315Sentences lists app/views/pages/help.ctp:328"part the" -> "part of the""Sentences lists" -> "Sentence lists"

Allow option to retrieve page with different number of sentences than the default

gleki, Wall (though gleki posed this as a question rather than a request)

It would be nice if you could retrieve and display a page containing more than the default number of sentences (currently 10) for a given request . Possible ways to specify the number of sentences:
(1) Build the number into the request URL as a parameter.
(2) Allow the user to choose from a set of possible values via the GUI.
(3) Set the default value from your profile.Since setting the value too high might cause problems with our traffic load, we might want to think about setting a maximum value, and if so, what it should be.

UI strings missing from Launchpad

tommy_san reported (in private correspondence) the following issues with the set of UI strings to be translated in Launchpad:

Here's the list of missing strings I found:

(1) On every page at the bottom: Blog, Twitter, Facebook

(2) Names of some new languages:Algerian, Arabic, Ancient Greek, Hiligaynon, Picard, Standard Tibetan

(3) http://tatoeba.org/eng/sentences_lists/index create (the text inside a button)

(4) In the title of pages, as in http://tatoeba.org/eng/contributions/latest Tatoeba (This must be the same in the languages that use Latin alphabet, but not always so in other languages.)

(5) The whole "Improve sentences" page (and I believe this page needs to be updated) http://tatoeba.org/eng/activities/improve_sentences

(6) In the menu More, Quick Start Guide, Tatoeba Wiki

And here are some other things that I noticed:

[a] http://tatoeba.org/eng/contribute The link to "Role of moderators" must be deleted or changed, since there's no "moderator" anymore.

[b] http://tatoeba.org/eng/help I think the sentences "Note that translations are limited to registered users. Click here to register." should be deleted. Almost all other things written on the page are what only registered users can do anyway.

[c] http://tatoeba.org/eng/terms_of_use In the terms of use, there are some places where the French and English versions don't really seem to match. For example, a whole sentence beginning with "En d'autres mots, vous ne pouvez importer du texte que s'il est" is missing in the English translation. I'm not sure if it's great problem, but when we think of the fact that all other translations are translated from the English, it might be a good idea to make a more strict translation of the French, which is the only valid one.You might be able to see the difference clearly if you take a look at the French version of this page: http://tatoeba.org/fre/terms_of_use where the English translation is partly re-translated into French so you can compare them both. (This meaningless translation on this page must be deleted at any rate.)

Warn user and/or admin when sentence's language flag is changed

JimBreen, Wall
Jim wrote that he recently came across two cases where sentences were overwritten by versions in different languages. He thought it occurred because the person/persons who did it didn't know the processes that well. I asked him to clarify whether he meant that a warning should be displayed to the user or the admin, or both. He said that he meant both. In further discussion, he and tommy_san said that whereas changing a sentence's flag was likely to be legitimate when the sentence was new, it was less likely to be legitimate later, so perhaps a warning should only be issued if the sentence was old.


Original ticket: https://www.assembla.com/spaces/tatoeba2/tickets/413

Errors in phonetic transcriptions

From e-mail from "moustache sans barbe" to Team Tatoeba:

Toutes les transcriptions phonétiques des exemples (il y en a plusieurs) de phrases comprenant "...に入っていた" est retranscrite par erreur en "...には/いっていた" ce qui est un contresens sans rapport.
Je vous laisse corriger cette transcription orale.
Cdlt

In English:

All the phonetic transcriptions of the examples (there are several) of phrases containing "...に入っていた" is [sic] incorrectly retranscribed as "...には/いっていた", which is an unrelated mistranslation.
I'll leave it to you to correct this oral transcription.
Cdlt

Directionality of original lang. for sentence remains after lang. is changed

If a newly created sentence is assigned a language with one directionality, then it will continue to be displayed with the same directionality even after the language is changed. For instance, say that I add an English translation "The sky is blue." to a Hebrew sentence, but I mistakenly mark my translation as Hebrew. I see my mistake immediately because the final period clings to the beginning of the sentence (the left) rather than the end (the right). Now I click on the flag to change it to English. Even though I've changed it, it will still be displayed on my current page with right-to-left directionality (with the period clinging to the beginning rather than the end). If I actually go to the URL of the sentence, it is displayed correctly, however. Thus, this is a transient annoyance rather than a permanent problem. It would still be nice to fix the problem, though.


Original ticket: https://www.assembla.com/spaces/tatoeba2/tickets/407

Help for search engine not visible until you do a search

There is no help for the search engine under the FAQ or help links. You can only find the help after you actually perform a search (e.g., search for "blue" in English). After the search is completed, help for the search engine is displayed in the upper right-hand side of the page. This help should also be incorporated into the FAQ and help links. I'm considering this a bug rather than an enhancement because searching is basic functionality that should be covered in the FAQ and help links.

Eastern Punjabi (pan)

Language name: Punjabi (Eastern)
Language info: https://en.wikipedia.org/wiki/Punjabi_language
ISO 639-3 code: pan
Public List: https://tatoeba.org/eng/sentences_lists/show/3279
Icon: http://goo.gl/QgFcfo
Requested by: sabretou

Previous messages:

TODO

  • Also change string for pnb from "Punjabi" to "Punjabi (Western)".
  • icon for pnb: http://goo.gl/F3rBNK

ead http://tatoeba.org/eng/wall/show_message/17381#message_17381
The changes mentioned in that thread involve splitting the existing Punjabi language sentences into two, each with a different script:"(pan) is Eastern, or Indian Punjabi, predominantly written in Gurmukhi and spoken in India, while (pnb) is Western, or Pakistani punjabi, written in Shahmukhi and spoken in Pakistan."

Once I get two lists from sabretou (one for Eastern, one for Western), I'll add a link to this ticket.

Add language: Sinhala (sin)

./add_lang.sh sin Sinhala 3262

Transcriptions (furigana, etc.) no longer shown for Japanese, Georgian, etc.

On the Wall, orcrist posted the following::::For Japanese and other languages with character sets as opposed to alphabets, when people are learning the languages, we start with a phonetic alphabet. This is often written in small letters just above the actual text. These small letters are called "rubies", and for Japanese specifically, the phonetic reading written adjacent to a word is called "furigana". Before the server trouble at the end of 2013, Tatoeba showed furigana (albeit with some errors).Whether it is worth the admins' time to work on re-enabling furigana support is an open question. If there are other high quality free resources that work well in replacement of it, perhaps it is not necessary. Rikaichan works well as long as you are reading everything from within your web browser.

"Help" link is not visible enough

Recently, Ticket 384 was addressed, which was nice to see. However, I think an additional change needs to be made.
The subject of the ticket was "Place 'Help' and 'FAQ' links at top of page, not at bottom". The problem was that "Help" and "FAQ" links are extremely important but had very low visibility (at the bottom of the page). A change was made to put them at the top of the page. However, now they're underneath a "More" link. They were not obvious at all to me there, and I'm sure they would be less so to someone who is unfamiliar with the site.

There is plenty of room for another item on the top bar. I would suggest that a "Help" link be added between "Wall" and "More". Then under "Help" you could have "FAQ", "Quick Start Guide", "Tatoeba Wiki" and "Help" itself, though perhaps there should be some way to distinguish between top-level "Help" and the actual "Help" link itself. Maybe that lower-level link should be named something like "General Help"?

Meanwhile, I think that "Downloads" and "Tools" should remain under "More".

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.