When do I get to hear the TTS engine based on my voice? I'm sure there's enough data out there to get a full sampling. =P
Once this becomes complete I will also tackle the ma'oste. Once that is complete, you can bet I'll do the experimentals and some of the broader vocabulary.
When do I get to hear the TTS engine based on my voice? I'm sure there's enough data out there to get a full sampling. =P
--
You received this message because you are subscribed to the Google Groups "Lojban Beginners" group.
To view this discussion on the web visit https://groups.google.com/d/msg/lojban-beginners/-/rZY-4YlHgRoJ.
To post to this group, send email to lojban-b...@googlegroups.com.
To unsubscribe from this group, send email to lojban-beginne...@googlegroups.com.
For more options, visit this group at http://groups.google.com/group/lojban-beginners?hl=en.
On 6 September 2012 15:37, Lindar <lindar...@gmail.com> wrote:Once this becomes complete I will also tackle the ma'oste. Once that is complete, you can bet I'll do the experimentals and some of the broader vocabulary.
When do I get to hear the TTS engine based on my voice? I'm sure there's enough data out there to get a full sampling. =PMakes me want to record 'em all too ;)
.i mi'e la tsani mu'o
Lojban has six recognized vowels: “a”, “e”, “i”, “o”, “u” and “y”. The first five are roughly pronounced as “a” as in “father”, “e” as in “let”, “i” as in “machine”, “o” as in “dome” and “u” as in “flute”. “y” is pronounced as the sound called “schwa”, that is, as the unstressed “a” as in “about” or “around”.
Twelve consonants in Lojban are pronounced more or less as their counterparts are in English: “b”, “d”, “f”, “k”, “l”, “m”, “n”, “p”, “r”, “t”, “v” and “z”. The letter “c”, on the other hand is pronounced as the “sh” in “hush”, while “j” is its voiced counterpart, the sound of the “s” in “pleasure”. “g” is always pronounced as it is in “gift”, never as in “giant”. “s” is as in “sell”, never as in “rose”. The sound of “x” is not found in English in normal words. It is found as “ch” in Scottish “loch”, as “j” in Spanish “junta”, and as “ch” in German “Bach”; it also appears in the English interjection “yecchh!”. It gets easier to say as you practice it. The letter “r” can be trilled, but doesn’t have to be.
The Lojban diphthongs “ai”, “ei”, “oi”, and “au” are pronounced much as in the English words “sigh”, “say”, “boy”, and “how”. Other Lojban diphthongs begin with an “i” pronounced like English “y” (for example, “io” is pronounced “yo”) or else with a “u” pronounced like English “w” (for example, “ua” is pronounced “wa”).
....
All Lojban words are pronounced as they are spelled: there are no silent letters.
--
You received this message because you are subscribed to the Google Groups "Lojban Beginners" group.
To view this discussion on the web visit https://groups.google.com/d/msg/lojban-beginners/-/t5gGxW9sDyMJ.
To post to this group, send email to lojban-b...@googlegroups.com.
To unsubscribe from this group, send email to lojban-beginne...@googlegroups.com.
For more options, visit this group at http://groups.google.com/group/lojban-beginners?hl=en.
You must possess, on the same computer, the LAME MP3 encoder. You can download it at this site: http://lame.sourceforge.net/, which provides the source code, or you can go here: http://lame.sourceforge.net/links.php#Binaries to download a binary, or for Windows users you can download the .dll directly from the instructions page that I will make if I get any volunteers.
Thank you to Pierre, livefree, and aionys for helping out. Let me know if you'd like a music vignette, written thank you, or silly picture in Hollywood, in Lojban.
Craaaaaaaaaap... apparently I skipped a big bit? I was reading from the list of words in the gismu deck on Memrise, so maybe that deck isn't complete? Ugh... Oh well... Did you get both files when you started the whole thing? I have them both saved to my Google Drive, so please make sure you've got both of them. If that's still not the whole list then please try to get me something by Wednesday when I'll have a day off and the apartment to myself.For those of you that are helping, please let me know what gift you want and fill in the request portion if applicable.
Chunks 12, 14, 17, and 19 are officially finished. Thanks go to Daniel Lyons for completing these chunks.
Each job will be essentially the same, with the only difference between them being the actual words in your chunk.
With that in mind, these are the skills and software that are required to do this job:
You must be able to recognize a gismu from it's pronunciation. This means that you must be able to identify and type the word solely from the audio. If you know Lojban's pronunciation, this is not difficult. This is required because while Lindar recorded the audio /mostly/ in frequency order, some were skipped, and some needed to be re-recorded. If you need a list of the gismu in frequency order, you can find it here: http://www.lojban.org/tiki/Word+frequency+lists%3A+gismu
It is possible that there are more of both unrecorded and badly recorded words (badly recorded being words that are unidentifiable for any reason). If you encounter either in the course of your job, you must be able to contact Lindar, likely preferably via email, to inform him of these words so that he can submit new recordings. In the case of a word or three, preferably to yourself. In the case of a chunk's worth of words, preferably to me so I can add the recording to the chunk list for claimage.
You must possess, on the computer you will be using to work on, Audacity, available for free here: http://audacity.sourceforge.net/
You must possess, on the same computer, the LAME MP3 encoder. You can download it at this site: http://lame.sourceforge.net/, which provides the source code, or you can go here: http://lame.sourceforge.net/links.php#Binaries to download a binary, or for Windows users you can download the .dll directly from the sign-up page.
You must know how to collect a group of files into an archive. It does not matter if you use .zip, .rar, .7z, .tar, .tar.gz, .tar.bz, or any other format not mentioned.
You must know how to attach a file to an email.
You must be able to follow written instructions, which you can find here: http://tinyurl.com/gismu-instructions
Finally, you must be willing to sacrifice some of your free time to do the actual job. These last two are in my opinion no-brainers.
If you meet these criteria, please go to http://tinyurl.com/gacsp-signup, claim one or more chunks and type your name into the sign up field so everyone else knows you've claimed that chunk, and follow the download link to download your chunk in mp3 format.
When you have completed your chunk, place all the files into an archive and email them to me. Then, if you like, grab another chunk. If you don't like, go do something else.
I don't know how good of an incentive this is, but you should know that this audio, once it's been pulled apart into a usable format (the goal of this project), it will be the audio for the various learning programs currently in existence and probably for future ones as well. This means the current lessons on Anki and Memrise specifically, and more generally any other places where someone feels inclined to upload audio for how to pronounce the gismu.
So I claimed chunk 25, but I've got problems converting the gismu audio files to mp3. I installed audacity and the lame.dll plug in, but it won't work.
So either I fix the problem somehow or maybe someone else would like to take over chunk 25?
.uu go'i .i mu'omi'e jongausib
Den torsdagen den 6:e september 2012 kl. 11:30:04 UTC+2 skrev aionys:
Each job will be essentially the same, with the only difference between them being the actual words in your chunk
With that in mind, these are the skills and software that are required to do this job:
You must be able to recognize a gismu from it's pronunciation. This means that you must be able to identify and type the word solely from the audio. If you know Lojban's pronunciation, this is not difficult. This is required because while Lindar recorded the audio /mostly/ in frequency order, some were skipped, and some needed to be re-recorded. If you need a list of the gismu in frequency order, you can find it here: http://www.lojban.org/tiki/Word+frequency+lists%3A+gismu
It is possible that there are more of both unrecorded and badly recorded words (badly recorded being words that are unidentifiable for any reason). If you encounter either in the course of your job, you must be able to contact Lindar, likely preferably via email, to inform him of these words so that he can submit new recordings. In the case of a word or three, preferably to yourself. In the case of a chunk's worth of words, preferably to me so I can add the recording to the chunk list for claimage.
You must possess, on the computer you will be using to work on, Audacity, available for free here: http://audacity.sourceforge.net/
You must possess, on the same computer, the LAME MP3 encoder. You can download it at this site: http://lame.sourceforge.net/, which provides the source code, or you can go here: http://lame.sourceforge.net/links.php#Binaries to download a binary, or for Windows users you can download the .dll directly from the instructions page that I will make if I get any volunteers.
You must know how to collect a group of files into an archive. It does not matter if you use .zip, .rar, .7z, .tar, .tar.gz, .tar.bz, or any other format not mentioned.
You must know how to attach a file to an email.
You must be able to follow written instructions, which you can find here: http://tinyurl.com/gismu-instructions
Finally, you must be willing to sacrifice some of your free time to do the actual job. These last two are in my opinion no-brainers.
If you meet these criteria, please go to http://tinyurl.com/gacsp-signup, claim one or more chunks and type your name into the sign up field so everyone else knows you've claimed that chunk, and follow the download link to download your chunk in mp3 format.
There are 11 chunks total. Each chunk is ~6 minutes of audio, which based on my rather basic math should be ~100 gismu in each chunk.
When you have completed your chunk, place all the files into an archive and email them to me. Then, if you like, grab another chunk. If you don't like, go do something else.
I don't know how good of an incentive this is, but you should know that this audio, once it's been pulled apart into a usable format (the goal of this project), it will be the audio for the various learning programs currently in existence and probably for future ones as well. This means the current lessons on Anki and Memrise specifically, and more generally any other places where someone feels inclined to upload audio for how to pronounce the gismu.
--
mu'o mi'e .aionys.
.i.e'ucai ko cmima lo pilno be denpa bu .i doi.luk. mi patfu do zo'o
(Come to the Dot Side! Luke, I am your father. :D )
--To view this discussion on the web visit https://groups.google.com/d/msg/lojban-beginners/-/ibMrNcM0TcoJ.
You received this message because you are subscribed to the Google Groups "Lojban Beginners" group.
To post to this group, send email to lojban-b...@googlegroups.com.
To unsubscribe from this group, send email to lojban-beginne...@googlegroups.com.
For more options, visit this group at http://groups.google.com/group/lojban-beginners?hl=en.
--
You received this message because you are subscribed to the Google Groups "Lojban Beginners" group.
To view this discussion on the web visit https://groups.google.com/d/msg/lojban-beginners/-/spw8He7F-9wJ.
To post to this group, send email to lojban-b...@googlegroups.com.
To unsubscribe from this group, send email to lojban-beginne...@googlegroups.com.
For more options, visit this group at http://groups.google.com/group/lojban-beginners?hl=en.