Lojban on Memrise.

Jonathan Jones

Jun 24, 2012, 1:07:27 PM

to lojban-b...@googlegroups.com, loj...@googlegroups.com

Now that I've gotten a fair bit through the work, I think it's time to let all of you in on my secret project:

Firstly, I don't know if anyone has already mentioned this, but I recently discovered a site called Memrise.com, which is everything Smart.fm could have been if they hadn't decided to be jerks. In other words, it's pretty cool.

There are already Lojban vocabulary lessons on there, including one for the gismu, but since it uses the full, rather complex definitions (with all the parenthetical statements and so on), I decided to create a new one that uses the same definitions the old Smart.fm lesson and the Anki flashcard list do. I've already uploaded the entire list, and now I'm going through the individual entries and adding what I can. Apparently there are some default permissions issues that Memrise in general needs to correct: I'm still waiting on a response from the guy in charge of that, but I don't expect to hear from him until at least Monday. So in most cases this just means adding a visual cue to those gismu for which it's appropriate, such as a picture of a computer for {skami}.

The content is divided into 14 "pages". I've added pictures to nearly all of the ones on the first page. I've attempted to keep the artistic style consistent, but there's obviously somewhat of a lack there.

In any case, I'm letting you guys know about this for two reasons: so you can use it if you want, and so that maybe some of you might decide to do a little picture adding of your own.

Whenever the permissions get fixed, I'll be adding the audio for all the words as well as a lot of other information that just isn't possible right now, but until then, it's just the words, the simplified definitions, and the pictures.

--
mu'o mi'e .aionys.

.i.e'ucai ko cmima lo pilno be denpa bu .i doi.luk. mi patfu do zo'o
(Come to the Dot Side! Luke, I am your father. :D )

Jonathan Jones

Jun 24, 2012, 1:08:19 PM

to lojban-b...@googlegroups.com, loj...@googlegroups.com

Oh, yes, I completely forgot the link: http://www.memrise.com/set/10034151/simplified-gismu-2/

Yeesh. I'm not old enough to have Alzheimer's.....

gleki

Jun 25, 2012, 1:37:10 AM

to loj...@googlegroups.com, lojban-b...@googlegroups.com

I wish more mnemonic hints were added to all gismu. Other sets have such hints but still not for all gismu.

Jonathan Jones

Jun 25, 2012, 1:58:33 AM

to loj...@googlegroups.com

Well, anyone can add them. There's a vote system in place to make the better ones be at the top.

--
You received this message because you are subscribed to the Google Groups "lojban" group.
To view this discussion on the web visit https://groups.google.com/d/msg/lojban/-/M4RTuPeM32cJ.
To post to this group, send email to loj...@googlegroups.com.
To unsubscribe from this group, send email to lojban+un...@googlegroups.com.
For more options, visit this group at http://groups.google.com/group/lojban?hl=en.

Jonathan Jones

Jun 30, 2012, 4:33:15 PM

to loj...@googlegroups.com

An update on the Simplified Gismu:

I've gone through nearly the first 2 pages (200 words) and added pictures to every one I could find a decent one for. Obviously some, like {sidbo}, are very difficult if not impossible to get a picture for.

I was planning on working on it some more just now, but the site is apparently down for maintenance, so I'm doing this update instead.

I know this is likely to fall on deaf ears given Robin's experience, but if anyone wants to assist in completing the adding-a-picture-to-every-gismu-we-can project, I'm not the only person who'll be appreciative- everyone who ever learns using them, ever, will be too.

http://www.memrise.com/set/10034151/simplified-gismu-2/

Jorge Llambías

Jun 30, 2012, 5:01:26 PM

to loj...@googlegroups.com

On Sat, Jun 30, 2012 at 5:33 PM, Jonathan Jones <eye...@gmail.com> wrote:
> Obviously some, like {sidbo}, are
> very difficult if not impossible to get a picture for.

Light bulb on top of head?

http://jbo.wikipedia.org/wiki/pixra_liste_loi_gismu doesn't yet have
one for sidbo either.

mu'o mi'e xorxes

Jonathan Jones

Jun 30, 2012, 5:15:21 PM

to loj...@googlegroups.com

I used the lightbulb over head for something else. I don't remember which one off the top of mine, though, and I can't look it up right now.


Jonathan Jones

Jul 18, 2012, 1:13:18 AM

to loj...@googlegroups.com

So, Ben gave me curator status, meaning I can now add all the missing stuff on the word entries. This means, among other things, that my course will now have audio.

Additionally, for each word, the "Parent" will list the entry for the gismu with the full, unsimplified definition, and the "Dependents" will list the entry for each of that gismu's places (i.e., "lo klama", "lo se klama", etc.).

Jonathan Jones

Jul 18, 2012, 1:16:49 AM

to loj...@googlegroups.com

Unfortunately, many of the audio files we have for the gismu were produced by a TTS engine. Anyone who is willing to contribute themselves or someone they know saying any of these words is welcome to do so and send the resulting mp3 to me.

Jonathan Jones

Jul 18, 2012, 1:26:33 AM

to loj...@googlegroups.com

It's jimpe.

la .lindar.

Jul 18, 2012, 1:44:05 AM

to loj...@googlegroups.com

The voices that you hear now are actually me. =D

I was planning on doing more and I actually have the inclination right now to get a few done a day.

If you want to get this done and out of the way, here's what we can do:

I'll read off of the frequency-ordered gismu list. I'll do three takes for each word.

I'll do it all in one go. I don't want to cut/edit any of it.

I have the equipment to get a really good recording, but I just don't want to bother with the rest.

If somebody else can do the rest of the work, I will provide high-quality recordings of all of the gismu.

I can probably do 20 at a time.

Jonathan Jones

Jul 18, 2012, 2:24:40 AM

to loj...@googlegroups.com

Just send them to me when you're done.


la .lindar.

Jul 25, 2012, 3:32:56 AM

to loj...@googlegroups.com

I did everything at once.

-_____-

It's 1.5 hours long and I almost vomited once I was done.

Send it where? It's around 75mb.

la .lindar.

Jul 25, 2012, 4:04:44 AM

to loj...@googlegroups.com

https://docs.google.com/open?id=0B2twDfvSFg4nVzdrMEYwV0lLU0E

This is the file.

Please let me know if I missed anything or need to redo anything.

Please keep me up to date as to when it's done being edited.

Should I do all of the cmavo next? >_>

Jonathan Jones

Jul 25, 2012, 4:51:45 AM

to loj...@googlegroups.com

I only have one question.

How certain are you that all the words are there, in order, without error or other sounds in it?

The reason I ask is because Memrise


Jonathan Jones

Jul 25, 2012, 4:55:57 AM

to loj...@googlegroups.com

Arg. Apparently I hit Send when I meant to hit Save Draft.

Anyway.

Would you be able to say you are at least 90% confident that the file meets the requirements as listed on this page?

Basically, they want a single audio file with ~1 second pauses between words, and each word must be in order. There are a few more things than that listed on the page, but those are the absolute necessities.

I of course will be going over the file myself, I just want to know how much work you think I'll need to do to get it to those standards.

la .lindar.

Jul 25, 2012, 5:18:28 AM

to loj...@googlegroups.com

Anyway.

Would be able to say you are at least 90% confident that the file meets the requirements as listed on this page?

No. =D

It's done as an MP3 and that clearly asks for a wav, which is going to be a loooooooooooot bigger.

You might be able to type-convert if you don't care about the quality.

I'll upload a WAV version anyway. Let me know if you want/need it.

Basically, they want a single audio file with ~1 second pauses between words, and each word must be in order. There's a few more things than that listed on the page, but those are the absolute necessity.

It's mostly in order, but I backtracked in some places, so make sure it's correct.

You'll have to do the spacing and crap like that yourself.

Also I said each word *at least* three times, so you're going to have to pick the best take of each.

Absolutely the entire thing will have to be edited to remove the other two to six takes you don't want.

I said each word a couple of times so as to avoid having to rerecord things if I balked a take.

That being said, if there's a word with no good take, let me know so I can redo it.

Jonathan Jones

Jul 25, 2012, 5:38:25 AM

to loj...@googlegroups.com

On Wed, Jul 25, 2012 at 3:18 AM, la .lindar. <lindar...@gmail.com> wrote:

Anyway.

Would be able to say you are at least 90% confident that the file meets the requirements as listed on this page?

No. =D
It's done as an MP3 and that clearly asks for a wav, which is going to be a loooooooooooot bigger.
You might be able to type-convert if you don't care about the quality.

I'll upload a WAV version anyway. Let me know if you want/need it.

Don't bother. If I'm going to have to edit it- especially as much as it looks like I'll have to from your later comments- then I may as well just use the mp3 file I've already downloaded. Quality isn't a huge thing- as long as it's good enough to hear clearly, that's all that really matters. We're not making a platinum selling single, after all.

Basically, they want a single audio file with ~1 second pauses between words, and each word must be in order. There's a few more things than that listed on the page, but those are the absolute necessity.

It's mostly in order, but I backtracked in some places, so make sure it's correct.
You'll have to do the spacing and crap like that yourself.
Also I said each word *at least* three times, so you're going to have to pick the best take of each.

Absolutely the entire thing will have to be edited to remove the other two to six takes you don't want.

Yeesh, that's a lot of work. I'll get on it tomorrow, but I have no idea how soon I'll have it done.

I said each word a couple of times so as to avoid having to rerecord things if I balked a take.

That being said, if there's a word with no good take, let me know so I can redo it.

I'll be sure to do that.

la .lindar.

Jul 25, 2012, 6:30:31 AM

to loj...@googlegroups.com

https://docs.google.com/open?id=0B2twDfvSFg4nOFZTeUtCS2pXRms

Just in case, here's the WAV. Exported directly as one so there's no conversion time.

Maybe source the work out to somebody?

Jonathan Jones

Jul 25, 2012, 2:15:05 PM

to loj...@googlegroups.com

On Wed, Jul 25, 2012 at 4:30 AM, la .lindar. <lindar...@gmail.com> wrote:

https://docs.google.com/open?id=0B2twDfvSFg4nOFZTeUtCS2pXRms

Just in case, here's the WAV. Exported directly as one so there's no conversion time.

Since I'm using Audacity to do the editing, there is, actually. Regardless of format, Audacity always converts it to its own proprietary format at load.

Maybe source the work out to somebody?

Who and with what money?

.arpis.

Jul 25, 2012, 3:22:26 PM

to loj...@googlegroups.com

On Wed, Jul 25, 2012 at 2:15 PM, Jonathan Jones <eye...@gmail.com> wrote:

On Wed, Jul 25, 2012 at 4:30 AM, la .lindar. <lindar...@gmail.com> wrote:

https://docs.google.com/open?id=0B2twDfvSFg4nOFZTeUtCS2pXRms

Just in case, here's the WAV. Exported directly as one so there's no conversion time.

Since I'm using Audacity to do the editing, there is, actually. Regardless of format, Audacity always converts it to it's own proprietary format at load.

Are you sure you mean "proprietary" and not just "internal"? Sorry to pick nits, but if you meant exactly what I think you said, I'm confused.

Maybe source the work out to somebody?

Who and with what money?


--
mu'o mi'e .arpis.

Jonathan Jones

Jul 25, 2012, 3:51:11 PM

to loj...@googlegroups.com

On Wed, Jul 25, 2012 at 1:22 PM, .arpis. <rpglover...@gmail.com> wrote:

On Wed, Jul 25, 2012 at 2:15 PM, Jonathan Jones <eye...@gmail.com> wrote:

On Wed, Jul 25, 2012 at 4:30 AM, la .lindar. <lindar...@gmail.com> wrote:

https://docs.google.com/open?id=0B2twDfvSFg4nOFZTeUtCS2pXRms

Just in case, here's the WAV. Exported directly as one so there's no conversion time.

Since I'm using Audacity to do the editing, there is, actually. Regardless of format, Audacity always converts it to it's own proprietary format at load.

Are you sure you mean "proprietary" and not just "internal"? Sorry to pick nits, but if you meant exactly what I think you said, I'm confused.

It is the internal format, but not "just". Audacity converts all audio to its own .aup format at load, which is also the only format it is possible to save in. (.wav, .mp3, and .ogg are EXPORT formats.) As far as I am aware, only Audacity is able to read that format, making it proprietary.

Maybe source the work out to somebody?

Who and with what money?


vitci'i

Jul 25, 2012, 5:26:00 PM

to loj...@googlegroups.com

On 07/25/2012 02:51 PM, Jonathan Jones wrote:
> On Wed, Jul 25, 2012 at 1:22 PM, .arpis. <rpglover...@gmail.com>wrote:
>> Are you sure you mean "proprietary" and not just "internal"? Sorry to pick
>> nits, but if you meant exactly what I think you said, I'm confused.
>
> It is the internal format, but not "just". Audacity converts all audio to
> it's own .aup format at load, which is also the only format it is possible
> to save in. (.wav, .mp3, and .ogg are EXPORT formats.) As far as I am
> aware, only Audacity is able to read that format, making it proprietary.

I think 'proprietary' implies that it would be *illegal* to write
another program that can also read .aup files. Given that Audacity is
GPL'd, it seems more likely that someone *could* write such a program,
but no one has bothered to yet.

Jonathan Jones

Jul 25, 2012, 6:03:08 PM

to loj...@googlegroups.com

Fine. Replace "proprietary" with "unique", then.


la .lindar.

Jul 25, 2012, 8:45:45 PM

to loj...@googlegroups.com

>> Are you sure you mean "proprietary" and not just "internal"? Sorry to pick

>> nits, but if you meant exactly what I think you said, I'm confused.

This isn't appropriate for this thread and is not contributing to the conversation.

.arpis.

Jul 25, 2012, 11:45:32 PM

to loj...@googlegroups.com

According to the wikipedia page [1], whether or not a format is proprietary depends on whether or not someone owns IP rights (usually patents) to it. I don't believe this to be the case with Audacity.

[1]: https://en.wikipedia.org/wiki/Proprietary_format

Jonathan Jones

Jul 25, 2012, 11:55:34 PM

to loj...@googlegroups.com

We're done with this. As Lindar has already said:

On Wed, Jul 25, 2012 at 6:45 PM, la .lindar. <lindar...@gmail.com> wrote:

This isn't appropriate for this thread and is not contributing to the conversation.

Leave off it.

Jonathan Jones

Jul 26, 2012, 2:04:17 PM

to loj...@googlegroups.com

On Wed, Jul 25, 2012 at 3:38 AM, Jonathan Jones <eye...@gmail.com> wrote:

On Wed, Jul 25, 2012 at 3:18 AM, la .lindar. <lindar...@gmail.com> wrote:

Anyway.

Would be able to say you are at least 90% confident that the file meets the requirements as listed on this page?

No. =D
It's done as an MP3 and that clearly asks for a wav, which is going to be a loooooooooooot bigger.
You might be able to type-convert if you don't care about the quality.

I'll upload a WAV version anyway. Let me know if you want/need it.

Don't bother. If I'm going to have to edit it- especially as much as it looks like I'll have to from your later comments- then I may as well just use the mp3 file I've already downloaded. Quality isn't a huge thing- as long as it's good enough to hear clearly, that's all that really matters. We're not making a platinum selling single, after all.

Basically, they want a single audio file with ~1 second pauses between words, and each word must be in order. There's a few more things than that listed on the page, but those are the absolute necessity.

It's mostly in order, but I backtracked in some places, so make sure it's correct.
You'll have to do the spacing and crap like that yourself.
Also I said each word *at least* three times, so you're going to have to pick the best take of each.

Absolutely the entire thing will have to be edited to remove the other two to six takes you don't want.

Yeesh, that's a lot of work. I'll get on it tomorrow, but I have no idea how soon I'll have it done.

So, I'm doing this in two parts. First, I'm sorting through the whole thing and exporting what I feel to be the best example of the >3 for each word. This will make it possible to update the audio for the Anki decks, which use a single mp3 for each entry. Second, I'll be pasting them all back together with exactly 1 second intervals of silence between them to give to Memrise Ben for uploading there. This might turn out to be more work for me in the long run, but not by much.

In order to do the audio for the gismu places deck/garden, I need "lo broda", "lo se broda", "lo te broda", "lo ve broda", and "lo xe broda". I really just need lo se te ve xe, but it won't sound natural if you say them individually and then they get pasted on.

doi.camgusmis. Can you write a script that can automatically parse the files together to make all the needed audio for the places, or will it need to be done manually?
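
One way the places audio could be scripted, offered as a minimal sketch and not as anything that was actually used for the course. It assumes the marker clips (`lo.wav`, `lo_se.wav`, and so on) have been cut out of natural "lo se broda" recordings, that every clip is a WAV file with the same sample rate, sample width, and channel count, and that the file naming and the `build_place_audio` helper are hypothetical.

```python
import wave
from pathlib import Path

# Place markers taken from the message above; each needs its own clip,
# e.g. "lo se" -> lo_se.wav (hypothetical naming scheme).
PLACE_PREFIXES = ["lo", "lo se", "lo te", "lo ve", "lo xe"]


def concat_wavs(parts, out_path):
    """Write the clips in `parts` back to back into `out_path`.

    All clips must share channel count, sample width, and sample rate.
    """
    if not parts:
        raise ValueError("need at least one clip")
    params = None
    frames = []
    for part in parts:
        with wave.open(str(part), "rb") as clip:
            fmt = (clip.getnchannels(), clip.getsampwidth(), clip.getframerate())
            if params is None:
                params = fmt
            elif fmt != params:
                raise ValueError(f"{part} does not match the format of the first clip")
            frames.append(clip.readframes(clip.getnframes()))
    with wave.open(str(out_path), "wb") as out:
        out.setnchannels(params[0])
        out.setsampwidth(params[1])
        out.setframerate(params[2])
        out.writeframes(b"".join(frames))


def build_place_audio(gismu, prefix_dir, gismu_dir, out_dir):
    """Create e.g. lo_se_klama.wav from lo_se.wav + klama.wav; return the output paths."""
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    outputs = []
    for prefix in PLACE_PREFIXES:
        stem = prefix.replace(" ", "_")
        out_path = out_dir / f"{stem}_{gismu}.wav"
        concat_wavs(
            [Path(prefix_dir) / f"{stem}.wav", Path(gismu_dir) / f"{gismu}.wav"],
            out_path,
        )
        outputs.append(out_path)
    return outputs
```

Calling `build_place_audio("klama", "prefixes", "gismu", "places")` would write five files into `places/`. Whether the pasted result sounds natural enough still has to be judged by ear.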

I said each word a couple of times so as to avoid having to rerecord things if I balked a take.

That being said, if there's a word with no good take, let me know so I can redo it.

I'll be sure to do that.

--
mu'o mi'e .aionys.

.i.e'ucai ko cmima lo pilno be denpa bu .i doi.luk. mi patfu do zo'o
(Come to the Dot Side! Luke, I am your father. :D )

Jonathan Jones

Jul 26, 2012, 2:06:53 PM

to loj...@googlegroups.com

On Thu, Jul 26, 2012 at 12:04 PM, Jonathan Jones <eye...@gmail.com> wrote:

On Wed, Jul 25, 2012 at 3:38 AM, Jonathan Jones <eye...@gmail.com> wrote:

On Wed, Jul 25, 2012 at 3:18 AM, la .lindar. <lindar...@gmail.com> wrote:

Anyway.

Would be able to say you are at least 90% confident that the file meets the requirements as listed on this page?

No. =D
It's done as an MP3 and that clearly asks for a wav, which is going to be a loooooooooooot bigger.
You might be able to type-convert if you don't care about the quality.

I'll upload a WAV version anyway. Let me know if you want/need it.

Don't bother. If I'm going to have to edit it- especially as much as it looks like I'll have to from your later comments- then I may as well just use the mp3 file I've already downloaded. Quality isn't a huge thing- as long as it's good enough to hear clearly, that's all that really matters. We're not making a platinum selling single, after all.

Basically, they want a single audio file with ~1 second pauses between words, and each word must be in order. There's a few more things than that listed on the page, but those are the absolute necessity.

It's mostly in order, but I backtracked in some places, so make sure it's correct.
You'll have to do the spacing and crap like that yourself.
Also I said each word *at least* three times, so you're going to have to pick the best take of each.

Absolutely the entire thing will have to be edited to remove the other two to six takes you don't want.

Yeesh, that's a lot of work. I'll get on it tomorrow, but I have no idea how soon I'll have it done.

So, I'm doing this in two parts. First, I'm sorting through the whole thing and exporting what I feel to be the best example of the >3 for each word. This will make it possible to update the audio for the Anki decks, which use a single mp3 for each entry. Second, I'll be pasting them all back together with exactly 1 second intervals of silence between them to give to Memrise Ben for uploading there. This might turn out to be more work for me in the long run, but not by much.

In order to do the audio for the gismu places deck/garden, I need "lo broda", "lo se broda", "lo te broda", "lo ve broda", and "lo xe broda". I really just need lo se te ve xe, but it won't sound natural if you say them individually and then they get pasted on.

doi.camgusmis. Can you write a script that can automatically parse the files together to make all the needed audio for the places, or will it need to be done manually?

And also maybe a script to do the first-in-frequency second-of-silence next-in-frequency ... second-of-silence last-in-frequency thing?
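
The word / second-of-silence / word layout could be sketched roughly like this, using only Python's standard `wave` module. This is an illustration under assumptions, not the script that was eventually used: it assumes one WAV clip per word, all in the same format, and the `join_with_silence` name and its parameters are made up for the example.

```python
import wave


def join_with_silence(clip_paths, out_path, gap_seconds=1.0):
    """Concatenate WAV clips in the given order, separated by silence."""
    if not clip_paths:
        raise ValueError("need at least one clip")
    with wave.open(str(clip_paths[0]), "rb") as first:
        fmt = (first.getnchannels(), first.getsampwidth(), first.getframerate())
    nchannels, sampwidth, framerate = fmt
    gap_frames = int(round(framerate * gap_seconds))
    # 8-bit WAV samples are unsigned (silence is 0x80); wider samples are signed (silence is 0).
    silent_byte = b"\x80" if sampwidth == 1 else b"\x00"
    silence = silent_byte * (gap_frames * nchannels * sampwidth)
    with wave.open(str(out_path), "wb") as out:
        out.setnchannels(nchannels)
        out.setsampwidth(sampwidth)
        out.setframerate(framerate)
        for index, path in enumerate(clip_paths):
            with wave.open(str(path), "rb") as clip:
                clip_fmt = (clip.getnchannels(), clip.getsampwidth(), clip.getframerate())
                if clip_fmt != fmt:
                    raise ValueError(f"{path} does not match the format of the first clip")
                if index > 0:
                    out.writeframes(silence)  # gap goes between words, not before the first
                out.writeframes(clip.readframes(clip.getnframes()))
```

The order of the output is simply the order of `clip_paths`, so the frequency ordering would come from building that list out of the frequency-ordered gismu list.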

I said each word a couple of times so as to avoid having to rerecord things if I balked a take.

That being said, if there's a word with no good take, let me know so I can redo it.

I'll be sure to do that.

--
mu'o mi'e .aionys.

.i.e'ucai ko cmima lo pilno be denpa bu .i doi.luk. mi patfu do zo'o
(Come to the Dot Side! Luke, I am your father. :D )

Jonathan Jones

Jul 26, 2012, 2:21:54 PM

to loj...@googlegroups.com

ARG! They're not even in freq. order! :(

Jonathan Jones

Jul 26, 2012, 2:32:52 PM

to loj...@googlegroups.com

Actually, it seems they are, mostly. However, it seems when you recorded this, you went from drata to rinka, skipping everything inclusive from pensi to kelci.

I haven't listened to the entire thing, so I may be wrong and you do have those words in here somewhere. I'll find out when I've finished the whole list. I'm only doing a few words at a time, mainly because this is a very boring task and there's only so much I can take at once. :)

Jonathan Jones

Jul 26, 2012, 10:06:13 PM

to loj...@googlegroups.com

I'm not sure if it's just me, but sometimes, Lindar, your ry.s sounds like xy.s, and your xy.s sounds like xry.s. It's odd.

v4hn

Jul 26, 2012, 7:32:25 AM

to loj...@googlegroups.com

On Wed, Jul 25, 2012 at 12:15:05PM -0600, Jonathan Jones wrote:
> Since I'm using Audacity to do the editing, there is, actually. Regardless

> of format, Audacity always converts it to it's own [peeep] format at
> load.

Maybe you would like to use something else then like e.g. traverso[1]?
To the best of my knowledge it works directly on wav files.

mu'umi'e la .van.

---
[1] - http://traverso-daw.org

Jonathan Jones

Jul 31, 2012, 2:29:48 PM

to loj...@googlegroups.com

No. I would not.

la .lindar.

Jul 31, 2012, 8:04:19 PM

to loj...@googlegroups.com

Sent the requested retakes.

Any updates?

Jonathan Jones

Jul 31, 2012, 9:21:43 PM

to loj...@googlegroups.com

I got them. I'm working on them now. It'll be a while before I finish this, though.

On Tue, Jul 31, 2012 at 6:04 PM, la .lindar. <lindar...@gmail.com> wrote:

Sent the requested retakes.
Any updates?


Michael Turniansky

Sep 3, 2012, 3:24:45 PM

to loj...@googlegroups.com

As per usual, I am behind on email, so just getting to poke around here. I notice viska is still TTS, not lindar's voice. Not sure if that's simply because things haven't actually progressed much in this direction in the past month?

--gejyspa

Jonathan Jones

Sep 3, 2012, 3:40:15 PM

to loj...@googlegroups.com

Yeah, pretty much. The work is extremely tedious and boring, so I haven't been putting much effort into doing it. At this point I have separated audio for 179 gismu and 82 minutes' worth of audio. The last time I sat down and really worked on it, it took me ~1 hour to get through 100 words, which was ~7 minutes' worth of audio.
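
For a rough sense of scale, the rate quoted above can be turned into a back-of-the-envelope estimate. This sketch assumes the rate of ~7 minutes of audio per hour of work stays constant, and uses the 1.5-hour length of the original recording mentioned earlier in the thread; the function name is illustrative only.

```python
def editing_hours(audio_minutes, audio_minutes_per_hour):
    """Hours of editing needed at a steady rate of audio processed per hour."""
    return audio_minutes / audio_minutes_per_hour


# 90 minutes of raw recording at ~7 minutes of audio per hour of work.
total = editing_hours(90, 7)
print(f"whole recording: about {total:.1f} hours of editing")  # about 12.9 hours
```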

la .lindar.

Sep 3, 2012, 6:02:12 PM

to loj...@googlegroups.com

Crowd source!!

Jonathan Jones

Sep 5, 2012, 4:18:43 AM

to loj...@googlegroups.com

Volunteers?

On Mon, Sep 3, 2012 at 4:02 PM, la .lindar. <lindar...@gmail.com> wrote:

Crowd source!!


v4hn

Sep 5, 2012, 5:12:04 AM

to loj...@googlegroups.com

> On Mon, Sep 3, 2012 at 4:02 PM, la .lindar. <lindar...@gmail.com> wrote:
>
> > Crowd source!!

On Wed, Sep 05, 2012 at 02:18:43AM -0600, Jonathan Jones wrote:
> Volunteers?

That's not exactly how crowd sourcing works..

- Where's the page with the exact instructions for the task?
(not just some memrise page and a long mail thread)

- Where's a list of jobs to do? I mean "here's the file, everyone, chop it" is _one_
job, not a bunch of them. Just split the word lists (also the audio if possible)
into manageable chunks, number them and add a wiki page,
where people can mark finished items.

- What to do with the final audio file of each job?
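
The job list suggested above could be generated with something like the following sketch. It is only an illustration: the 10-minute chunk length, the column layout, and the function names are assumptions, and the 1.5-hour total comes from the recording length mentioned earlier in the thread.

```python
def make_jobs(total_seconds, chunk_seconds):
    """Split [0, total_seconds) into numbered (job, start, end) intervals."""
    if chunk_seconds <= 0:
        raise ValueError("chunk_seconds must be positive")
    jobs = []
    start = 0
    while start < total_seconds:
        end = min(start + chunk_seconds, total_seconds)
        jobs.append((len(jobs) + 1, start, end))
        start = end
    return jobs


def timestamp(seconds):
    """Format a number of seconds as h:mm:ss."""
    hours, rest = divmod(int(seconds), 3600)
    minutes, secs = divmod(rest, 60)
    return f"{hours}:{minutes:02d}:{secs:02d}"


def job_table(jobs):
    """Render one plain-text row per job, with an empty column for the claimant."""
    rows = ["job | start   | end     | claimed by"]
    for number, start, end in jobs:
        rows.append(f"{number:>3} | {timestamp(start)} | {timestamp(end)} |")
    return "\n".join(rows)


if __name__ == "__main__":
    # 1.5 hours of audio in hypothetical 10-minute chunks.
    print(job_table(make_jobs(90 * 60, 10 * 60)))
```

Fixed time boundaries can land in the middle of a word, so in practice each boundary would need nudging to the nearest pause before the table is posted.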

v4hn

Jonathan Jones

Sep 5, 2012, 5:29:08 AM

to loj...@googlegroups.com

That's /exactly/ how crowd sourcing works. A bunch of people volunteer, and they're told what to do.

IFF I get volunteers, then I will provide exact instructions on what to do, how to do it, and what to do with the finished work.

If I don't get volunteers, then doing any of that work is pointless and a waste of time. As such, there is not currently a page of instructions, nor a list of jobs, nor any other thing.

I hope you don't think that me writing the instructions, cutting the audio into manageable chunks, giving them unique identifiers, creating a means by which volunteers can claim one or more of the chunks to prevent worker overlap, posting all of these things somewhere, and providing a repository for completed jobs, takes no effort on my part, because you would be sorely mistaken. I am not willing to go through all that effort if no one is going to volunteer.

Jonathan Jones

Sep 5, 2012, 6:04:17 AM

to loj...@googlegroups.com

Each job will be essentially the same, with the only difference between them being the actual words in your chunk.

With that in mind, these are the skills and software that are required to do this job:

You must be able to recognize a gismu from its pronunciation. This means that you must be able to identify and type the word solely from the audio. If you know Lojban's pronunciation, this is not difficult. This is required because while Lindar recorded the audio /mostly/ in frequency order, some were skipped, and some needed to be re-recorded. If you need a list of the gismu in frequency order, you can find it here: http://www.lojban.org/tiki/Word+frequency+lists%3A+gismu

It is possible that there are more of both unrecorded and badly recorded words (badly recorded being words that are unidentifiable for any reason). If you encounter either in the course of your job, you must be able to contact Lindar, likely preferably via email, to inform him of these words so that he can submit new recordings. In the case of a word or three, preferably to yourself. In the case of a chunk's worth of words, preferably to me so I can add the recording to the chunk list for claimage.

You must possess, on the computer you will be using to work on, Audacity, available for free here: http://audacity.sourceforge.net/

You must possess, on the same computer, the LAME MP3 encoder. You can download it at this site: http://lame.sourceforge.net/, which provides the source code, or you can go here: http://lame.sourceforge.net/links.php#Binaries to download a binary, or for Windows users you can download the .dll directly from the instructions page that I will make if I get any volunteers.

You must know how to collect a group of files into an archive. It does not matter if you use .zip, .rar, .7z, .tar, .tar.gz, .tar.bz, or any other format not mentioned.

You must know how to attach a file to an email.

You must be able to follow written instructions.

Finally, you must be willing to sacrifice some of your free time to do the actual job. These last two are in my opinion no-brainers.
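
For the archiving requirement, any archive tool will do. As one hedged example, a finished chunk could be bundled with Python's standard `zipfile` module; the directory layout and the `archive_clips` helper are assumptions made for this sketch.

```python
import zipfile
from pathlib import Path


def archive_clips(clip_dir, archive_path, pattern="*.mp3"):
    """Pack every file in `clip_dir` matching `pattern` into a .zip; return the names stored."""
    clips = sorted(Path(clip_dir).glob(pattern))
    with zipfile.ZipFile(archive_path, "w", compression=zipfile.ZIP_DEFLATED) as archive:
        for clip in clips:
            # Store only the file name so the archive has no nested folders.
            archive.write(clip, arcname=clip.name)
    return [clip.name for clip in clips]
```

The resulting .zip can then be attached to an email like any other file.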

v4hn

Sep 5, 2012, 6:08:25 AM

to loj...@googlegroups.com

On Wed, Sep 05, 2012 at 03:29:08AM -0600, Jonathan Jones wrote:
> IFF I get volunteers, then I will provide exact instructions on what to do,
> how to do it, and what to do with the finished work.

Then please do so, as I'm willing to do some of the work.

> I hope you don't think that me writing the instructions, cutting the audio
> into manageable chunks, giving them unique identifiers, creating a means by
> which volunteers can claim one or more of the chunks to prevent worker
> overlap, posting all of these things somewhere, and providing a repository
> for completed jobs, takes no effort on my part, because you would be sorely
> mistaken.
> I am not willing to go through all that effort if no one is going to volunteer.

Writing instructions and adding a table to the wiki with jobs described
by their time interval in Lindar's file might be all the work there is to set this up,
if you're willing to accept the completed job by mail.

v4hn

Jonathan Jones

Sep 5, 2012, 6:18:10 AM

to loj...@googlegroups.com

On Wed, Sep 5, 2012 at 4:08 AM, v4hn <m...@v4hn.de> wrote:

On Wed, Sep 05, 2012 at 03:29:08AM -0600, Jonathan Jones wrote:
> IFF I get volunteers, then I will provide exact instructions on what to do,
> how to do it, and what to do with the finished work.

Then please do so, as I'm willing to do some of the work.

Alright then, I'll work on it tomorrow after I've gotten home from my class. Hopefully there will be others who have volunteered by then. Expect to receive notice of where to go for the instructions and audio chunks by 12 a.m. my time, ~20 hours from the time of this email.

> I hope you don't think that me writing the instructions, cutting the audio
> into manageable chunks, giving them unique identifiers, creating a means by
> which volunteers can claim one or more of the chunks to prevent worker
> overlap, posting all of these things somewhere, and providing a repository
> for completed jobs, takes no effort on my part, because you would be sorely
> mistaken.
> I am not willing to go through all that effort if no one is going to volunteer.

Writing instructions and adding a table to the wiki with jobs described
by their time interval in Lindar's file might be all the work there is to set this up,
if you're willing to accept the completed job by mail.

I have to clean up Lindar's audio before I do any chunking, as there are portions of the audio that are not him reciting gismu. I don't expect to get everything, but the larger bits do need to be removed. IIRC, there is at least one contiguous portion of ~5 minutes or longer in which Lindar is not recording anything at all, because he is waiting for some noise he hears to cease. Seeing as how I'd be doing that anyway, actually splitting the audio into individual files would not be much more effort, and it's possible that the file would need to be split regardless, just due to file size limitations on whatever currently un-chosen thing I use to host the audio. Writing the instructions would probably be the second largest task, and the rest is actually pretty minimal in comparison. As far as the repository is concerned, that part's already taken care of, as the completed work should be emailed to me directly.

Jonathan Jones

Sep 6, 2012, 3:53:28 AM

to loj...@googlegroups.com

On Wed, Sep 5, 2012 at 4:18 AM, Jonathan Jones <eye...@gmail.com> wrote:

On Wed, Sep 5, 2012 at 4:08 AM, v4hn <m...@v4hn.de> wrote:

On Wed, Sep 05, 2012 at 03:29:08AM -0600, Jonathan Jones wrote:
> IFF I get volunteers, then I will provide exact instructions on what to do,
> how to do it, and what to do with the finished work.

Then please do so, as I'm willing to do some of the work.

Alright then, I'll work on it tomorrow after I've gotten home from my class. Hopefully there will be others who have volunteered by then. Expect to receive notice of where to go for the instructions and audio chunks by 12 a.m. my time, ~20 hours from the time of this email.

I have written the instructions, you can view them here: http://tinyurl.com/gismu-instructions

I still need to clean up and chunkify the audio itself.

--
mu'o mi'e .aionys.

.i.e'ucai ko cmima lo pilno be denpa bu .i doi.luk. mi patfu do zo'o
(Come to the Dot Side! Luke, I am your father. :D )

Jonathan Jones

Sep 6, 2012, 5:08:51 AM

to loj...@googlegroups.com

I have written the instructions, you can view them here: http://tinyurl.com/gismu-instructions

The sign up page is here: http://tinyurl.com/gacsp-signup

Pick your chunk, type your name into the sign-up field so everyone else knows you've claimed that chunk, and follow the download link to download your chunk in mp3 format.

There are 11 chunks total. Each chunk is ~6 minutes of audio, which, based on my rather basic math, should be ~100 gismu per chunk.
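
As a rough check of that arithmetic, here is a minimal Python sketch that lays out 11 chunks of 6 minutes each and works out the average time per gismu. The actual chunk boundaries are not given in this thread, so the even 6-minute cuts, the `chunk-NN` labels, and the function names are assumptions for illustration only.

```python
def format_mmss(seconds: int) -> str:
    """Render a whole number of seconds as MM:SS."""
    minutes, secs = divmod(seconds, 60)
    return f"{minutes:02d}:{secs:02d}"


def chunk_intervals(chunk_count: int, chunk_seconds: int) -> list[tuple[str, str, str]]:
    """Return (label, start, end) for equally sized, back-to-back chunks.

    Assumes every chunk has exactly the same length, which is only an
    approximation of the "~6 minutes" figure quoted above.
    """
    intervals = []
    for index in range(chunk_count):
        start = index * chunk_seconds
        end = start + chunk_seconds
        label = f"chunk-{index + 1:02d}"  # hypothetical naming scheme
        intervals.append((label, format_mmss(start), format_mmss(end)))
    return intervals


def seconds_per_word(chunk_seconds: int, words_per_chunk: int) -> float:
    """Average audio time available per word in one chunk."""
    return chunk_seconds / words_per_chunk


# 11 chunks of ~6 minutes (360 s) with ~100 gismu per chunk.
plan = chunk_intervals(11, 360)
pace = seconds_per_word(360, 100)
```

Under these assumptions the plan covers 66 minutes of audio in total, and each gismu gets a little under 4 seconds on average, including the pauses between words.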

When you have completed your chunk, place all the files into an archive and email them to me. Then, if you like, grab another chunk. If you don't like, go do something else.
