Help needed!

11 views
Skip to first unread message

Sergey Khenkin

unread,
Mar 28, 2008, 4:34:19 AM3/28/08
to SICP Subtitles
Hi everybody!

I created the project at Google Code for creating and translating SICP
subtitles.
Lecture 1-A is already there and 5 issues related to it which I can't
fight mmyself are there too.
Subtitles are stored under version control (Subversion is used).
Issues can be created and tracked. Wiki is available.

So please feel free to let me know if you want to join the project and
help with creating subtitles.
I am sure this is a very important work and only together we can
complete it.

Best regards,
Sergey Khenkin

Anton Tayanovskyy

unread,
Mar 28, 2008, 4:27:04 AM3/28/08
to sicp-su...@googlegroups.com
Pryvit,

I'm in!

--A

Anton Tayanovskyy

unread,
Mar 28, 2008, 9:55:26 AM3/28/08
to sicp-su...@googlegroups.com
Hi everyone,

Today I looked for Video+Subtitles software. Results:

1. SRT format looks good - let's stick to it;

2. Subtitle editors - I found gnome-subtitles which looks OK (and helps
to translate - two SRTs side-by-side, with preview). What it can't do,
though, is splitting the video automatically into time periods. Did your
software do that, Sergey, or how did you work it out?

3. Video and audio codecs: the lectures (at least L1 that I have) use an
AVI container with DIV3 video and Mp3 audio. They are not free! By
default, Linux distros do not ship them. The only alternative I found:
Theora (video) + Vorbis (audio). I am trying right now to transcode
Lecture-1a to these.

4. Container formats: AVI is good, but... It does not support SRT
subtitles, e.g. you can't package them into the same file. Ideally, we'd
like to package together with the video stream several languages in text
and audio streams, and let the user choose. Again, if we stick to free
containers, then it looks like we have Matroska (mkv), OGG, and OGM.
Which to choose? OGG is the most widespread, but subtitle support is bad
(unless you package it into Annodex, which looks complicated). OGM
works, but reportedly is a 'crude hack' that nobedy will remember in a
few years. I tried the Matroska and it worked with the AVI file
(put .AVI and .SRT inside) but failed to work with Theora/OGG.

5. Another (simpler?) way would be to use 'hard' titles, that is, render
the titles right into the video. I have not yet found software to do
that - most software seems to focus on 'soft' titles, and those do look
preferable.


--A

Sergey Khenkin

unread,
Mar 28, 2008, 12:13:44 PM3/28/08
to SICP Subtitles
Anton,

> 1. SRT format looks good - let's stick to it;

IMO, it's the simplest and the most widespread format.
We can add more effects and use other formats supporting bells and
whistles later on if needed.

> 2. Subtitle editors - I found gnome-subtitles which looks OK (and helps
> to translate - two SRTs side-by-side, with preview). What it can't do,
> though, is splitting the video automatically into time periods. Did your
> software do that, Sergey, or how did you work it out?

What do you mean by splitting the video into time periods? Why would
you use it?

> 3. Video and audio codecs: the lectures (at least L1 that I have) use an
> AVI container with DIV3 video and Mp3 audio. They are not free! By
> default, Linux distros do not ship them. The only alternative I found:
> Theora (video) + Vorbis (audio). I am trying right now to transcode
> Lecture-1a to these.

Well, I am mostly MS Windows user and there're no such problems there.
Video lectures I have (the lighter ones, DivX-encoded) can be played
without any problem on WinXP.

> 4. Container formats: AVI is good, but... It does not support SRT
> subtitles, e.g. you can't package them into the same file. Ideally, we'd
> like to package together with the video stream several languages in text
> and audio streams, and let the user choose. Again, if we stick to free
> containers, then it looks like we have Matroska (mkv), OGG, and OGM.
> Which to choose? OGG is the most widespread, but subtitle support is bad
> (unless you package it into Annodex, which looks complicated). OGM
> works, but reportedly is a 'crude hack' that nobedy will remember in a
> few years. I tried the Matroska and it worked with the AVI file
> (put .AVI and .SRT inside) but failed to work with Theora/OGG.

All the video players I used loaded subtitles automatically when
subtitles file name was the same as the video file name and they were
located in the same directory.
So no container was needed.
Of course there can be players that can't load subtitles that way. Is
it so on Linux?

> 5. Another (simpler?) way would be to use 'hard' titles, that is, render
> the titles right into the video. I have not yet found software to do
> that - most software seems to focus on 'soft' titles, and those do look
> preferable.

You're absolutely right. Soft titles are more flexible and
customizable and I think they are the preferable way for us as we'll
probably end up with several sets of subtitles for different
languages.

Regards,
Sergey Khenkin

Anton Tayanovskyy

unread,
Mar 28, 2008, 12:28:45 PM3/28/08
to sicp-su...@googlegroups.com
Hey Sergey,

Thanks for the reply!

> > 2. Subtitle editors - I found gnome-subtitles which looks OK (and helps
> > to translate - two SRTs side-by-side, with preview). What it can't do,
> > though, is splitting the video automatically into time periods. Did your
> > software do that, Sergey, or how did you work it out?
>
> What do you mean by splitting the video into time periods? Why would
> you use it?

Oh, I mean the actual subtitle frames. You made 1012 of those - and
it's actually very good! I tried doing that once and it was so
time-consuming. Did you just manually decide where each frame
stops/starts? Or is there a software that can do that, maybe by sound
levels?

> > 3. Video and audio codecs: the lectures (at least L1 that I have) use an
> > AVI container with DIV3 video and Mp3 audio. They are not free! By
> > default, Linux distros do not ship them. The only alternative I found:
> > Theora (video) + Vorbis (audio). I am trying right now to transcode
> > Lecture-1a to these.
>
> Well, I am mostly MS Windows user and there're no such problems there.
> Video lectures I have (the lighter ones, DivX-encoded) can be played
> without any problem on WinXP.

No, there aren't any _technical_ problems on Linux either :) I
installed Mp3 and DIVX support - shhh - don't tell anyone. It is more
an issue of using FREE technology and being SAFE from litigation.

You know why Linux does not ship MP3 and DIVX? Microsoft does that,
and it was sued for $1.5 billion dollars for "patent infringements",
see, for example, here:

http://www.iht.com/articles/2007/02/23/business/web-0223microsoft.php

That is why people push OGG...


> > 4. Container formats: AVI is good, but... It does not support SRT
> > subtitles, e.g. you can't package them into the same file. Ideally, we'd
> > like to package together with the video stream several languages in text
> > and audio streams, and let the user choose. Again, if we stick to free
> > containers, then it looks like we have Matroska (mkv), OGG, and OGM.
> > Which to choose? OGG is the most widespread, but subtitle support is bad
> > (unless you package it into Annodex, which looks complicated). OGM
> > works, but reportedly is a 'crude hack' that nobedy will remember in a
> > few years. I tried the Matroska and it worked with the AVI file
> > (put .AVI and .SRT inside) but failed to work with Theora/OGG.
>
> All the video players I used loaded subtitles automatically when
> subtitles file name was the same as the video file name and they were
> located in the same directory.
> So no container was needed.
> Of course there can be players that can't load subtitles that way. Is
> it so on Linux?

Linux is OK - Totem loaded SRT just fine. I just want to clarify - I
absolutely agree that our main project aim will be to produce and
distribute the SRT files. Packaging them together with the video can
just be a fringe, well, some day in the future. In an ideal world we
would also have someone READ the text in Russian (etc.)

It is simply more convenient, especially when there are several
languages. Maybe at some point we'll also produce and distribute a DVD
with the translated lectures. Again, SOME day in the future :)


Cheers,

--A

Sergey Khenkin

unread,
Mar 28, 2008, 5:47:53 PM3/28/08
to sicp-su...@googlegroups.com
Hi Anton,

>> What do you mean by splitting the video into time periods? Why would
>> you use it?
>
> Oh, I mean the actual subtitle frames. You made 1012 of those - and
> it's actually very good! I tried doing that once and it was so
> time-consuming. Did you just manually decide where each frame
> stops/starts? Or is there a software that can do that, maybe by sound
> levels?

It was really a manual and time-consuming process. You should also
constantly think about the best way to break speech into frames.
I don't know of any software that can help. However such software may
exists.

> No, there aren't any _technical_ problems on Linux either :) I
> installed Mp3 and DIVX support - shhh - don't tell anyone. It is more
> an issue of using FREE technology and being SAFE from litigation.
>
> You know why Linux does not ship MP3 and DIVX? Microsoft does that,
> and it was sued for $1.5 billion dollars for "patent infringements",
> see, for example, here:
>
> http://www.iht.com/articles/2007/02/23/business/web-0223microsoft.php
>
> That is why people push OGG...

But almost everyone uses mp3 anyway ;)
However I see your point here.
Do we infringe anybody's rights creating subtitles for video encoded
with proprietary codecs?

> Linux is OK - Totem loaded SRT just fine. I just want to clarify - I
> absolutely agree that our main project aim will be to produce and
> distribute the SRT files. Packaging them together with the video can
> just be a fringe, well, some day in the future. In an ideal world we
> would also have someone READ the text in Russian (etc.)

Hmm... Sounds really interesting. Who knows?

> It is simply more convenient, especially when there are several
> languages. Maybe at some point we'll also produce and distribute a DVD
> with the translated lectures. Again, SOME day in the future :)

There's a lot of work to do before that :)
But that would be really nice.

Regards,
Sergey Khenkin

Reply all
Reply to author
Forward
0 new messages