How to start using Kaldi?

156 views
Skip to first unread message

farzaneh tabatabaee

unread,
Feb 17, 2018, 2:46:44 PM2/17/18
to kaldi-help
Hello Everyone,

I'm a beginner in Kaldi , I want to use Kaldi , I need some tutorials or documents , specially for using Kaldi for OCR. May you please introducing me some documents or tutorials about using Kaldi or using Kaldi for OCR?
Message has been deleted

Jan Trmal

unread,
Feb 17, 2018, 3:20:06 PM2/17/18
to kaldi-help
we have some OCR recipes now, Zoltan. When you have the lines of characters isolated (and rectified), you can use the same framework as is used in ASR. At least on a conceptual level.
y.

On Sat, Feb 17, 2018 at 3:08 PM, Zoltán Somogyi <zsomo...@gmail.com> wrote:
In case OCR means optical character recognition then you will need an other software because kaldi is speech recognition.


On Saturday, February 17, 2018 at 8:46:44 PM UTC+1, farzaneh tabatabaee wrote:
Hello Everyone,

I'm a beginner in Kaldi , I want to use Kaldi , I need some tutorials or documents , specially for using Kaldi for OCR. May you please introducing me some documents or tutorials about using Kaldi or using Kaldi for OCR?

--
Go to http://kaldi-asr.org/forums.html find out how to join
---
You received this message because you are subscribed to the Google Groups "kaldi-help" group.
To unsubscribe from this group and stop receiving emails from it, send an email to kaldi-help+unsubscribe@googlegroups.com.
To post to this group, send email to kaldi...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/kaldi-help/cd125969-8640-4b50-9a4a-2101c3933eb4%40googlegroups.com.

For more options, visit https://groups.google.com/d/optout.

Message has been deleted

Daniel Povey

unread,
Feb 17, 2018, 3:59:29 PM2/17/18
to kaldi-help
There are examples in egs/iam and egs/uw3 (the latter is actually OCR).
These are for setups where you have already done the segmentation into lines of text.

There are some fixes pending to the IAM setup, though (we had mistakenly included the training text in the language modeling training data, so I believe you can't trust the results right now; we are working on a fix).

Dan


On Sat, Feb 17, 2018 at 3:47 PM, Zoltán Somogyi <zsomo...@gmail.com> wrote:
OCR is a complex matter and there are surely some open source packages to do that but doing that with kaldi sounds strange. I am sorry but if I may be honest with you then I would not recommend it.


On Saturday, February 17, 2018 at 9:20:06 PM UTC+1, Yenda wrote:
we have some OCR recipes now, Zoltan. When you have the lines of characters isolated (and rectified), you can use the same framework as is used in ASR. At least on a conceptual level.
y.
On Sat, Feb 17, 2018 at 3:08 PM, Zoltán Somogyi <zsomo...@gmail.com> wrote:
In case OCR means optical character recognition then you will need an other software because kaldi is speech recognition.

On Saturday, February 17, 2018 at 8:46:44 PM UTC+1, farzaneh tabatabaee wrote:
Hello Everyone,

I'm a beginner in Kaldi , I want to use Kaldi , I need some tutorials or documents , specially for using Kaldi for OCR. May you please introducing me some documents or tutorials about using Kaldi or using Kaldi for OCR?

--
Go to http://kaldi-asr.org/forums.html find out how to join
---
You received this message because you are subscribed to the Google Groups "kaldi-help" group.
To unsubscribe from this group and stop receiving emails from it, send an email to kaldi-help+...@googlegroups.com.

To post to this group, send email to kaldi...@googlegroups.com.

--
Go to http://kaldi-asr.org/forums.html find out how to join
---
You received this message because you are subscribed to the Google Groups "kaldi-help" group.
To unsubscribe from this group and stop receiving emails from it, send an email to kaldi-help+unsubscribe@googlegroups.com.
To post to this group, send email to kaldi...@googlegroups.com.
Message has been deleted

Daniel Povey

unread,
Feb 17, 2018, 4:15:02 PM2/17/18
to kaldi-help
Kaldi doesn't do the whole pipeline, it's applicable to pre-segmented data where you already have the text lines.


On Sat, Feb 17, 2018 at 4:11 PM, Zoltán Somogyi <zsomo...@gmail.com> wrote:
May I ask you which software you use for the OCR? Is it something that is installed separately? Normally OCR means that you have an image and that you recognize the characters on the image and you make sentences from it.

--
Go to http://kaldi-asr.org/forums.html find out how to join
---
You received this message because you are subscribed to the Google Groups "kaldi-help" group.
To unsubscribe from this group and stop receiving emails from it, send an email to kaldi-help+unsubscribe@googlegroups.com.
To post to this group, send email to kaldi...@googlegroups.com.
Reply all
Reply to author
Forward
Message has been deleted
0 new messages