YOLO / text detection - need help


Anna Muller

Jan 31, 2023, 10:30:49 AM
to tesseract-ocr
Hi,

Has anybody implemented text detection using YOLO or similar software? I want to tag screenshots of TikTok videos (using VoTT) so that I can extract text from certain areas, pass the tagged images into YOLOv3, and then alter the image to make the text more readable against the noisy background.
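
For illustration only, here is a minimal sketch of the step between detection and image clean-up described above. It assumes the detector returns boxes in the usual normalized YOLO form (center x, center y, width, height, each in the range 0 to 1) and turns one into a padded pixel rectangle that could then be cropped and enhanced before OCR. The function name `yolo_to_crop_box` and the `pad` parameter are hypothetical, not part of any YOLO distribution.

```python
def yolo_to_crop_box(xc, yc, w, h, img_w, img_h, pad=4):
    """Convert a normalized YOLO box to a padded pixel box.

    xc, yc, w, h are assumed to be fractions of the image size (0..1).
    Returns (left, top, right, bottom) in pixels, clamped to the image.
    """
    left = (xc - w / 2) * img_w - pad
    top = (yc - h / 2) * img_h - pad
    right = (xc + w / 2) * img_w + pad
    bottom = (yc + h / 2) * img_h + pad
    # Clamp so the padding never pushes the crop outside the image.
    return (
        max(0, int(round(left))),
        max(0, int(round(top))),
        min(img_w, int(round(right))),
        min(img_h, int(round(bottom))),
    )


# Example: a box centered in a 1080x1920 portrait screenshot.
print(yolo_to_crop_box(0.5, 0.5, 0.5, 0.25, 1080, 1920, pad=10))
```

A little padding around the detected box is a design choice here: text that touches the crop border tends to be harder to recognize, so leaving a small margin before any thresholding or denoising may help.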

However, I am stuck on the YOLOv3 part. Does anyone have experience with this, or know of a simple YOLO distribution to use? Or can anyone give me pointers in general?

I would greatly appreciate it.

Thank you

Zdenko Podobny

Jan 31, 2023, 1:51:00 PM
to tesser...@googlegroups.com
Hello,

I have something like this on my "to try" list.

As far as I understand, you need to train YOLO for text detection.

One of the most interesting bookmarks I have in this area is, IMO, https://roboflow.com. They offer to annotate images with their tool [1], run training there, and export the model (even to a Raspberry Pi [2]). They also offer a synthetic dataset, The Open Poetry Vision, for OCR tasks, so maybe it would speed up training for your project.
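
To make the training-data side concrete, here is a rough sketch of converting one tagged region into a YOLO label line. It assumes the annotation tool gives each box as left, top, width, and height in pixels; the YOLO text label format is one line per box, `class x_center y_center width height`, with the four values normalized by the image size. The function name `to_yolo_label` is hypothetical.

```python
def to_yolo_label(class_id, left, top, width, height, img_w, img_h):
    """Build one YOLO label line from a pixel bounding box.

    The box is assumed to be given as left/top/width/height in pixels.
    The output values are fractions of the image width and height.
    """
    xc = (left + width / 2) / img_w   # normalized box center, x
    yc = (top + height / 2) / img_h   # normalized box center, y
    return (
        f"{class_id} {xc:.6f} {yc:.6f} "
        f"{width / img_w:.6f} {height / img_h:.6f}"
    )


# Example: one "text" region (class 0) in a 1080x1920 screenshot.
print(to_yolo_label(0, 270, 720, 540, 480, 1080, 1920))
```

Each image would then get a `.txt` file with one such line per tagged text region. Hosted tools may do this conversion for you on export, so this is only needed if you prepare the labels by hand.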


BR,

Zdenko

