Hello Zdenko,
Thanks for the update. You may want to edit the subject line to reflect the discussion: 'Plans for Tesseract-3.04rc'.
Yes, it is possible that a number of issues would be resolved by new 'traineddata' files from Ray. However, if he is only planning to release the source language data files, then we may also have the extra task of building traineddata files from them. I hope Ray will clarify and also let us know the timeline of the expected release.
Is https://docs.google.com/spreadsheets/d/1ePMcP1f6ot0fMbBlZ40llC_7PX_1yib4RTN6N4G5OlI/edit#gid=0 the NEW issue tracker? If so, I would suggest adding columns for the version of tesseract, the OS affected, and the date the issue was filed.
I can test on msys2, Windows 8 and maybe Windows 7. My interest is in improving the training for Indic languages, so I would be testing the training tools as well. My focus will be on Devanagari-based Indic languages such as Hindi, Sanskrit, Marathi etc. I can also review Gujarati and Tamil for basic info. Srirangaji can test for Kannada.
Shree
On Thursday, October 30, 2014 2:55:19 AM UTC+5:30, Zdenko Podobný wrote:
I am sorry for the late reply, but I am overloaded with my "regular tasks", so there is no time for free projects...
IMO these steps should be done before the next release:
- Release of new language data - this was promised by Ray in the past and it looks like the key open topic to me.
- Check the wikis and other project documentation files (INSTALL, README etc.) - first check/improve the content, and then have it checked by a native English speaker for grammar etc.
- Check the issue tracker :-):
  - check whether the issue is still valid with the current code
  - check whether there is a test case that helps replicate the error (if not, ask for/create a simple test case)
  - group the issues with extended info[1] (I can add edit rights for those who want to collaborate) => this will trigger other actions: what shall be fixed for this release (e.g. issues related to language files), what will be postponed... IMO issues <= 1066 are commented well, but a double check will help.
Besides the above, it would be great to have a community testing team, e.g. for each platform (linux, windows, ios) or maybe for each compiler (gcc, clang, msys2, msys, VS 2009, VS 2010...). It is important that the testers use it on a regular basis. The reason for such a team is e.g. issue 1354[1]: I guess that the usage of uintptr_t will break support for VS2009 (which is needed for python2.x on windows).
Windows packager(s) are especially welcome too (building the library, creating the installer etc.).
There are more things to be done (also after the release of the source code)... So if somebody wants to help, just ask for a task on this forum.
[1] https://docs.google.com/spreadsheets/d/1ePMcP1f6ot0fMbBlZ40llC_7PX_1yib4RTN6N4G5OlI/edit?usp=sharing
Zdenko
On Sat, Oct 25, 2014 at 8:49 AM, ShreeDevi Kumar <shree...@gmail.com> wrote:
Zdenko,
Do you know what milestones we are waiting for before the next release?
Is there anything that the tess community can do to help?
ShreeDevi
____________________________________________________________
भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com
On Sat, Oct 25, 2014 at 3:44 AM, Jeff Breidenbach <breid...@gmail.com> wrote:
Yesterday's Ubuntu release fixed the training tools omission, as documented in the Tesseract FAQ. Otherwise it is the exact same code as Ubuntu 14.10.
The mixed language PDF improvement I mentioned in the previous post is complete. But it won't ship with Ubuntu until after Tesseract has made a formal release. The next Ubuntu release will be April 2015.
You received this message because you are subscribed to the Google Groups "tesseract-dev" group.
To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-de...@googlegroups.com.
To post to this group, send email to tesser...@googlegroups.com.
Visit this group at http://groups.google.com/group/tesseract-dev.
To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-dev/6dbe79f7-206e-4eae-a5dc-ec6e7edf3af7%40googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-dev/CAG2NduXV_LD3BcXa6PzPd_O4KjxeSLR%2BEps%3DoZh5EmJbua1YUA%40mail.gmail.com.
--
What will be in it:
A bunch of fixes for issues, including 1245, 1205, 1241, 899, 1229, 1246, 1243, 1264, 1207.
Language-specific issues: 792, 865, 758, 969, 1254. If there are any more like these, an email summarizing them would be really useful, as I am preparing to do some major retraining this week/next week.
I have fixed a bunch of problems with our internal tools for generating those langdata wordlist files. They will be totally refreshed for 3.04, and should be a lot better for a lot of languages, and include more languages.
The new release will include a refreshed set of traineddata files. The idea is to use regressions from training to flush out accuracy bugs, so it may take a while longer yet, but it ought to lead to at least some improvements.
That spreadsheet looked awfully long. I really appreciate Zdenko's efforts in summarizing the important issues, so if anyone else wants to help out with that, it would help. There is a trade-off though between fixing issues and getting the next release out...
Ubuntu 15.04 | April 2015 | January 2016 |
--
GitHub creates a tarball/zip automatically for each release or tag[1], or you can get the master copy via links like [2] or [3].
Zdenko
--
* I haven't even tried to package tesstrain.sh yet
* lots of font challenges with tesstrain.sh
  - can't find some of the fonts
  - others like 'noto' are opentype instead of truetype and don't seem to work
if [ "$(uname)" == "Darwin" ]; then
    FONTS_DIR="/Library/Fonts/"
else
    FONTS_DIR="/usr/share/fonts/truetype/"
fi
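A minimal sketch, assuming it helps to see how many TrueType and OpenType files sit under the chosen font directory, since the OpenType 'noto' fonts were the ones that did not seem to work. The function names `fonts_dir_for` and `count_fonts` are hypothetical and are not part of tesstrain.sh; the two directory paths are the ones from the snippet above.

```shell
#!/bin/sh
# Hypothetical helpers for illustration; not part of tesstrain.sh.

# Print the default font directory for an OS name as reported by `uname`.
fonts_dir_for() {
    if [ "$1" = "Darwin" ]; then
        echo "/Library/Fonts/"
    else
        echo "/usr/share/fonts/truetype/"
    fi
}

# Count files with the given extension (e.g. ttf, otf) anywhere under a
# directory. The match is case-sensitive; a missing directory counts as 0.
count_fonts() {
    find "$1" -type f -name "*.$2" 2>/dev/null | wc -l | tr -d ' '
}

FONTS_DIR="$(fonts_dir_for "$(uname)")"
echo "TrueType files under $FONTS_DIR: $(count_fonts "$FONTS_DIR" ttf)"
echo "OpenType files under $FONTS_DIR: $(count_fonts "$FONTS_DIR" otf)"
```

On Linux the default path ends in `truetype/`, so OpenType fonts installed elsewhere would not be counted here; pointing `count_fonts` at a different directory is one way to check where a missing font actually lives.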
--
Awesome. Someone with write permission, please tag.
--
What will be the process for releasing this on other platforms?
- sent from my phone. excuse the brevity
--
October 22nd | FinalRelease Ubuntu 15.10 |
--
Tesseract can be installed directly on Ubuntu using apt-get.
Command: sudo apt-get install tesseract-ocr
You need the source only if you want the latest changes made after the 3.04 release.
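As a rough sketch of how one might check whether the packaged version is already at 3.04 before deciding to build from source: the helper `version_at_least` is hypothetical, it relies on `sort -V` from GNU coreutils, and the assumption that the first line of `tesseract --version` looks like "tesseract 3.04.00" should be verified on your system.

```shell
#!/bin/sh
# Hypothetical helper: succeed if version $1 is at least version $2.
# Relies on `sort -V` (version sort) from GNU coreutils.
version_at_least() {
    [ "$(printf '%s\n%s\n' "$1" "$2" | sort -V | head -n 1)" = "$2" ]
}

# Assumption: the first line of `tesseract --version` looks like
# "tesseract 3.04.00". 2>&1 is used in case the line goes to stderr.
if command -v tesseract >/dev/null 2>&1; then
    installed="$(tesseract --version 2>&1 | head -n 1 | awk '{print $2}')"
    if version_at_least "$installed" "3.04"; then
        echo "Packaged tesseract $installed is at least 3.04"
    else
        echo "Packaged tesseract $installed predates 3.04; consider building from source"
    fi
else
    echo "tesseract is not installed; try: sudo apt-get install tesseract-ocr"
fi
```

The check only compares version numbers; changes committed after the 3.04 release still require a source build, as noted above.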
--
I think that we should release a 3.04.01 (bug-fix) version:
- We planned to make it when we made the 3.04.00 release, because there was an expectation that an OpenCL fix would come soon...
- There are a lot of other fixes in the 3.05 branch that need to be transferred to the 3.04 branch (AFAIK only the "monitor" and "cmake" patches could be considered new features => they should stay in the 3.05 branch)
IMO it would be nice to additionally have these issues solved ASAP:
- API compatibility with version 3.02 (namely tesseract::TessBaseAPI::ProcessPages and tesseract::TessBaseAPI::ProcessPage); the incompatibility causes some tesseract wrappers to stop working
- OpenCL
- check/fix docs (e.g. all examples on the wiki should be tested with the latest code)
- and of course close as many open issues as possible[1]
On Thu, Feb 4, 2016 at 3:04 AM, zdenko podobny <zde...@gmail.com> wrote:
I think that we should release a 3.04.01 (bug-fix) version:
- We planned to make it when we made the 3.04.00 release, because there was an expectation that an OpenCL fix would come soon...
- There are a lot of other fixes in the 3.05 branch that need to be transferred to the 3.04 branch (AFAIK only the "monitor" and "cmake" patches could be considered new features => they should stay in the 3.05 branch)
I don't see a 3.05 branch. Is 'master' effectively the 3.05 branch?
IMO it would be nice to additionally have these issues solved ASAP:
- API compatibility with version 3.02 (namely tesseract::TessBaseAPI::ProcessPages and tesseract::TessBaseAPI::ProcessPage); the incompatibility causes some tesseract wrappers to stop working
- OpenCL
- check/fix docs (e.g. all examples on the wiki should be tested with the latest code)
- and of course close as many open issues as possible[1]
That all sounds like good stuff, but it could be a lot to squeeze into the two-week window that Jeff is talking about.
Tom
--
I should probably ship some time within the next 24 hours....
--
Can you have a look at #223 and #224 (there is a message "This branch has conflicts that must be resolved")?
I changed it as below to get it to compile:
char* id_buffer = new char[bufsize];
--
Not urgent, but please take a look at this discussion:
It suggests moving a three-line function from baseapi.h to baseapi.cc for better ABI compatibility. Supposedly no downside. Thoughts?