Even with simple images also I am not getting proper results. But If I ran tesseract.exe which is given in github I am getting better results. Is something(Like any preprocessing internally tesseract is doing?) they are doing internally?
My code is simple as below:
Pix* image = pixRead(imagePath);
ocr = new tesseract::TessBaseAPI();
if (ocr->Init(dataPath, languageCode) != -1) {
//ocr->SetPageSegMode(tesseract::PSM_RAW_LINE);
return SDS_SUCCESS;
}
if (!image)
{
return NULL;
}
ocr->SetImage(image);
text = ocr->GetUTF8Text();
if (text)
{
return text;
}
else
{
return NULL;
}