We are students from St. Francis Institute of Technology, Mumbai.We intend to make a hocr for modi lipi which will be able to identify and translate basic modi words or sentences.
our basic idea is to make an ocr system that will extract modi lipi from the handwritten image and send it to aksharmukha to translate it to devnagari
we need help with understanding what kind of dataset is required and how to train and use tesseract,
we are total noobs and new to the concept of tesseract