Hi and welcome to the group.
I think the "handwritten" aspect is probably at least as important as the "new language" part. Tesseract was designed to do optical character recognition of mechanically printed texts. Handwriting is very different. There have been some attempts to do this in the past, but only with block printed characters and, even then recognition rates were under 90% which isn't adequate for most uses. If you search the archives here or google "tesseract handwriting" (without the quotes), you'll find lots of reading material.
There is a Latin script model which has been trained in a language independent fashion, so you could give that a try to see how well it does (modulo your superscript u).
For training with natural images (standard training uses synthesized images), look at some of the examples in the
tesstrain wiki, particularly the GT4HistOCR
page.
For any training you'll need ground truth text matched with your segmented line images to train on.
Good luck! It sounds like an interesting (but non-trivial) project.
Tom