Seeking (proofreading) volunteers for a Sanskrit OCR project


विश्वासो वासुकिजः (Vishvas Vasuki)

Feb 26, 2016, 10:26:48 AM
bcc: sanskr...@googlegroups.com, sanskrit-p...@googlegroups.com, संस्कृतसन्देशश्रेणिः samskrta-yUthaH <sams...@googlegroups.com>, bhAratIya-vidvat-pariShad भारतीय-विद्वत्परिषद् <bvpar...@googlegroups.com>, sb...@googlegroups.com, SB-US rAShTriya-shikshakagana, Suhas, Pooja, Sumana, Vasuvaj, Sai, Sudarshan, nandu, Shriramana

We're seeking volunteers, especially proofreaders, to contribute to a Sanskrit OCR project (which may be particularly useful for existing projects such as Sanskrit Wikipedia and Wikisource).

Details about the scope of the project, what's involved, and how to join (from the project homepage here):
========================

Purpose

  • OCR important Sanskrit texts whose digitized versions are not available.
  • Proofread OCR-ed texts to fix errors.

Why digitize?

  • Want to easily search for the source of that amazing shloka someone mentioned?
  • Want to look up a certain term in (say) the Abhyankar grammar dictionary on your phone?
  • Want to read flowing text comfortably on your tablet or phone? Want your annotations to be safe forever? (See advantages described here)
  • Want to carry an entire library on your tablet?
  • Want your phone to read out a certain text?
  • Dream of a richer Sanskrit Wikipedia, Wikisource, etc.?

How to request texts to be OCR-ed?

Just create an issue here. Be sure to answer all questions.

What scan quality can you currently offer?

To get an idea of the scan quality that we can currently offer, see:

How to contribute?

  • Can you help us OCR texts? See the workflow described here to know what's involved.
  • Can you help us proofread and mark up texts? See the workflow described here to know what's involved.

If you think you can do either of the above, join and email us at sanskr...@googlegroups.com (web-ui).


विश्वासो वासुकिजः (Vishvas Vasuki)

Feb 26, 2016, 10:46:32 AM
bcc: same recipients as before.

2016-02-26 7:26 GMT-08:00:
Details about the scope of the project, what's involved and how to join up (from the Project homepage here):

​Fixing the link above: Project homepage here.



--
Vishvas /विश्वासः

विश्वासो वासुकिजः (Vishvas Vasuki)

Mar 1, 2016, 4:05:56 PM
to sanskrit-programmers, Mārcis Gasūns
+ marcis

Hey Marcis, I just stumbled upon this: https://docs.google.com/document/d/1Dr2DNOITiCHzDktWGr7QIt4CkHTHnp1cB4lfwU9rePw/edit. Can you:
* share the OCR output and the books listed there?
* describe how to extract the OCR layer from the PDFs?