Which is the best approach for soft hyphen in for website?

23 views
Skip to first unread message

Amir Simantov

unread,
Oct 25, 2014, 9:29:34 PM10/25/14
to sanskrit-p...@googlegroups.com
Hi. I have stumbled upon this group while searching "sanskrit soft hyphen". I am building a website which has very long Sanskrit words. The site uses php and I think that I can add any technique for the soft hyphenating.

My questions:
  1. Which technique/library/plugin which is out there (as open source code) you have found good for Sanskrit?
  2. Does the technique you mention has any restriction on searching well the text?
Thanks.

Mārcis Gasūns

unread,
Nov 2, 2014, 4:10:05 PM11/2/14
to sanskrit-p...@googlegroups.com


On Sunday, 26 October 2014 05:29:34 UTC+4, Amir Simantov wrote:
Hi. I have stumbled upon this group while searching "sanskrit soft hyphen". I am building a website which has very long Sanskrit words. The site uses php and I think that I can add any technique for the soft hyphenating.
Can you code php weel enough to build a library yourself?
 

My questions:
  1. Which technique/library/plugin which is out there (as open source code) you have found good for Sanskrit?
None, let's develop some. I can show you what I've done in printed books.  Our .xls macro gives us such an output:

i12has2tha
i12hasthā2na
i1hāt2mati1kā
i1hāmu1t2ra2pha2labho12gavi1rā2ga
i1hār2tha
ī
īkā2ra
īkṣ
īkṣa1ṇa1

That can be split easily afterwards, based on similar TeX rules. Read more on https://docs.google.com/document/d/1Ktm-rMjZnOGFdwN7u7gE1WzkhohPcPMQA3XrRSvn56o/edit (in Russian about Sanskrit hyphenation issues).
  1. Does the technique you mention has any restriction on searching well the text?
No, search should work as expected.
 
Thanks.

Amir Simantov

unread,
Dec 30, 2014, 1:20:36 PM12/30/14
to sanskrit-p...@googlegroups.com
Hi Mārcis,

Thanks for your quick reply. I did not get any notification about it in my email, so I just saw your answer now when checking on this issue.

I will learn the issue better for a future phase of a site of my given project and we can continue then. I have to learn some Russian in the meantime, spasiba :)

Anyway, I am using for now hyphenator.js which breaks words in places my customer (an academic researcher) still have to check.

Thanks again,
Amir
Reply all
Reply to author
Forward
0 new messages