I had been looking for a morphologically tagged LXX for research and
came across the CATSS LXXM text [1]. The one thing lacking for my use of
this text was that it is encoded in betacode rather than Unicode.
A search showed that many people have taken this text and converted it
to Unicode for embedding in web sites, but to my knowledge nobody is
publishing the equivalent plain text files. The Unbound Bible comes
closest, but it publishes the text and the morphological analysis in
two separate files, which is suboptimal. So I decided to convert the
LXXM to Unicode myself.
Luckily, James Tauber has shared a Greek betacode to Unicode conversion
script [2], which took care of most of the hard work for me. Using this,
I was able to convert all of the texts from betacode to Unicode. I am
sharing the result as a git repository [3]. Please take a look.
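For anyone unfamiliar with what the conversion involves, here is a
minimal sketch of the general idea. It is not James Tauber's script and
does not reflect how that script works internally; the function name
beta_to_unicode and the tables are my own illustrative assumptions. It
handles only the basic letters, the common diacritics, capitals marked
with "*", and final sigma, so other betacode features would need more
work.

```python
import re
import unicodedata

# Basic betacode letters mapped to lowercase Greek (illustrative table).
LETTERS = {
    "A": "α", "B": "β", "G": "γ", "D": "δ", "E": "ε", "Z": "ζ",
    "H": "η", "Q": "θ", "I": "ι", "K": "κ", "L": "λ", "M": "μ",
    "N": "ν", "C": "ξ", "O": "ο", "P": "π", "R": "ρ", "S": "σ",
    "T": "τ", "U": "υ", "F": "φ", "X": "χ", "Y": "ψ", "W": "ω",
}

# Betacode diacritics mapped to Unicode combining characters.
DIACRITICS = {
    ")": "\u0313",   # smooth breathing
    "(": "\u0314",   # rough breathing
    "/": "\u0301",   # acute
    "\\": "\u0300",  # grave
    "=": "\u0342",   # circumflex (perispomeni)
    "|": "\u0345",   # iota subscript
    "+": "\u0308",   # diaeresis
}


def beta_to_unicode(text):
    """Convert a betacode string to precomposed (NFC) Unicode Greek."""
    out = []
    pending = []      # diacritics written between "*" and a capital letter
    capital = False
    for ch in text.upper():
        if ch == "*":
            capital = True
        elif ch in DIACRITICS:
            # For capitals the diacritics precede the letter in betacode,
            # so hold them until the letter has been emitted.
            (pending if capital else out).append(DIACRITICS[ch])
        elif ch in LETTERS:
            letter = LETTERS[ch]
            out.append(letter.upper() if capital else letter)
            out.extend(pending)
            pending.clear()
            capital = False
        else:
            out.append(ch)  # spaces, punctuation, anything unrecognised
    greek = unicodedata.normalize("NFC", "".join(out))
    # A lowercase sigma not followed by another letter is a final sigma.
    return re.sub(r"σ(?!\w)", "ς", greek)


print(beta_to_unicode("E)N A)RXH=| E)POI/HSEN O( QEO/S"))
```

Composing with combining characters and then normalizing to NFC keeps
the tables small; a lookup of every precomposed character would also
work but needs a much larger table.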
My long-term goal is to edit the text to add some features and clean
things up a bit. Basically I'd like the text to look more like the
MorphGNT. A part of that may include adding this text to the Open
Scriptures project. This will probably result in a lot of interesting
discussion on this list (e.g. regarding LXX versification, which
includes prologues, song titles, abc verses, verse "13/14", etc.).
Please note that this resource has a rather novel license which
requires users to fill out a user declaration and send it in to the
CCAT program at the University of Pennsylvania (see
0-user-declaration.txt in the repo).
Let me know if you find anything to be corrected or have questions.
[1] http://ccat.sas.upenn.edu/gopher/text/religion/biblical/lxxmorph/
[2] http://jtauber.com/blog/2005/01/27/betacode_to_unicode_in_python/
[3] https://gitorious.org/lxxmorph-unicode
--
Nathan Smith
http://nathan.smithfam.info/
PGP key ID 0x147aed15