The Golden Ass of Apuleius, 1893
gutenburg: https://www.gutenberg.org/ebooks/1666
archive https://archive.org/details/goldenassofapule00apuliala
wiki: https://en.wikipedia.org/wiki/The_Golden_Ass
Cover – cropped to title character, lower edge at about the left knee, no rescale
“Fotis sees her Lover Lucius Transformed into an Ass.
Motif from Apeleius' The Golden Ass”, 1809
source: https://bit.ly/3O4FBGE
legit: https://open.smk.dk/artwork/image/KMS591
Artist
Nicolai Abildgaard
wiki: https://en.wikipedia.org/wiki/Nicolai_Abildgaard
Author:
short: Apuleius
long: Lucius Apuleius Madaurensis OR? Lucius Apuleius “Africanus”
wiki: https://en.wikipedia.org/wiki/Apuleius
Translator:
name: William Adlington
wiki: https://en.wikipedia.org/wiki/William_Adlington
Introducer:
name: Charles Whibley
wiki: https://en.wikipedia.org/wiki/Charles_Whibley
Translation Editor:
name: W. E. Henley
wiki: https://en.wikipedia.org/wiki/William_Ernest_Henley
Front matter:
Translation Editor’s Dedication (Henley)
Introduction (Whibley) [missing from Gutenburg]
Translator’s Dedication (Adlington)
To the Reader (Adlington) [missing from Gutenburg]
The Life of … (Adlington)
Preface (Apuleius)
Body:
95K words, structured as 48 chapters, grouped as 11 books
It looks like this book contains ~95k words of which ~8.6k are unique and ~2.6k are not in the long Scrabble dictionary located here (enable1.txt at https://github.com/dolph/dictionary) and thus suspicious for being “old”. Allowing for proper names (etc), I expect this list will reduce to ~2k. I think I can script the substitutions of old for new spellings assigning a commit to each, though this will be many commits. I know there will be some exceptions not found this way, e.g. “doe” but expect this number will be small and easily spotted during the read-through. I’m using “dictionary.cambridge.org” where UK vs US spellings are in question and assume I will need keep the scripts out of the repository. I’m more optimistic and so will continue formatting the front-matter that wasn’t included in Gutenburg.
- Paul
To unsubscribe from this group and stop receiving emails from it, send an email to standardebook...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/standardebooks/ab3aa948-96db-4d5b-b95a-996b79a45b14n%40googlegroups.com.
This is still a work in progress and I have not begun to address capitalization. The substitutions I so far plan to make are located here: https://github.com/kingrpaul/modernize_english
archaic_words.txt (1640)
A list of word pairs with the first being an existing word and the second what it is to become. There’s just a few instances where a space appears in a “word” and compound words are treated as words. All are lower case and letters only. If transcription errors have slipped into this list, that’s not the intent. So many obscure words and so very many spellings. Plurals etc appear as separate words. Possessives do not. Possessives missing an apostrophe in the source are. I’ve run into only one case where this is not a mapping: “sowne” maps onto both “sewn” and “sown” from context. The reverse is not true, modern words are frequently spelled more than one way.
archaic_proper_nouns.txt (42)
Same as before, but limited to proper nouns and capitalized. I mean to use this to fix bad capitalizations.
obsolete_words.txt (29)
A list of obsolete former-words with an intended replacement. I tried to minimize the scale of this. Words that are fundamentally the same as the original are meant to be on the other list and any word that is fundamentally different is meant to be here. This is for convenience. They’ll be treated the same.
Other files in the repository are:
french_words.txt
german_words.txt
greek_words,txt
italian_words.txt
latin_words.txt (taken from inside “la” tags)
good_common_words.txt (from a public list)
good_proper_nouns.txt (spellings checked)
good_checked_words.txt (spellings checked, includes UK versions of common US words)
I have a little over a hundred words not yet in a bin. Some, I do not yet know what they mean. If anything here looks wonky, thanks in advance for the heads-up.
- Paul
- Paul
Proper or common noun?
1. If a pagan character uses a monotheist-sounding idiom (in translation) that includes the word “God”, e.g. “I would to God that” or “God forbid”, it should be capitalized?
2. The word used like a job title as in “the God Osiris” should be a capitalized?
4. The word used to refer to a specific deity whose identity is known to the reader should be capitalized as it stands in place of a name, e.g. “the Goddess” when we know this refers to Isis, akin to UK English capitalizing the “Queen”?
5. Or, all ancient uses of “god” and “goddess” should be lower case?
Gutenburg and Archive are inconsistent on this, with themselves and each other.
As suggested, this production has been challenging, but progress continues.
Thanks in advance.
- Paul
x-017, Error, content.opf, Duplicate value for id attribute, part-1 (etc)
Thanks in advance.
- Paul
https://github.com/kingrpaul/apuleius_the-golden-ass-of-apuleius_william-adlington
The project is here:
https://github.com/kingrpaul/apuleius_the-golden-ass-of-apuleius_william-adlington
Some working notes are here, may be helpful in review, but is not fully complete:
https://github.com/kingrpaul/modernize_english/blob/main/all_words.csv
format: [existing word (or phrase), substitution if any, notes]
Research was required in places, some of it interesting, if there are questions. I’m certain there are words that remain in their original form and I have a newfound appreciation for Samuel Johnson.
Thanks in advance.
- Paul
Started doing that after thinking a project was going to fail,
mid-way when I couldn't find an image. Also, i found myself making
dummy covers to proofread with. The "of" title was Adlingon's and
I originally used his publication date. Makes sense those should
stay together. I'll take a look. Thanks.
--
You received this message because you are subscribed to a topic in the Google Groups "Standard Ebooks" group.
To unsubscribe from this topic, visit https://groups.google.com/d/topic/standardebooks/ye7RUN956iw/unsubscribe.
To unsubscribe from this group and all its topics, send an email to standardebook...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/standardebooks/34DF6A0C-5FE7-4D12-9763-6E1C592C2C04%40standardebooks.org.