Hi! A friend just forwarded this to me (I have now joined the list).
I think I can save you a lot of work on this. I've been wanting such a
database for years, and finally worked on it seriously last spring. I
got most of the names in, though eventually burned out (it's awfully
tedious work after a while). But, I was able to get help from the
excellent folks at
ccel.org, and a summer student there finished it.
What we now have is an OWL database of close to 4000 thousand names,
with many variant spellings and forms, and with information to
disambiguate them (there are about 30 Zechariahs, for example).
There's a specific sequence of rules we followed for how to create an
unambiguous form (the first of them is to append the father's name if
available, following "_" (I think "_" is nice because you can
pronounce it "bar", which is Hebrew for "son of"); but there are
several more rules to cover cases where that's not sufficient (turns
out that when available, the father's name is almost always
sufficient).
The entries for most individuals also have fields for father, mother,
gender, occupation (king, prophet, warrior, savior,...) , tribal
affiliation, etc. where applicable and available. There's also at
least one, and often all, verse references where the person is
mentioned. They are also linked to the names (and disambiguation
numbers) used in multiple published "all the people in the bible"
kinds of reference books, and to the particular translation that uses
the spelling. Alternates use the OWL "SameAs" consruct to link back to
one "main" spelling, and ambiguous names have a separate entry
(without any suffix to disambiguate), that specifically says they're
an AmbiguousName, and lists the people that spelling can refer to.
No doubt it has some errors (though OWL can check for many kinds of
inconsistencies), and there are some cases where you simply can't tell
what a name is -- for example in Nehemiah there are many names that
might be either cities or people. But the data is very thorough. The
data is in Manchester OWL syntax, which is awfully easy to convert to
anything else one might want, as well as being quite human-readable as-
is, and supported by appropriate applications such as the (open-
source) Protege' ontology editor.
I have additional data on cities, mountains, rivers, and so on; people
groups; languages; month names; artifacts; Bible translations; Canon
lists; and a variety of other object classes (have planned but barely
begun a comparable database of HB/GR/AR words). They classes are
linked directly in to the standard SUMO ontology, widely used in
ontological R&D.
The data does not have a few of the items you suggested, such as an
English gloss for the "meaning" of the name, or links to "related"
names (except that it *does* have alternate spellings and alternate
names for the same individual (for example "Simon" vs. "Peter", etc.).
And no "encyclopedic" information.
This data is freely available to use; I do ask that enhancements and/
or corrections be sent back so I can incorporate them and avoid having
a thousand slightly-varying versions all over the place (just because
such variation is more pain for everyone!). I meant to put this up
already for public access, but have been a bit slow on the final
proofreading.
Let me know if you're interested.
Steve DeRose
Oh, a sample entry (see what I mean about easy to read/parse?):
Individual: Abagtha
Types: Human
Facts: Tr KJV, Gender "M", Profession "Chamberlain",
Affiliation Ahasuerus