Example book


Tom Morris

Mar 24, 2015, 2:52:58 PM
to gitenber...@googlegroups.com
This book that I worked on 8-9 years ago (!) at
DistributedProofreaders just got posted to PG.

Canadiana Online collection http://eco.canadiana.ca/view/oocihm.07539/7?r=0&s=1
Canadiana Online source volume
http://eco.canadiana.ca/search?so=oldest&lang=fra&collection=jsr

DistributedProofreaders project
http://www.pgdp.net/c/project.php?id=projectID3ff5e30465b91

Project Gutenberg posting http://www.gutenberg.org/ebooks/48562

While it's a relatively complex example with translations on facing
pages, archaic spellings & ligatures, macron-like marks, and other fun
features, it would be useful to evaluate the proposed toolchain
against it to see how much loss of fidelity there would be.

On the metadata front, I'd like to have the provenance of all these
sources linked together.

Tom

Stuart A. Yeates

Mar 24, 2015, 10:27:23 PM
to gitenber...@googlegroups.com
> On the metadata front, I'd like to have the provenance of all these
> sources linked together.

If you're looking for a metadata format for that kind of thing, it
might be worth considering a TEI header with an encodingDesc
http://www.tei-c.org/release/doc/tei-p5-doc/en/html/ref-encodingDesc.html

If that's something we're willing to seriously entertain, I can post
to the TEI list (which I'm on) and get the example fleshed out.
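As a rough starting point for that discussion, here is a minimal Python sketch of what such a header might look like when generated from the links in Tom's message. The function name `build_tei_header`, the placeholder title, the source labels, and the wording of the note are all assumptions for illustration; the element names (`teiHeader`, `fileDesc`, `sourceDesc`, `bibl`, `ref`, `encodingDesc`, `projectDesc`) are standard TEI P5, but the right way to arrange them for provenance is exactly what the TEI list could advise on.

```python
import xml.etree.ElementTree as ET

TEI_NS = "http://www.tei-c.org/ns/1.0"
ET.register_namespace("", TEI_NS)  # serialize TEI as the default namespace


def q(tag):
    """Return the namespace-qualified form of a TEI element name."""
    return f"{{{TEI_NS}}}{tag}"


def build_tei_header(title, sources, pg_url, encoding_note):
    """Build a skeletal teiHeader linking a posted text to its sources.

    `sources` is a list of (label, url) pairs. This is a hypothetical
    helper, not part of any existing toolchain.
    """
    header = ET.Element(q("teiHeader"))
    file_desc = ET.SubElement(header, q("fileDesc"))

    title_stmt = ET.SubElement(file_desc, q("titleStmt"))
    ET.SubElement(title_stmt, q("title")).text = title

    # Where the finished text was published.
    pub_p = ET.SubElement(ET.SubElement(file_desc, q("publicationStmt")), q("p"))
    pub_p.text = "Posted to Project Gutenberg: "
    ET.SubElement(pub_p, q("ref"), target=pg_url).text = pg_url

    # One bibl entry per upstream source, each carrying a link.
    source_desc = ET.SubElement(file_desc, q("sourceDesc"))
    for label, url in sources:
        bibl = ET.SubElement(source_desc, q("bibl"))
        bibl.text = label + ": "
        ET.SubElement(bibl, q("ref"), target=url).text = url

    # Free-text description of how the encoding relates to the sources.
    encoding_desc = ET.SubElement(header, q("encodingDesc"))
    project_desc = ET.SubElement(encoding_desc, q("projectDesc"))
    ET.SubElement(project_desc, q("p")).text = encoding_note
    return header


if __name__ == "__main__":
    header = build_tei_header(
        title="Placeholder title",  # assumption: the real title is not given here
        sources=[
            ("Canadiana Online", "http://eco.canadiana.ca/view/oocihm.07539/7?r=0&s=1"),
            ("DistributedProofreaders project",
             "http://www.pgdp.net/c/project.php?id=projectID3ff5e30465b91"),
        ],
        pg_url="http://www.gutenberg.org/ebooks/48562",
        encoding_note="Transcribed and proofread from page scans.",
    )
    print(ET.tostring(header, encoding="unicode"))
```

Keeping the links in `sourceDesc` and the process description in `encodingDesc` is only one possible split; a fleshed-out example from the TEI list would supersede this.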

cheers
stuart



--
...let us be heard from red core to black sky

Eric Hellman

Mar 24, 2015, 10:38:26 PM
to gitenber...@googlegroups.com
Stuart,

That would be a huge help.

I'm particularly interested in vocabularies that can be reused.

I'm looking at DC, BIBFRAME, Schema.org, BIBO, ONIX, etc., though overall I'm a novice with these.

Eric

--
You received this message because you are subscribed to the Google Groups "GITenberg Project" group.
To unsubscribe from this group and stop receiving emails from it, send an email to gitenberg-proj...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/gitenberg-project/CAC_Lu0YRgSemeomwmcHUPP%2B%3DburVQ0J8SyWyUBHci2UQfThTzw%40mail.gmail.com.
For more options, visit https://groups.google.com/d/optout.

Seth Woodworth

Mar 24, 2015, 11:48:28 PM
to gitenber...@googlegroups.com
An impressive work!

(1) I too would love to get DP data into our metadata.
Do you know of any way to list the internal DP project ids?
The project pages for completed DP projects link to PG, and would allow us to connect the records.
But I can only seem to find DP project pages for works in-progress.

(2) I highly suspect that asciidoc won't be up to the task.
I'm not sure what would be, other than potentially TEI or raw HTML.





Tom Morris

Mar 25, 2015, 12:06:24 PM
to gitenber...@googlegroups.com
On Tue, Mar 24, 2015 at 11:48 PM, Seth Woodworth <se...@sethish.com> wrote:

> (1) I too would love to get DP data into our metadata
> do you know of any way to list the internal DP project ids?
> The project pages for completed DP projects link to PG, and would allow us to connect the records.
> But I can only seem to find DP project pages for works in-progress.

I'll check and see if I can get a list.  The closest I see without digging too much is this list of a few thousand that came from IA image scans and went to PG after being processed.

Tom

Raymond Yee

Mar 25, 2015, 12:13:00 PM
to gitenber...@googlegroups.com
When I tried to locate a completed Distributed Proofreaders project, I had to resort to searching for the title of the project.

For example, to locate http://www.gutenberg.org/ebooks/48573, I did a search for "British and Foreign Arms and Armour" in the title field of the search form (http://www.pgdp.net/c/tools/project_manager/projectmgr.php?show=search_form):

http://www.pgdp.net/c/tools/project_manager/projectmgr.php?show=search&title=British+and+Foreign+Arms+and+Armour

to turn up

http://www.pgdp.net/c/project.php?id=projectID505c8a74597b3&detail_level=3


-Raymond

Tom Morris

Mar 25, 2015, 12:19:54 PM
to gitenber...@googlegroups.com
On Wed, Mar 25, 2015 at 12:12 PM, Raymond Yee <raymo...@gmail.com> wrote:
> When I tried to locate a completed Distributed Proofreader project, I had to resort to doing a search for the title of the project.
>
> For example, to locate http://www.gutenberg.org/ebooks/48573, I did a search for "British and Foreign Arms and Armour" in the title field of the search form (http://www.pgdp.net/c/tools/project_manager/projectmgr.php?show=search_form):
>
> http://www.pgdp.net/c/tools/project_manager/projectmgr.php?show=search&title=British+and+Foreign+Arms+and+Armour

A much simpler and more direct way is to just search on the PG etext number:

http://www.pgdp.net/c/tools/project_manager/projectmgr.php?show=search&postednum=48573

Tom 

Raymond Yee

Mar 25, 2015, 12:22:18 PM
to gitenber...@googlegroups.com


On 3/25/15 9:19 AM, Tom Morris wrote:
>
> A much simpler and more direct way is to just search on the PG etext
> number:
>
> http://www.pgdp.net/c/tools/project_manager/projectmgr.php?show=search&postednum=48573

Even better, Tom -- thanks.

So it seems we could gently run a series of queries on the DP servers
to create the mappings of PG to DP ids....
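
A minimal Python sketch of what "gently" could mean here, in case no ready-made list turns up. The query URL follows the `postednum` search above. Everything else is an assumption for illustration: the function names, the five-second default delay, and especially the guess that the search results page contains links of the form `project.php?id=projectID...`. The `fetch` callable is supplied by the caller, so nothing here contacts the DP servers on its own.

```python
import re
import time
from urllib.parse import urlencode

SEARCH_URL = "http://www.pgdp.net/c/tools/project_manager/projectmgr.php"

# Assumption: result pages link to projects as project.php?id=projectID<hex>.
PROJECT_ID_RE = re.compile(r"project\.php\?id=(projectID[0-9a-f]+)")


def search_url_for(pg_number):
    """Build the DP search URL for one PG etext number."""
    return SEARCH_URL + "?" + urlencode({"show": "search", "postednum": pg_number})


def extract_project_ids(html):
    """Return the distinct DP project ids found in a page, in order of appearance."""
    ids = []
    for project_id in PROJECT_ID_RE.findall(html):
        if project_id not in ids:
            ids.append(project_id)
    return ids


def build_mapping(pg_numbers, fetch, delay_seconds=5.0, sleep=time.sleep):
    """Map each PG number to its DP project ids, pausing between requests.

    `fetch` takes a URL and returns the page text; `sleep` is injectable
    so the pacing can be checked without actually waiting.
    """
    mapping = {}
    for index, pg_number in enumerate(pg_numbers):
        if index > 0:
            sleep(delay_seconds)  # one request at a time, with a pause between
        mapping[pg_number] = extract_project_ids(fetch(search_url_for(pg_number)))
    return mapping
```

A list published by DP itself would be preferable to any of this, and the delay should be whatever the DP admins are comfortable with.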

-Raymond

Tom Morris

Mar 25, 2015, 12:42:52 PM
to gitenber...@googlegroups.com
On Wed, Mar 25, 2015 at 12:22 PM, Raymond Yee <raymo...@gmail.com> wrote:

> So it seems we could gently run a series of queries on the DP servers
> to create the mappings of PG to DP ids....

I did say I'd check to see if a list was available.  Why not give me an hour or two to do that before you start hammering their search page?

Tom