Please review https://github.com/blt/project-alexandria/tree/owl_creek_bierce
Travel down b/ and let me know what you think. I've set the directory
structure to be alphabetical, which works OK for now. This is
divergent from my outstanding pull-request to the main project, but I
think I prefer this far more than the flat-files in root approach.
On Mon, Feb 27, 2012 at 9:56 PM, startling <tdixo...@gmail.com> wrote:
> First of all: hi, I lurk on HN and saw this and I'm pretty excited
> about it.
>
> So, I'd like to format a short-ish story, maybe [An Occurrence at Owl
> Creek Bridge][] by Ambrose Bierce. How should the formatting look?
> There was some discussion about this in the Welcome to Project
> Alexandria but I feel this warrants its own thread. rst seems like a
> decent choice, but I have a few questions (and suggestions) about how
> basic things should be handled.
>
> Line breaks: I'd like to suggest that lines be wrapped to 80 or 90
> characters. It makes reading and writing a thousand times easier, and
> it can be easily undone by automated tools. Paragraph breaks would be
> double-newlines, which seems consistent with how rst is usually
> handled.
I chose 80: `fmt -w 80`
> Line numbers: I'd like to do away completely with line numbers and
> page numbers in source texts. They're distracting, hard to format
> around, and easily added by automated tools. These aren't ubiquitous
> in Project Gutenberg texts, but they're terrible, so I feel I need to
> mention them.
Agreed.
> Metadata: the license information and metadata at the beginnings and
> ends of Project Gutenberg texts have always seemed extraneous to me.
> Could we have these things in separate files? a license.txt and
> metadata.txt seem like they would work. Maybe we should have a
> readme.rst to cooperate with github? It would be neat if we could add
> little descriptions or reviews of the text somewhere, especially for
> the lesser-known works that don't exist online but for Project
> Gutenberg. I think metadata should include (at the very least) word
> count, authors, editors, contributors, original language, text
> language, translators, publishing date, source isbn (if it's
> available) and maybe a set of topic tags. I'd suggest YAML or JSON for
> metadata, leaning heavily towards YAML as it's meant to be human-
> written. JSON is nicer to write than XML, but it's easy to leave a
> trailing comma or something that invalidates the whole file.
I dropped the metadata into 'metadata.yml' and the license into
'license.txt'. I didn't fill out metadata.yml as much as you'd
intended.
> Encoding: All tools should work with utf-8 by default. Utf-8 texts
> shouldn't have BOM because they make editing and writing tools a pain.
> We could use utf-16 when double-width characters are needed, in which
> case we would need a BOM; I suspect these would require a different
> set of tools, anyway.
>
> It seems prudent not to worry about specifics, but to stick to a well-
> defined standard for easy conversion later should we change our minds.
> These are just a few suggestions I have, and places I could see being
> a little thorny.
>
> I hope this doesn't come off as too opinionated or bikesheddy. Thanks
> for reading!
Patches welcome if I missed your vision!
> [An Occurrence at Owl Creek Bridge]: http://www.gutenberg.org/ebooks/375
>
> --
> You have received this message from the Project Alexander mailing list.
> The IRC channel is #ProjectAlexandria on irc.freenode.net.
> The project-central repository is here: https://github.com/felix-faber/project-alexandria
--
Brian L. Troutwine
martin
--
Brian L. Troutwine