Kindle2 vict...I mean, user

1 view
Skip to first unread message

Juan Fernando Carpio

unread,
Jan 21, 2010, 6:25:57 PM1/21/10
to Mises Documents

Hi guys, some may know me from MU or LRC, etc. Anyway, I am the first
user of a Kindle2 in my campus (USFQ, in Quito, Ecuador) apparently
and although thrilled by the possibilities, I am really far from happy
on the pdf department, even though a good portion of them are readable
enough.

I just wanted to say hi and to know if there are some basics for
hacking it and conversion software other than Mobipocket and Calibre,
etc etc.

Glad to be there, thanks for taking me in.


Juan Fernando

Dave Wentzel

unread,
Jan 21, 2010, 8:04:23 PM1/21/10
to mises-d...@googlegroups.com, Mises Documents
Easiest, but least flexible method is to send your PDF to your_a...@free.kindle.com
. It will be returned for free to your email. Hook up your USB and
upload.

--Dave

On Jan 21, 2010, at 6:25 PM, Juan Fernando Carpio
<jfca...@gmail.com> wrote:

> --
> You received this message because you are subscribed to the Google
> Groups "Mises Documents" group.
> To post to this group, send email to mises-d...@googlegroups.com.
> To unsubscribe from this group, send email to mises-documen...@googlegroups.com
> .
> For more options, visit this group at http://groups.google.com/group/mises-documents?hl=en
> .
>

Isaac Bergman

unread,
Jan 21, 2010, 11:06:38 PM1/21/10
to Mises Documents
Today I got the Bastiat Collection to look pretty decent.

I started with the PDF and opened it with a full version of Adobe
Acrobat 9.0. In the menu bar, select "Document", and then near the
bottom an option called "Inspect Document". The sidebar will open a
new panel revealing the document parsing. In this process it will scan
for metadata, TOCs, and for hidden data. The hidden data will
typically include all the annoying quark (qxd) information that was
being included in your PDF -> mobi conversion.

When it's done scanning, it will have checkboxes next to each of those
categories and (+) sign. Click the (+) to reveal the scan results.
Click on the hidden data results and it will open a new window showing
you a page by page comparison and you can change the setting to see
the hidden data only, or the regular text, or both. In the case of the
Bastiat Collection PDF, there was a quark header on every page. In the
couple of pages I checked, the hidden data was only the unimportant
quark metatags, so I went back to the side panel and unchecked the
first two checkboxes and left only the hidden data checked. Then on
the bottom of the side panel I clicked the 'remove' to clear out that
entire selection, and then saved my PDF.

The next thing I did was select "file" and then "export as" and
selected HTML 4.0 + CSS. This is to preserve the hyperlinks from the
TOC to the actual chapters. If you would convert from the PDF to the
mobi without doing this step, your TOC will be less useful than the
tree pulp TOC sort. Now finally open Calibre and import your htm or
html file. It will add it to the library but for some reason will save
it internally as a .zip file. Now finally click 'Convert' and choose
mobi if your making this for your Kindle, or ePub if its for something
else.

The formatting of the pages is very good except for the occasional
page number popping in every, well, every page!

The only downside I had to using this PDF->HTML->Mobi process was for
documents that have lots of embedded images- like Walter Blocks
'Defending the Undefendable" will lose all the images in the process
of converting to an HTML file. They will actually get exported into a
separate folder, and I haven't seen an option for importing them along
the html file into Calibre.

I didn't see any better export options in Adobe Acrobat and I played
with a few of them, namely XML 1.0, and rtf (with embedded images).
The results were not any better with those and with the few documents
I played with.

Oh, and the weirdest surprise came from Hoppe's Economic Science and
the Austrian Method. IIRC, the PDF appears to based off an OCR scan.
Now get this- all the text from the OCR scan is saved in the document
as metadata. So when I used my Acrobat cleansing method, it turned it
back into a plain PDF file with only the original scanned images
visible, and while the text is there, it is no longer selectable!

I guess this process is going to utilize a variety of solutions. Find
what works best for mainly-text, another process for graphic-laden
texts and another for the non-OCRed PDFs.


On Jan 21, 8:04 pm, Dave Wentzel <d...@davewentzel.com> wrote:
> Easiest, but least flexible method is to send your PDF to your_acco...@free.kindle.com

Reply all
Reply to author
Forward
0 new messages