The trouble with PDF is that, contrary to expectations, it behaves much more like an image format than a text document format. In other words, it doesn’t know anything about paragraphs, or headers, or footers; all it knows about are simple instructions to draw a given letter at given coordinates. (Worse than that, some PDFs are actually just embedded bitmaps.)
That means that converting a PDF into a conventional document is more akin to “optical character recognition” than to ordinary file format conversion. It takes machine learning or sophisticated heuristics for software to figure out the structural relationships behind the document image. There is some effective software available to do this conversion, but it tends to be expensive because it’s such a hard problem and the capability is so valuable.
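To give a feel for the kind of heuristics involved, here is a minimal sketch in Python. It assumes the glyphs have already been pulled out of the PDF as (character, position) records; the `Glyph` class, the `place` helper and the threshold values are all made up for illustration, and real converters have to cope with far more (columns, rotated text, ligatures, missing width data and so on).

```python
from dataclasses import dataclass


@dataclass
class Glyph:
    """One drawn character: hypothetical stand-in for a PDF text instruction."""
    char: str
    x: float      # left edge, in points
    y: float      # baseline, in points (origin at the bottom-left of the page)
    width: float  # advance width, in points


def place(text, x, y, char_width=6.0):
    """Illustrative helper: emit glyphs for text, skipping spaces as PDFs often do."""
    return [Glyph(c, x + i * char_width, y, char_width - 1.0)
            for i, c in enumerate(text) if c != " "]


def group_into_lines(glyphs, y_tolerance=2.0):
    """Cluster glyphs with nearly equal baselines, top of page first."""
    lines = []
    for g in sorted(glyphs, key=lambda g: -g.y):
        if lines and abs(lines[-1][0].y - g.y) <= y_tolerance:
            lines[-1].append(g)
        else:
            lines.append([g])
    return [sorted(line, key=lambda g: g.x) for line in lines]


def line_to_text(line, space_gap=1.5):
    """Guess word breaks: insert a space wherever the horizontal gap is large."""
    out = [line[0].char]
    for prev, cur in zip(line, line[1:]):
        if cur.x - (prev.x + prev.width) > space_gap:
            out.append(" ")
        out.append(cur.char)
    return "".join(out)


def extract_paragraphs(glyphs, paragraph_gap=18.0):
    """Guess paragraph breaks: start a new one when the vertical gap is large."""
    paragraphs, current, prev_y = [], [], None
    for line in group_into_lines(glyphs):
        y = line[0].y
        if prev_y is not None and prev_y - y > paragraph_gap:
            paragraphs.append(" ".join(current))
            current = []
        current.append(line_to_text(line))
        prev_y = y
    if current:
        paragraphs.append(" ".join(current))
    return paragraphs


# Glyphs can arrive in any order; only their coordinates carry the structure.
page = place("New para", 72, 650) + place("second line", 72, 688) + place("Hello world", 72, 700)
print(extract_paragraphs(page))  # ['Hello world second line', 'New para']
```

Note that every piece of structure here (words, lines, paragraphs) is a guess based on distances, which is why the results are so sensitive to the thresholds and why robust conversion is hard.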
Best wishes
Jeremy.