pdf 转换 html

15 views
Skip to first unread message

Shuge Lee

unread,
Jun 20, 2009, 11:02:48 AM6/20/09
to python-cn`CPyUG`华蟒用户组(中文Py用户组)
体验了一下
dev-libs/poppler

极度不靠谱,转一个中文pdf,就只有封面第一页是成功,其它均无效

理想的效果是,可以弄成google book那样,指定一个 pdf转成html,然后可以在broswer上不依赖adobe plugin就可以
直接查看

def pdf2html(pdf_path):
....
return html_index_uri

@@

unread,
Jun 20, 2009, 11:06:00 AM6/20/09
to pyth...@googlegroups.com
Google book是不是图片的

2009/6/20 Shuge Lee <shug...@gmail.com>

Shuge Lee

unread,
Jun 21, 2009, 3:38:05 AM6/21/09
to python-cn`CPyUG`华蟒用户组(中文Py用户组)
即使是转图片,也不知道如何实现的

On Jun 20, 11:06 pm, "@@" <ask...@gmail.com> wrote:
> Google book是不是图片的
>

> 2009/6/20 Shuge Lee <shuge....@gmail.com>

xxmplus

unread,
Jun 21, 2009, 3:41:35 AM6/21/09
to pyth...@googlegroups.com
一般都是ocr加上captcha吧

2009/6/21 Shuge Lee <shug...@gmail.com>:
> 即使是转图片,也不知道如何实现的
>

--
Any complex technology which doesn’t come with documentation must be the best
available.

Reply all
Reply to author
Forward
0 new messages