Google Groups no longer supports new Usenet posts or subscriptions. Historical content remains viewable.
Dismiss

Announcing fulltext.

20 views
Skip to first unread message

Ben Timby

unread,
Feb 3, 2012, 12:18:44 PM2/3/12
to python-ann...@python.org
Fulltext is a simple Python library for converting document and media
files to text. It's main purpose is for use with full-text indexing
systems.

https://github.com/btimby/fulltext
http://pypi.python.org/pypi/fulltext/0.1-1

For example, to easily extract text from a PDF file:

> python
> import fulltext
> fulltext.get('resume.pdf')
'Experience: ...'
0 new messages