How can I open and parse MS Word documents (.docx) in NLTK?


Swarup Beria

Dec 24, 2014, 3:33:34 AM
to nltk-...@googlegroups.com
Can NLTK work directly on .docx files, or do they first have to be converted to .txt?

Regards
Swarup

Maciej Ibrahim Pastuszka

Dec 24, 2014, 5:39:00 AM
to nltk-...@googlegroups.com
I think you might be interested in this project if you want to work with .DOCX files:
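
For context, a .docx file is a ZIP archive whose main body text lives in `word/document.xml`, so the text can be extracted with the Python standard library and then handed to NLTK as an ordinary string. The following is a minimal sketch under that assumption, not the project referred to above; the function name `docx_to_text` is hypothetical, and the code only reads body paragraphs (headers, footers, and footnotes are ignored).

```python
import zipfile
import xml.etree.ElementTree as ET

# WordprocessingML namespace used inside word/document.xml
W_NS = "{http://schemas.openxmlformats.org/wordprocessingml/2006/main}"


def docx_to_text(path):
    """Return the body text of a .docx file, one paragraph per line.

    `path` may be a filename or a file-like object (hypothetical helper,
    written for illustration only).
    """
    with zipfile.ZipFile(path) as archive:
        root = ET.fromstring(archive.read("word/document.xml"))

    paragraphs = []
    for para in root.iter(W_NS + "p"):
        # A paragraph is split into runs; the w:t elements hold the text.
        pieces = [node.text or "" for node in para.iter(W_NS + "t")]
        paragraphs.append("".join(pieces))
    return "\n".join(paragraphs)
```

The returned string can then be passed to NLTK like any other text, for example `nltk.word_tokenize(docx_to_text("report.docx"))`, where `report.docx` is a placeholder filename and the NLTK tokenizer data is assumed to be installed.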



