Calling Apache Tika jar file from MongoDB


Ravish Dave

Feb 4, 2016, 4:41:25 AM
to mongodb-user
I want to call the following Java jar file from a MongoDB environment:
java -jar tika-app-1.2.jar --text <filename>

Please suggest how I can do this.

Wan Bachtiar

Feb 23, 2016, 1:28:18 AM
to mongodb-user

java -jar tika-app-1.2.jar --text <filename>

Hi Ravish,

This command reads filename and outputs its content as plain text. Are you trying to read documents stored in MongoDB and output them as plain text? Or are you perhaps trying to insert the content of filename as plain text into MongoDB?
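
If it is the latter, one possible approach is to run Tika as a child process from a client program and insert its output through a MongoDB driver. The following is only a rough sketch in Python, not a tested solution. It assumes java is on the PATH, that the jar sits in the working directory, and that the pymongo driver is installed. The database name "docs", the collection name "extracted", the field names, and the connection URI are all placeholders I made up for illustration.

```python
import subprocess
from pathlib import Path


def build_tika_command(jar_path, filename):
    """Return the argument list for the command quoted in this thread."""
    return ["java", "-jar", str(jar_path), "--text", str(filename)]


def extract_text(jar_path, filename):
    """Run Tika as a child process and return the extracted plain text."""
    result = subprocess.run(
        build_tika_command(jar_path, filename),
        capture_output=True,
        text=True,   # decode stdout using the locale's preferred encoding
        check=True,  # raise CalledProcessError if Tika exits non-zero
    )
    return result.stdout


def make_document(filename, text):
    """Build the document to store; the field names are hypothetical."""
    return {"filename": Path(filename).name, "text": text}


def store_text(filename, text, uri="mongodb://localhost:27017"):
    """Insert the extracted text into MongoDB (requires pymongo)."""
    # Imported here so the helpers above work without pymongo installed.
    from pymongo import MongoClient

    client = MongoClient(uri)
    # "docs" and "extracted" are placeholder database/collection names.
    result = client["docs"]["extracted"].insert_one(make_document(filename, text))
    return result.inserted_id
```

With those assumptions, store_text(path, extract_text("tika-app-1.2.jar", path)) would extract one file and insert it as a single document. Passing the command as a list rather than a shell string avoids quoting problems with file names that contain spaces.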

Depending on what you are trying to do, you may find mongo-connector useful. mongo-connector creates a pipeline from a MongoDB cluster to one or more target systems, such as Solr, Elasticsearch, or another MongoDB cluster. Apache Tika is what Solr/Lucene uses to do text extraction from rich-text document types.


Would you mind sharing what you are trying to do with Tika/MongoDB?

Regards,

Wan.
