java -jar tika-app-1.2.jar --text <filename>
Hi Ravish,
This command reads <filename> and prints its content as plain text. Are you trying to read documents from MongoDB and output them as plain text? Or are you trying to insert the content of a file into MongoDB as plain text?
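If it is the second case, a rough sketch in Python might look like the one below. It is only an illustration under several assumptions: tika-app-1.2.jar sits in the current directory, java is on the PATH, the third-party pymongo driver is installed, and a MongoDB server is listening on localhost:27017. The helper names and the database, collection, and field names ("test", "documents", "filename", "content") are made up for the example.

```python
import subprocess

TIKA_JAR = "tika-app-1.2.jar"  # assumption: the jar is in the current directory


def tika_command(filename, jar=TIKA_JAR):
    """Build the same command line as quoted above, as an argument list."""
    return ["java", "-jar", jar, "--text", filename]


def extract_text(filename, jar=TIKA_JAR):
    """Run Tika on the file and return whatever it prints to stdout."""
    result = subprocess.run(tika_command(filename, jar),
                            capture_output=True, check=True)
    # Assumption: the output is UTF-8; undecodable bytes are replaced.
    return result.stdout.decode("utf-8", errors="replace")


def make_document(filename, text):
    """Shape of the stored document (field names are hypothetical)."""
    return {"filename": filename, "content": text}


def store_text(filename, uri="mongodb://localhost:27017",
               db="test", coll="documents"):
    """Extract text with Tika and insert it into MongoDB as one document."""
    from pymongo import MongoClient  # third-party driver, imported lazily

    client = MongoClient(uri)
    doc = make_document(filename, extract_text(filename))
    return client[db][coll].insert_one(doc).inserted_id
```

Calling store_text("report.pdf") would then insert one document holding the extracted text and return its _id. Keep in mind that a single MongoDB document is limited to 16 MB, so very large files would need a different approach.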
Depending on what you are trying to do, you may find mongo-connector useful. mongo-connector creates a pipeline from a MongoDB cluster to one or more target systems, such as Solr, Elasticsearch, or another MongoDB cluster. Apache Tika is what Solr/Lucene uses to extract text from rich document types such as PDF or Word files.
Would you mind sharing what you are trying to do with Tika and MongoDB?
Regards,
Wan.