Hi All,
I currently exploring ravendb to see if its fits my needs. I am currently working on a project where I need to index a large amount of docs,xls,pdf etc. I want my users to not only query on the content of a document but also on its metadata. There for I use Tika to extract the metadata and add that information to my ravendb document. So I can ask raven to give me the top 10 most active authors for example. My test app seems to work fine, but I noticed that my database is kinda big. Which could be explained because I store the actual text (which I extracted from the document with Tika also) of the document as a property. So I can build a fulltext index. But there is no need for me to store this data inside my database, but I do want to offer my users the ability to do a fulltext search over the documents inside my document store. So what I want is index the content of the pdf for example but not save it inside the document store, is this possible?? if Yes, how can I achieve this, if No what the best alternative?
Tnx in advanced!