I am using SolrNet to extract contents from document Content in a dynamic field. My Solr Document(SolrDoc)
[SolrUniqueKey("id")]
public int Id { get; set; }
[SolrField("*")]
public Dictionary<string, object> DynamicFields { get; set; }
I can extract the contents and add them to the dynamic field
solrDoc.DynamicFields.Add("content", extractResponse.Content);
Problem occurs when document content length exceeds 35000. to extract the contents i do
ExtractResponse extractResponse =
solr.Extract(new ExtractParameters(fileStream, solrDoc.Id)
{
ExtractFormat = ExtractFormat.Text,
ExtractOnly = true,
AutoCommit = true,
StreamType = mimeType,
});
Anyway i can index complete document content using this dynamic field generation way. or any other better way to extract complete document contents?
I am using Solr 5.3.0
Thanks.
--
You received this message because you are subscribed to the Google Groups "SolrNet" group.
To unsubscribe from this group and stop receiving emails from it, send an email to solrnet+u...@googlegroups.com.
To post to this group, send email to sol...@googlegroups.com.
Visit this group at http://groups.google.com/group/solrnet.
For more options, visit https://groups.google.com/d/optout.
--
Caused by: java.lang.IllegalArgumentException: Document contains at least one immense term in field="content" (whose UTF8 encoding is longer than the max length 32766), all of which were skipped. Please correct the analyzer to not produce such terms. The prefix of the first immense term is: '[10, 10, 10, 10, 10, 10, 10, 10, 10, 10, 10, 10, 10, 10, 10, 10, 10, 10, 10, 10, 10, 10, 10, 10, 84, 104, 101, 32, 85, 110]...', original message: bytes can be at most 32766 in length; got 42576
Any work around this, using SolrNet?
--
--