Facetting performance / warming issue?

18 views

Skip to first unread message

Achim Domma

unread,

Apr 1, 2015, 4:57:59 PM4/1/15

to helio...@googlegroups.com

Hi,

I'm debugging a confusing performance problem. I'm running HS 0.07 on a single EC2 r3.xlarge instance. The core has about 25M documents and about 42GB on disc. Our text queries are rather simple, but we do some filtering and a lot of facetting. What I observe is the following:

- A simple query, with lots of facets and fq=some_field: some_id takes up to 20-30sec, if some_Id is used the first time.

- I run the query three times: First calls takes 20-30sec, next ones 100-300ms.

- I do this for multiple ids, iterating over a list of valid ids. New ids always show the above behavior.

- I stop the script and restart it. "Old" ids are fast. As soon as I hit the first new one, the first call is slow.

- I do this in parallel for different fields. Fields seem not to interfere and "fast" ids are always fast. According to web interface, SOLR is just using <10GB of available 30GB.

So far, this looks like a warming issue. But: I check the disc activity on the server by using iostat. I don't see any read activity on the machine!?

Can anybody explain what's might be going on here? From where to where is the data moved during the first call, if not from disc into memory?

Any hint would be very appreciated.

kind regards,

Achim

Otis Gospodnetic

unread,

Apr 17, 2015, 11:28:01 PM4/17/15

to helio...@googlegroups.com

Hi Achim,

If you like command line tools, use something like dstat with its various options to watch more than just IO. This may reveal something. Alternatively, try SPM for visual representation of Solr (it can monitor HS 0.07, too) performance + JVM and OS metrics. Oh, and maybe you can add some pointcuts for HS/Solr using https://sematext.atlassian.net/wiki/display/PUBSPM/Transactions+Tracing and find the bottleneck in HS/Solr that way, too.

Otis
--

Monitoring * Alerting * Anomaly Detection * Centralized Log Management

Solr & Elasticsearch Support * http://sematext.com/

--
You received this message because you are subscribed to the Google Groups "heliosearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to heliosearch...@googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Reply all

Reply to author

Forward

0 new messages