query.initial-hash-partitions
1. Is this property still being used ?
2. Can I set this property in the query session ?
3. How do we control partitioning/parallelism for different
parts of a query plan (like in Pig/Hive/MR) ?
4. Is the option of spill-to-disk available in 0.147 ?
-server
-Xmx12G
-XX:+UseG1GC
-XX:G1HeapRegionSize=32M
-XX:+UseGCOverheadLimit
-XX:+ExplicitGCInvokesConcurrent
-XX:+HeapDumpOnOutOfMemoryError
-XX:OnOutOfMemoryError=kill -9 %p
Please try making hash_partition_count=25 so that all workers participate.
If this doesn't help, try reducing the value of initial_splits_per_node by 2 (to display the current value please run "show session;").
Thanks,
Kamil
Thanks for your response Kamil,Here are my Presto worker JVM settings-server
-Xmx12G
-XX:+UseG1GC
-XX:G1HeapRegionSize=32M
-XX:+UseGCOverheadLimit
-XX:+ExplicitGCInvokesConcurrent
-XX:+HeapDumpOnOutOfMemoryError
-XX:OnOutOfMemoryError=kill -9 %pquery.max-memory-per-node = 6GNumber of nodes = 25 (workers) 1 (coordinator)So I tried setting the session property hash_partition_count=17and did 3 runs all of which failed with below stack trace.The query did make more progress than last time (hash_partition_count=8)but failed every time with this
I have seen Jetty throw this error when network bandwidth is saturated.
Any idea how to prevent this ?Should we try forcing local split scheduling ?Can node-scheduler.network-topology be set in a query session ?Thanks-Ankur
--
You received this message because you are subscribed to a topic in the Google Groups "Presto" group.
To unsubscribe from this topic, visit https://groups.google.com/d/topic/presto-users/7_2rPASbkdw/unsubscribe.
To unsubscribe from this group and all its topics, send an email to presto-users+unsubscribe@googlegroups.com.
For more options, visit https://groups.google.com/d/optout.
I did set the hash_partition_count=24 (no. of workers)
and reduced initial_splits_per_node by 6
the query no finishes successfully!
Thank you for your help!Much appreciated :-)The query performs poorly, though, takes13 minutes to process 44 GB of data.Let me read the teradata page on tuning to seewhat more I can find.Thanks again
-Ankur
To unsubscribe from this group and all its topics, send an email to presto-users...@googlegroups.com.
For more options, visit https://groups.google.com/d/optout.
Here is the query