Thanks, this clarify the parameter.
I still don't get a last point then.
My cassandra node is configured with concurrent_reads: 32
In my job I use 2 executors per node, with 4 cores each (8 cores used in a node).
Then I set concurrent.reads to 4. This means that in a specific moment in time I expect to have 32 reads coming from the connector.
In this configuration I am still able to see in Cassandra logs some timeouts:
INFO [ScheduledTasks:1] 2017-12-20 02:26:34,702 MessagingService.java:1035 - READ messages were dropped in last 5000 ms: 106 for internal timeout and 0 for cross node timeout. Mean internal dropped latency: 16172 ms and Mean cross-node dropped latency: 0 ms
INFO [ScheduledTasks:1] 2017-12-20 02:26:34,703 MessagingService.java:1035 - REQUEST_RESPONSE messages were dropped in last 5000 ms: 59 for internal timeout and 0 for cross node timeout. Mean internal dropped latency: 17166 ms and Mean cross-node dropped latency: 0 ms
INFO [ScheduledTasks:1] 2017-12-20 02:26:34,703 MessagingService.java:1035 - COUNTER_MUTATION messages were dropped in last 5000 ms: 9 for internal timeout and 0 for cross node timeout. Mean internal dropped latency: 15527 ms and Mean cross-node dropped latency: 0 ms
INFO [ScheduledTasks:1] 2017-12-20 02:26:34,703 StatusLogger.java:52 - Pool Name Active Pending Completed Blocked All Time Blocked
INFO [ScheduledTasks:1] 2017-12-20 02:26:34,710 StatusLogger.java:56 - MutationStage 0 0 12556449 0 0
INFO [ScheduledTasks:1] 2017-12-20 02:26:34,711 StatusLogger.java:56 - ViewMutationStage 0 0 0 0 0
INFO [ScheduledTasks:1] 2017-12-20 02:26:34,711 StatusLogger.java:56 - ReadStage 0 0 828656 0 0
INFO [ScheduledTasks:1] 2017-12-20 02:26:34,711 StatusLogger.java:56 - RequestResponseStage 0 0 318912944 0 0
INFO [ScheduledTasks:1] 2017-12-20 02:26:34,711 StatusLogger.java:56 - ReadRepairStage 0 0 1 0 0
INFO [ScheduledTasks:1] 2017-12-20 02:26:34,711 StatusLogger.java:56 - CounterMutationStage 0 0 205166223 0 0
INFO [ScheduledTasks:1] 2017-12-20 02:26:34,715 StatusLogger.java:56 - MiscStage 0 0 0 0 0
INFO [ScheduledTasks:1] 2017-12-20 02:26:34,715 StatusLogger.java:56 - CompactionExecutor 0 0 774043 0 0
INFO [ScheduledTasks:1] 2017-12-20 02:26:34,715 StatusLogger.java:56 - MemtableReclaimMemory 0 0 687 0 0
INFO [ScheduledTasks:1] 2017-12-20 02:26:34,716 StatusLogger.java:56 - PendingRangeCalculator 0 0 5 0 0
INFO [ScheduledTasks:1] 2017-12-20 02:26:34,716 StatusLogger.java:56 - GossipStage 0 0 2794567 0 0
INFO [ScheduledTasks:1] 2017-12-20 02:26:34,719 StatusLogger.java:56 - SecondaryIndexManagement 0 0 0 0 0
INFO [ScheduledTasks:1] 2017-12-20 02:26:34,719 StatusLogger.java:56 - HintsDispatcher 0 0 176 0 0
INFO [ScheduledTasks:1] 2017-12-20 02:26:34,720 StatusLogger.java:56 - MigrationStage 0 0 4558 0 0
INFO [ScheduledTasks:1] 2017-12-20 02:26:34,720 StatusLogger.java:56 - MemtablePostFlush 0 0 826 0 0
INFO [ScheduledTasks:1] 2017-12-20 02:26:34,720 StatusLogger.java:56 - PerDiskMemtableFlushWriter_0 0 0 687 0 0
INFO [ScheduledTasks:1] 2017-12-20 02:26:34,720 StatusLogger.java:56 - ValidationExecutor 0 0 0 0 0
INFO [ScheduledTasks:1] 2017-12-20 02:26:34,721 StatusLogger.java:56 - Sampler 0 0 0 0 0
INFO [ScheduledTasks:1] 2017-12-20 02:26:34,721 StatusLogger.java:56 - MemtableFlushWriter 0 0 687 0 0
INFO [ScheduledTasks:1] 2017-12-20 02:26:34,721 StatusLogger.java:56 - InternalResponseStage 0 0 71883 0 0
INFO [ScheduledTasks:1] 2017-12-20 02:26:34,721 StatusLogger.java:56 - AntiEntropyStage 0 0 0 0 0
INFO [ScheduledTasks:1] 2017-12-20 02:26:34,722 StatusLogger.java:56 - CacheCleanupExecutor 0 0 0 0 0
INFO [ScheduledTasks:1] 2017-12-20 02:26:34,722 StatusLogger.java:56 - Native-Transport-Requests 1 0 371637085 0 146675
INFO [ScheduledTasks:1] 2017-12-20 02:26:34,722 StatusLogger.java:66 - CompactionManager 0 0
INFO [ScheduledTasks:1] 2017-12-20 02:26:34,722 StatusLogger.java:78 - MessagingService n/a 0/0
INFO [ScheduledTasks:1] 2017-12-20 02:26:34,722 StatusLogger.java:88 - Cache Type Size Capacity KeysToSave
INFO [ScheduledTasks:1] 2017-12-20 02:26:34,722 StatusLogger.java:90 - KeyCache 104854608 104857600 all
INFO [ScheduledTasks:1] 2017-12-20 02:26:34,722 StatusLogger.java:96 - RowCache 0 0 all
INFO [ScheduledTasks:1] 2017-12-20 02:26:34,723 StatusLogger.java:103 - Table Memtable ops,data
INFO [ScheduledTasks:1] 2017-12-20 02:26:34,723 StatusLogger.java:106 - system_distributed.parent_repair_history 0,0
INFO [ScheduledTasks:1] 2017-12-20 02:26:34,723 StatusLogger.java:106 - system_distributed.repair_history 0,0
INFO [ScheduledTasks:1] 2017-12-20 02:26:34,723 StatusLogger.java:106 - system_distributed.view_build_status 0,0
INFO [ScheduledTasks:1] 2017-12-20 02:26:34,723 StatusLogger.java:106 - system.compaction_history 0,0
INFO [ScheduledTasks:1] 2017-12-20 02:26:34,723 StatusLogger.java:106 - system.hints 0,0
INFO [ScheduledTasks:1] 2017-12-20 02:26:34,723 StatusLogger.java:106 - system.schema_aggregates 0,0
INFO [ScheduledTasks:1] 2017-12-20 02:26:34,723 StatusLogger.java:106 - system.IndexInfo 0,0
INFO [ScheduledTasks:1] 2017-12-20 02:26:34,724 StatusLogger.java:106 - system.schema_columnfamilies 0,0
INFO [ScheduledTasks:1] 2017-12-20 02:26:34,724 StatusLogger.java:106 - system.schema_triggers 0,0
INFO [ScheduledTasks:1] 2017-12-20 02:26:34,724 StatusLogger.java:106 - system.size_estimates 76545,1026131
INFO [ScheduledTasks:1] 2017-12-20 02:26:34,724 StatusLogger.java:106 - system.schema_functions 0,0
INFO [ScheduledTasks:1] 2017-12-20 02:26:34,724 StatusLogger.java:106 - system.paxos 0,0
INFO [ScheduledTasks:1] 2017-12-20 02:26:34,724 StatusLogger.java:106 - system.views_builds_in_progress 0,0
INFO [ScheduledTasks:1] 2017-12-20 02:26:34,724 StatusLogger.java:106 - system.built_views 0,0
INFO [ScheduledTasks:1] 2017-12-20 02:26:34,724 StatusLogger.java:106 - system.peer_events 0,0
INFO [ScheduledTasks:1] 2017-12-20 02:26:34,724 StatusLogger.java:106 - system.range_xfers 0,0
INFO [ScheduledTasks:1] 2017-12-20 02:26:34,724 StatusLogger.java:106 - system.peers 0,0
INFO [ScheduledTasks:1] 2017-12-20 02:26:34,725 StatusLogger.java:106 - system.batches 0,0
INFO [ScheduledTasks:1] 2017-12-20 02:26:34,725 StatusLogger.java:106 - system.schema_keyspaces 0,0
INFO [ScheduledTasks:1] 2017-12-20 02:26:34,725 StatusLogger.java:106 - system.schema_usertypes 0,0
INFO [ScheduledTasks:1] 2017-12-20 02:26:34,725 StatusLogger.java:106 - system.local 0,0
INFO [ScheduledTasks:1] 2017-12-20 02:26:34,725 StatusLogger.java:106 - system.sstable_activity 4400,37740
INFO [ScheduledTasks:1] 2017-12-20 02:26:34,725 StatusLogger.java:106 - system.available_ranges 0,0
INFO [ScheduledTasks:1] 2017-12-20 02:26:34,739 StatusLogger.java:106 - system.batchlog 0,0
INFO [ScheduledTasks:1] 2017-12-20 02:26:34,739 StatusLogger.java:106 - system.schema_columns 0,0
INFO [ScheduledTasks:1] 2017-12-20 02:26:34,739 StatusLogger.java:106 - system_schema.columns 0,0
INFO [ScheduledTasks:1] 2017-12-20 02:26:34,740 StatusLogger.java:106 - system_schema.types 0,0
INFO [ScheduledTasks:1] 2017-12-20 02:26:34,740 StatusLogger.java:106 - system_schema.indexes 0,0
INFO [ScheduledTasks:1] 2017-12-20 02:26:34,740 StatusLogger.java:106 - system_schema.keyspaces 0,0
INFO [ScheduledTasks:1] 2017-12-20 02:26:34,740 StatusLogger.java:106 - system_schema.dropped_columns 0,0
INFO [ScheduledTasks:1] 2017-12-20 02:26:34,740 StatusLogger.java:106 - system_schema.aggregates 0,0
INFO [ScheduledTasks:1] 2017-12-20 02:26:34,740 StatusLogger.java:106 - system_schema.triggers 0,0
INFO [ScheduledTasks:1] 2017-12-20 02:26:34,740 StatusLogger.java:106 - system_schema.tables 0,0
INFO [ScheduledTasks:1] 2017-12-20 02:26:34,740 StatusLogger.java:106 - system_schema.views 0,0
INFO [ScheduledTasks:1] 2017-12-20 02:26:34,740 StatusLogger.java:106 - system_schema.functions 0,0
[my keyspace table here]
INFO [ScheduledTasks:1] 2017-12-20 02:26:34,750 StatusLogger.java:106 - system_auth.roles 0,0
INFO [ScheduledTasks:1] 2017-12-20 02:26:34,750 StatusLogger.java:106 - system_auth.role_members 0,0
INFO [ScheduledTasks:1] 2017-12-20 02:26:34,750 StatusLogger.java:106 - system_auth.resource_role_permissons_index 0,0
INFO [ScheduledTasks:1] 2017-12-20 02:26:34,750 StatusLogger.java:106 - system_auth.role_permissions 0,0
INFO [ScheduledTasks:1] 2017-12-20 02:26:34,750 StatusLogger.java:106 - system_traces.sessions 0,0
INFO [ScheduledTasks:1] 2017-12-20 02:26:34,750 StatusLogger.java:106 - system_traces.events 0,0
WARN [ScheduledTasks:1] 2017-12-20 02:26:34,751 MonitoringTask.java:150 - 23 operations timed out in the last 19979 msecs, operation list available at debug log level
DEBUG [ScheduledTasks:1] 2017-12-20 02:26:34,751 MonitoringTask.java:155 - 23 operations timed out in the last 19979 msecs:
SELECT * FROM keyspace.table WHERE ts = 2017-11-30 03:34+1100 LIMIT 5000: total time 15127 msec - timeout 5000 msec
SELECT * FROM keyspace.table WHERE ts = 2017-12-05 03:14+1100 LIMIT 5000: total time 15147 msec - timeout 5000 msec
SELECT * FROM keyspace.table WHERE ts = 2017-10-22 03:53+1100 LIMIT 5000: total time 15128 msec - timeout 5000 msec
SELECT * FROM keyspace.table WHERE ts = 2017-12-10 03:12+1100 LIMIT 5000: total time 15136 msec - timeout 5000 msec
SELECT * FROM keyspace.table WHERE ts = 2017-12-05 03:02+1100 LIMIT 5000: total time 15137 msec - timeout 5000 msec
SELECT * FROM keyspace.table WHERE ts = 2017-11-30 03:04+1100 LIMIT 5000: total time 15147 msec - timeout 5000 msec
SELECT * FROM keyspace.table WHERE ts = 2017-11-03 03:53+1100 LIMIT 5000: total time 15157 msec - timeout 5000 msec
SELECT * FROM keyspace.table WHERE ts = 2017-10-01 03:14+1100 LIMIT 5000: total time 15147 msec - timeout 5000 msec
SELECT * FROM keyspace.table WHERE ts = 2017-10-19 03:23+1100 LIMIT 5000: total time 15127 msec - timeout 5000 msec
SELECT * FROM keyspace.table WHERE ts = 2017-11-03 03:45+1100 LIMIT 5000: total time 15125 msec - timeout 5000 msec
SELECT * FROM keyspace.table WHERE ts = 2017-10-05 03:58+1100 LIMIT 5000: total time 15130 msec - timeout 5000 msec
SELECT * FROM keyspace.table WHERE ts = 2017-11-23 03:54+1100 LIMIT 5000: total time 15146 msec - timeout 5000 msec
SELECT * FROM keyspace.table WHERE ts = 2017-11-04 03:21+1100 LIMIT 5000: total time 15134 msec - timeout 5000 msec
SELECT * FROM keyspace.table WHERE ts = 2017-10-08 03:59+1100 LIMIT 5000: total time 15158 msec - timeout 5000 msec
SELECT * FROM keyspace.table WHERE ts = 2017-10-28 03:59+1100 LIMIT 5000: total time 15124 msec - timeout 5000 msec
SELECT * FROM keyspace.table WHERE ts = 2017-09-28 02:47+1000 LIMIT 5000: total time 15142 msec - timeout 5000 msec
SELECT * FROM keyspace.table WHERE ts = 2017-11-04 03:02+1100 LIMIT 5000: total time 15125 msec - timeout 5000 msec
SELECT * FROM keyspace.table WHERE ts = 2017-10-28 03:43+1100 LIMIT 5000: total time 15153 msec - timeout 5000 msec
SELECT * FROM keyspace.table WHERE ts = 2017-12-17 03:31+1100 LIMIT 5000: total time 15172 msec - timeout 5000 msec
SELECT * FROM keyspace.table WHERE ts = 2017-10-13 03:31+1100 LIMIT 5000: total time 15127 msec - timeout 5000 msec
SELECT * FROM keyspace.table WHERE ts = 2017-12-17 03:31+1100 LIMIT 5000: total time 15172 msec - timeout 5000 msec
SELECT * FROM keyspace.table WHERE ts = 2017-10-13 03:31+1100 LIMIT 5000: total time 15127 msec - timeout 5000 msec
SELECT * FROM keyspace.table WHERE ts = 2017-12-17 03:40+1100 LIMIT 5000: total time 15128 msec - timeout 5000 msec
SELECT * FROM keyspace.table WHERE ts = 2017-12-09 03:32+1100 LIMIT 5000: total time 15129 msec - timeout 5000 msec
SELECT * FROM keyspace.table WHERE ts = 2017-10-12 03:59+1100 LIMIT 5000: total time 15172 msec - timeout 5000 msec
(each partition, i.e. a minute, is 422KB on average)
The timeout we see here of 5000msec is the read_request_timeout_in_ms: 5000 in cassandra.yaml.
In my job I am using spark.cassandra.read.timeout_ms: 240000 (2min)
1) What could it be the reason for such timeouts (15 seconds!), considering that I am doing a number of parallel reads that matches exactly the cassandra configuration?
2) What is the meaning of spark.cassandra.read.timeout_ms? Is it somehow related to read_request_timeout_in_ms in cassandra config?
Thank you,
Davide