Hi,
We have the same problem.
We added one more replica to a shard, and it is trying to copy 8 TB of data to another DC.
On the new replica there are a lot of "Cannot read from istream" errors and timeout errors.
Are there any settings that we can use to increase these timeouts?
Or should we just copy the data manually and then attach the tables on the new replica instead of creating them?
2017.03.02 18:24:32.035899 [ 37 ] <Error> DB::StorageReplicatedMergeTree::queueTask()::<lambda(DB::StorageReplicatedMergeTree::LogEntryPtr&)>: Poco::Exception. Code: 1000, e.code() = 11, e.displayText() = Timeout, e.what() = Timeout
2017.03.02 18:24:32.351434 [ 40 ] <Error> DB::StorageReplicatedMergeTree::queueTask()::<lambda(DB::StorageReplicatedMergeTree::LogEntryPtr&)>: Poco::Exception. Code: 1000, e.code() = 11, e.displayText() = Timeout, e.what() = Timeout
2017.03.02 18:24:33.091490 [ 38 ] <Error> DB::StorageReplicatedMergeTree::queueTask()::<lambda(DB::StorageReplicatedMergeTree::LogEntryPtr&)>: Poco::Exception. Code: 1000, e.code() = 11, e.displayText() = Timeout, e.what() = Timeout
2017.03.02 18:24:33.711482 [ 40 ] <Error> DB::StorageReplicatedMergeTree::queueTask()::<lambda(DB::StorageReplicatedMergeTree::LogEntryPtr&)>: Poco::Exception. Code: 1000, e.code() = 11, e.displayText() = Timeout, e.what() = Timeout
2017.03.02 18:24:34.243378 [ 38 ] <Error> DB::StorageReplicatedMergeTree::queueTask()::<lambda(DB::StorageReplicatedMergeTree::LogEntryPtr&)>: Poco::Exception. Code: 1000, e.code() = 11, e.displayText() = Timeout, e.what() = Timeout
2017.03.02 18:24:36.565250 [ 46 ] <Error> DB::StorageReplicatedMergeTree::queueTask()::<lambda(DB::StorageReplicatedMergeTree::LogEntryPtr&)>: Code: 23, e.displayText() = DB::Exception: Cannot read from istream, e.what() = DB::Exception, Stack trace:
0. clickhouse-server(StackTrace::StackTrace()+0x16) [0x1217286]
1. clickhouse-server(DB::Exception::Exception(std::string const&, int)+0x1f) [0xf7c44f]
2. clickhouse-server(DB::ReadBufferFromIStream::nextImpl()+0x97) [0xf85357]
3. clickhouse-server(DB::ReadBufferFromHTTP::nextImpl()+0x26) [0x1e35236]
4. clickhouse-server(DB::DataPartsExchange::Fetcher::fetchPartImpl(std::string const&, std::string const&, std::string const&, int, std::string const&, bool)+0x196e) [0x139698e]
5. clickhouse-server(DB::DataPartsExchange::Fetcher::fetchPart(std::string const&, std::string const&, std::string const&, int, bool)+0x61) [0x1397451]
6. clickhouse-server(DB::StorageReplicatedMergeTree::fetchPart(std::string const&, std::string const&, bool, unsigned long)+0x1f7) [0x12d3a07]
7. clickhouse-server(DB::StorageReplicatedMergeTree::executeLogEntry(DB::ReplicatedMergeTreeLogEntry const&)+0x7d0) [0x12d4d60]
8. clickhouse-server() [0x12d80ce]
9. clickhouse-server(DB::ReplicatedMergeTreeQueue::processEntry(std::function<std::shared_ptr<zkutil::ZooKeeper> ()>, std::shared_ptr<DB::ReplicatedMergeTreeLogEntry>&, std::function<bool (std::shared_ptr<DB::ReplicatedMergeTreeLogEntry>&)>)+0x3b) [0x137514b]
10. clickhouse-server(DB::StorageReplicatedMergeTree::queueTask()+0x148) [0x12b71e8]
11. clickhouse-server(DB::BackgroundProcessingPool::threadFunction()+0x3cc) [0x1305c1c]
12. clickhouse-server() [0x31b48ef]
13. /lib/x86_64-linux-gnu/libpthread.so.0(+0x8064) [0x7fba121d2064]
14. /lib/x86_64-linux-gnu/libc.so.6(clone+0x6d) [0x7fba117fa62d]