TPC-H test

498 views
Skip to first unread message

Radek Tomšej

unread,
Dec 1, 2016, 12:06:50 PM12/1/16
to ClickHouse
Hi,

thank you for this awesome database. Currently I am trying to benchmark it to Oracle and some other databases. I want to use TPC-H test. Unfortunately TPC-H can generate only csv delimited by | (pipe) and CSV format in ClickHouse only supports , (comma). Is there any easy solution how to import data for TPC-H and make data loading comparable to other solutions?

Thank you!

man...@gmail.com

unread,
Dec 5, 2016, 8:30:07 PM12/5/16
to ClickHouse
Hi.

You could try one of the following solutions:
1. Edit source code of TPC-H generator programs, replace pipe with comma and recompile them.
2. Generate data and then prepare it, changing pipe to comma using tr/sed/perl.

BTW, I have not yet tried to load TPC-H data.

va...@percona.com

unread,
Mar 20, 2017, 10:36:51 AM3/20/17
to ClickHouse
Hi,

Speaking about TPC-H, what table structure and table engine is recommended for running this benchmark in ClichHouse.
there is often no Date column available, so MergeTree can't be used.

Any advises?

man...@gmail.com

unread,
Mar 27, 2017, 1:59:25 AM3/27/17
to ClickHouse
You could use MergeTree table engine even if your data doesn't have date column.

Just declare additional dummy column as
date Date DEFAULT '2000-01-01'.

This dummy date column will not affect performance of queries.

Usually, MergeTree engine is the most efficient.
Also you could try Log engine - simple table engine without support for indexes, MVCC and so on.
Reply all
Reply to author
Forward
0 new messages