
TPC-DS Independent Benchmark Results

Kinetica continues to deliver impressive results in recent TPC-DS studies of leading analytical databases conducted by Radiant Advisors

What do these results show us?

What is TPC-DS?

TPC-DS is a sophisticated and comprehensive benchmark for SQL analytics databases, and an industry-standard benchmark for general-purpose decision support systems. It is a well-tested, proven suite for data warehouses and analytical systems.

The full suite of TPC-DS SQL queries can be found on GitHub.

Many of the queries are long, with complex joins, aggregations and group-bys. Below is one of the simpler queries, which averages quantity, list price, coupon amount and sales price per item to find top-performing products and sale offers for college-educated women.

 


/* Example Query - sql_07.sql */
select top 100 i_item_id, 
        avg(ss_quantity) agg1,
        avg(ss_list_price) agg2,
        avg(ss_coupon_amt) agg3,
        avg(ss_sales_price) agg4 
 from store_sales, customer_demographics, date_dim, item, promotion
 where ss_sold_date_sk = d_date_sk and
       ss_item_sk = i_item_sk and
       ss_cdemo_sk = cd_demo_sk and
       ss_promo_sk = p_promo_sk and
       cd_gender = 'F' and 
       cd_marital_status = 'W' and
       cd_education_status = 'College' and
       (p_channel_email = 'N' or p_channel_event = 'N') and
       d_year = 2001 
 group by i_item_id
 order by i_item_id;
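To make it concrete what this query computes, here is a minimal sketch that runs it against a few hand-made rows in an in-memory SQLite database. The table contents are hypothetical and only carry the columns the query touches, and `top 100` is swapped for SQLite's `limit 100`; none of this reflects how the benchmark itself was run.

```python
import sqlite3

# Tiny, hypothetical stand-ins for the five TPC-DS tables used by the query.
# Only the referenced columns are created; real TPC-DS tables are much wider.
conn = sqlite3.connect(":memory:")
conn.executescript("""
create table item (i_item_sk integer, i_item_id text);
create table date_dim (d_date_sk integer, d_year integer);
create table customer_demographics (
    cd_demo_sk integer, cd_gender text,
    cd_marital_status text, cd_education_status text);
create table promotion (
    p_promo_sk integer, p_channel_email text, p_channel_event text);
create table store_sales (
    ss_sold_date_sk integer, ss_item_sk integer, ss_cdemo_sk integer,
    ss_promo_sk integer, ss_quantity integer, ss_list_price real,
    ss_coupon_amt real, ss_sales_price real);

insert into item values (1, 'ITEM_A'), (2, 'ITEM_B');
insert into date_dim values (10, 2001), (11, 2002);
insert into customer_demographics values
    (100, 'F', 'W', 'College'), (101, 'M', 'S', 'Primary');
insert into promotion values (7, 'N', 'Y');
insert into store_sales values
    (10, 1, 100, 7, 2, 10.0, 0.0,  9.0),   -- matches every filter
    (10, 1, 100, 7, 4, 20.0, 2.0, 15.0),   -- matches every filter
    (10, 2, 101, 7, 5, 50.0, 0.0, 45.0),   -- filtered out by demographics
    (11, 1, 100, 7, 9, 90.0, 0.0, 80.0);   -- filtered out by d_year
""")

# Same joins, filters and aggregates as the example query above.
rows = conn.execute("""
select i_item_id,
       avg(ss_quantity) agg1,
       avg(ss_list_price) agg2,
       avg(ss_coupon_amt) agg3,
       avg(ss_sales_price) agg4
  from store_sales, customer_demographics, date_dim, item, promotion
 where ss_sold_date_sk = d_date_sk and
       ss_item_sk = i_item_sk and
       ss_cdemo_sk = cd_demo_sk and
       ss_promo_sk = p_promo_sk and
       cd_gender = 'F' and
       cd_marital_status = 'W' and
       cd_education_status = 'College' and
       (p_channel_email = 'N' or p_channel_event = 'N') and
       d_year = 2001
 group by i_item_id
 order by i_item_id
 limit 100
""").fetchall()

print(rows)
```

Only the two sales of `ITEM_A` survive the filters, so the result is a single row holding the averages of those two sales.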

How were these tests run?

Radiant Advisors performed TPC-DS benchmarks on several high-performance analytics databases in late 2022.

Tests were run with 200GB (SF200) of sample data comprising 24 tables in a snowflake schema. The tables store web, catalog and store sales from an imaginary retailer. The largest fact table had well over a billion rows.

Benchmarking was done using a consistent distributed hardware configuration of four Azure virtual machines: E48s v4 (48 vCPU, 384 GB RAM) with 2TB premium SSD, or equivalent setup.

More details are available in this TPC-DS Summary from Radiant Advisors.

Kinetica much faster than other databases overall

Using similar hardware, Kinetica beats other high-performance databases overall.

Each database is compared with Kinetica using the geometric mean (geomean) over the queries that database was able to run. For example, ClickHouse was 13x slower overall on the 8 queries it completed successfully.
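As a rough sketch of how such a geomean comparison can be computed, the snippet below takes per-query runtimes for a baseline and another database and averages the runtime ratios geometrically over the queries the other database completed. The function names and timings are hypothetical, chosen for illustration; they are not figures from the Radiant Advisors study.

```python
import math


def geomean(values):
    """Geometric mean of positive numbers, computed in log space."""
    values = list(values)
    if not values:
        raise ValueError("geomean needs at least one value")
    if any(v <= 0 for v in values):
        raise ValueError("geomean is only defined for positive values")
    return math.exp(sum(math.log(v) for v in values) / len(values))


def relative_slowdown(baseline_times, other_times):
    """Geomean of other/baseline runtime ratios over queries `other` completed.

    Both arguments map query name -> runtime in seconds. A query the other
    database could not run is either missing or mapped to None, and is skipped.
    """
    ratios = [
        other_times[q] / baseline_times[q]
        for q in baseline_times
        if other_times.get(q) is not None
    ]
    return geomean(ratios), len(ratios)


# Hypothetical timings in seconds, for illustration only.
baseline = {"q1": 1.0, "q2": 2.0, "q3": 4.0}
other = {"q1": 2.0, "q2": 16.0, "q3": None}  # q3 failed to complete

slowdown, completed = relative_slowdown(baseline, other)
print(f"{slowdown:.1f}x slower on {completed} completed queries")
```

With these made-up numbers the ratios are 2 and 8, so the geomean is 4. Unlike an arithmetic mean, the geomean keeps one very slow query from dominating the summary, and on a shared query set the geomean of the ratios equals the ratio of the two geomeans.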


Kinetica is 5x faster than BigQuery

Kinetica is substantially faster than BigQuery over most queries

Overall Query Results

Kinetica's vectorized join algorithms surpass other high-performance databases on most queries

View the Individual TPC-DS Queries Here

What is notable from these results?

Vectorized architecture delivers performance

Kinetica's unique native vectorized join engine processes chunks of data in parallel, rather than working through the data sequentially, one row at a time. This delivers very quick results, particularly for ad-hoc analysis of complex data.
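The difference between row-at-a-time and chunk-at-a-time processing can be sketched in a few lines. This is a conceptual illustration only, assuming a simple key-matching join over two columns: it is not Kinetica's engine, it does not use SIMD instructions, and pure-Python threads will not show a real speedup. All names here are hypothetical.

```python
from concurrent.futures import ThreadPoolExecutor


def row_at_a_time_join(fact_keys, fact_values, dim_keys):
    """Walk the fact table one row at a time, summing values whose key matches."""
    total = 0
    for key, value in zip(fact_keys, fact_values):
        if key in dim_keys:
            total += value
    return total


def join_chunk(keys, values, dim_keys):
    """Process one contiguous chunk of the two columns as a unit."""
    return sum(v for k, v in zip(keys, values) if k in dim_keys)


def chunked_join(fact_keys, fact_values, dim_keys, chunk_size=1024, workers=4):
    """Split the columns into chunks, process chunks concurrently, then combine."""
    starts = range(0, len(fact_keys), chunk_size)
    with ThreadPoolExecutor(max_workers=workers) as pool:
        partials = pool.map(
            lambda s: join_chunk(
                fact_keys[s:s + chunk_size],
                fact_values[s:s + chunk_size],
                dim_keys,
            ),
            starts,
        )
        return sum(partials)


# Hypothetical columnar data: a fact table of 10,000 rows and a small dimension.
fact_keys = [i % 10 for i in range(10_000)]
fact_values = list(range(10_000))
dim_keys = {0, 1, 2}

sequential_total = row_at_a_time_join(fact_keys, fact_values, dim_keys)
chunked_total = chunked_join(fact_keys, fact_values, dim_keys)
print(sequential_total == chunked_total)
```

Both paths produce the same total. The point is that the chunked version exposes independent units of work, which a vectorized engine can hand to many cores or SIMD lanes at once.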

Mature database capable of running complex SQL

Kinetica has been in development for over a decade. It is mature, battle-tested and able to reliably parse and deliver results on complex SQL queries. Kinetica is typically able to run all 99 queries of the TPC-DS harness.

Geospatial and time-series capabilities also included

Kinetica's fast, real-time capabilities extend to its versatile suite of functionality for time-series and geospatial analysis, as well as graph query modes and easy-to-use notebook tools for building multi-step queries.

Try Kinetica Yourself

In the cloud, or with your own hardware

The best way to appreciate the possibilities that Kinetica brings to large-scale geospatial analytics is to see it in action. You can try Kinetica with datasets up to 10GB for free on Kinetica Cloud.

Or, contact us, and we can help you get started using it at scale with your own data, your own schemas and your own queries.