TPC-DS Independent Benchmark Results
Kinetica continues to deliver impressive results in recent TPC-DS studies of leading analytical databases conducted by Radiant Advisors
What do these results show us?
What is TPC-DS?
TPC-DS is a sophisticated and comprehensive benchmark for SQL analytics databases, and an industry standard benchmark for general purpose decision support systems. It is a well tested and proven test suite for data warehouses and analytical systems.
The full suite of TPC-DS SQL queries can be found on Github.
Many of the queries are long with complex joins, aggregations and group-bys. To the right you can see one of the simpler queries to calculate top performing products and sale offers for college educated women.
/* Example Query - sql_07.sql */ select top 100 i_item_id, avg(ss_quantity) agg1, avg(ss_list_price) agg2, avg(ss_coupon_amt) agg3, avg(ss_sales_price) agg4 from store_sales, customer_demographics, date_dim, item, promotion where ss_sold_date_sk = d_date_sk and ss_item_sk = i_item_sk and ss_cdemo_sk = cd_demo_sk and ss_promo_sk = p_promo_sk and cd_gender = 'F' and cd_marital_status = 'W' and cd_education_status = 'College' and (p_channel_email = 'N' or p_channel_event = 'N') and d_year = 2001 group by i_item_id order by i_item_id;
Kinetica much faster than other databases overall
Comparisons are made to Kinetica from a geomean of the queries the database was able to run. So Clickhouse was
13x slower overall on the 8 queries it was able to successfully complete.
Overall Query Results
What is notable from these results?
Vectorized architecture delivers performance
Kinetica's unique native vectorized join engine is able to process chunks of data in parallel, rather than sequentially working through rows of data line by line. This delivers very quick results, particularly with complex ad-hoc analysis of complex data.
Mature database capable of running complex SQL
Kinetica has been in development for over a decade. It is mature and battle-tested and able to reliably parse and deliver results on complex SQL queries. Kinetica is typically able to run all 99 queries of the TPC-DC harness.
Geospatial and time-series capabilities also included
Kinetica fast, real-time capabilities extend out to it's versatile suite of functionality for time-series and geospatial analysis, as well as graph query modes, and easy to use notebook tools for building multi-step queries.
Try Kinetica Yourself
The best way to appreciate the possibilities that Kinetica brings to large-scale geospatial analytics is to see it in action. You can try Kinetica with datasets up to 10GB for free on Kinetica Cloud
Or, contact us, and we can help you get started using it at scale with your own data, your own schemas and your own queries.