Overview and Prospects of using Integer Surrogate Keys for Data Warehouse Performance Optimization

Sviatoslav Stumpf and Vladislav Povyshev, ITMO University, RUSSIA; Sviatoslav Stumpf and Vladislav Povyshev, ITMO University, RUSSIA

Overview and Prospects of using Integer Surrogate Keys for Data Warehouse Performance Optimization

Authors

Sviatoslav Stumpf and Vladislav Povyshev, ITMO University, RUSSIA

Abstract

The aim of this paper is to examine and demonstrate how integer-based datetime labels (integer surrogate keys for time) can optimize data-warehouse and time-series performance, proposing practical formats and algorithms and validating their efficiency on real-world workloads. It is shown that replacing standard DATE and TIMESTAMP types with 32- and 64-bit integer formats reduces storage requirements by 30–60% and speeds up query execution by 25–40%. The paper presents indexing, aggregation, compression, and batching algorithms demonstrating up to an eightfold increase in throughput. Practical examples from finance, telecommunications, IoT, and scientific research confirm the efficiency and versatility of the proposed approach.

Keywords

integer labels, time series, optimization, performance, data warehouse, indexing, aggregation

CS&IT Conference Proceedings

Overview and Prospects of using Integer Surrogate Keys for Data Warehouse Performance Optimization