DimensionSlice: A main-memory data layout for fast scans of multidimensional data
- Authors
- Suh, Ilhyun; Chung, Yon Dohn
- Issue Date
- 12월-2020
- Publisher
- PERGAMON-ELSEVIER SCIENCE LTD
- Keywords
- Multidimensional data; Data layout; Main-memory processing; SIMD
- Citation
- INFORMATION SYSTEMS, v.94
- Indexed
- SCIE
SCOPUS
- Journal Title
- INFORMATION SYSTEMS
- Volume
- 94
- URI
- https://scholar.korea.ac.kr/handle/2021.sw.korea/51342
- DOI
- 10.1016/j.is.2020.101602
- ISSN
- 0306-4379
- Abstract
- Multidimensional data are exploited in many application areas such as scientific data analysis, business intelligence, and geographic information systems. One of the most frequent operations applied to such multidimensional data is the selection of a subspace of the given multidimensional space, which involves predicate evaluation on multiple dimensions. Existing main-memory data layouts optimized for evaluating predicates on the columnar data can be used to accelerate the subspace extraction by sequentially performing filter scans on each dimension one at a time. However, optimization opportunities emerge if we can consider all predicates together. In this paper, we propose DimensionSlice, a new main-memory data layout optimized for evaluating predicates on multiple dimensions. More specifically, the dimension values are sliced into portions and the portions with the same order of each dimension are arranged together. Multiple predicates are simultaneously evaluated with the sliced dimension values during the scan. In addition, by storing the different portions separately, unnecessary loads and computations of lower portions can be eliminated if the evaluation results are assured after examining the upper portions. For further acceleration of scans, the DimensionSlice layout is designed to easily leverage the SIMD capabilities that most mainstream processors are equipped with. Through experiments, we demonstrate the performance gains of the proposed method over the columnar mainmemory layout that evaluates the partial predicates one dimension at a time. We also show that the proposed method outperforms the state-of-the-art multidimensional index structure when the selectivity is over a very low threshold. (C) 2020 Elsevier Ltd. All rights reserved.
- Files in This Item
- There are no files associated with this item.
- Appears in
Collections - Graduate School > Department of Computer Science and Engineering > 1. Journal Articles
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.