Contributor
phoeinx

Optimize GROUP BY in MariaDB ColumnStore


Mentors
drrtuy
Organization
MariaDB
Technologies
mysql, c++, simd
Topics
databases, algorithms, Aggregation
This project optimized the GROUP BY operator in MariaDB ColumnStore for workloads requiring disk-based aggregation. To achieve this a) unnecessary I/O operations in the aggregation algorithm were reduced, b) bugs in the previous implementation of disk-based aggregation were fixed and c) vectorized access to the internal hash map used for aggregation was tested. Additionally, Theresa fixed other bugs and improved documentation and the development setup, especially for new developers.