DIMES25-AccessEvaluation

To stride or not to stride the memory access?

📄 Abstract

Due to the increasing gap between CPU performance and memory bandwidth, memory access patterns play an increasingly significant role in efficient data processing. The current core assumption is that a sequential access pattern delivers the best performance, especially when the data to be processed is stored in adjacent memory locations (contiguous memory). Given the continuous advances in memory technologies, it is questionable whether this assumption still holds. To answer this question, we present in this paper a comprehensive experimental comparison of the sequential and the strided access pattern for data stored in contiguous memory on modern disruptive memory systems. As we are going to show, the core assumption must be revised, as the strided access pattern with a well-chosen stride size clearly outperforms the sequential access pattern. Even a SIMD-accelerated sequential access is considerably slower than the best-performing scalar strided access. In particular, we explain the differences, highlight further advantages, and present open challenges of the strided access pattern on disruptive memory systems.
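To make the two access patterns concrete, the following is a minimal sketch, not the code used in the paper. It assumes that "strided" means jumping a fixed number of elements between consecutive accesses and restarting at the next offset until every element has been read once. The function names sum_sequential and sum_strided are hypothetical and chosen only for illustration.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Sequential pattern: visit elements 0, 1, 2, ... in address order.
std::uint64_t sum_sequential(const std::vector<std::uint32_t>& data) {
    std::uint64_t sum = 0;
    for (std::size_t i = 0; i < data.size(); ++i) {
        sum += data[i];
    }
    return sum;
}

// Strided pattern (assumed definition, see lead-in): jump `stride` elements
// between consecutive accesses, then restart from the next offset so that
// every element is visited exactly once.
std::uint64_t sum_strided(const std::vector<std::uint32_t>& data,
                          std::size_t stride) {
    assert(stride > 0 && "stride must be positive");
    std::uint64_t sum = 0;
    for (std::size_t offset = 0; offset < stride; ++offset) {
        for (std::size_t i = offset; i < data.size(); i += stride) {
            sum += data[i];
        }
    }
    return sum;
}
```

Both functions read the same contiguous buffer and return the same result; only the order of the memory accesses differs, which is the variable the experiments compare.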

ℹ️ Overview

This project contains:

Source code for all experiments presented in the paper
R code to generate plots like those in the paper

⚙️ Benchmarking

To build the benchmark suite, use:

cmake . 
make

This will generate the following benchmarks:

Executable Name	Description	Figure in Paper
experiment_fixed_data_size	Given a data amount, this benchmark systematically samples different stride and partition counts.	1, 2, 3
experiment_fixed_stride_size	Given a stride size, this benchmark tests partition counts in the range of 2 to 50.	5a
experiment_fixed_partition_count	Given a partition count and a data range, this benchmark samples different stride sizes in the given range.	5b, 5c
experiment_multithreading	Given a data amount, this benchmark evaluates the MIMD execution of our strided access. For each thread count, partition counts from 1 to 72 are evaluated.	6
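As a rough illustration of how a partition count can translate into a stride, the sketch below splits the data into equally sized partitions and reads one element from each partition in turn, so the stride equals the partition length. This relation is an assumption made for the example; the benchmarks in this repository may define and measure it differently. The names sum_partitioned, sweep_partitions, and Sample are hypothetical.

```cpp
#include <cassert>
#include <chrono>
#include <cstddef>
#include <cstdint>
#include <vector>

// Assumed relation: n elements split into `partitions` equal chunks, read
// round-robin, give a stride of n / partitions elements.
std::uint64_t sum_partitioned(const std::vector<std::uint32_t>& data,
                              std::size_t partitions) {
    assert(partitions > 0 && "need at least one partition");
    const std::size_t n = data.size();
    const std::size_t stride = n / partitions;  // elements per partition
    std::uint64_t sum = 0;
    // Interleaved part: position j of partition 0, 1, ..., partitions - 1.
    for (std::size_t j = 0; j < stride; ++j) {
        for (std::size_t p = 0; p < partitions; ++p) {
            sum += data[p * stride + j];
        }
    }
    // Elements that do not fill a whole partition are read sequentially.
    for (std::size_t i = stride * partitions; i < n; ++i) {
        sum += data[i];
    }
    return sum;
}

// One measurement per partition count (wall-clock time, single run).
struct Sample {
    std::size_t partitions;
    double nanoseconds;
    std::uint64_t sum;
};

// Sweep partition counts in [first, last], e.g. 2 to 50.
std::vector<Sample> sweep_partitions(const std::vector<std::uint32_t>& data,
                                     std::size_t first, std::size_t last) {
    std::vector<Sample> samples;
    for (std::size_t p = first; p <= last; ++p) {
        const auto start = std::chrono::steady_clock::now();
        const std::uint64_t sum = sum_partitioned(data, p);
        const auto stop = std::chrono::steady_clock::now();
        const double ns =
            std::chrono::duration<double, std::nano>(stop - start).count();
        samples.push_back(Sample{p, ns, sum});
    }
    return samples;
}
```

A single wall-clock run per configuration is noisy; a real measurement would repeat each configuration and use a buffer much larger than the caches.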

To run the benchmarks, use sudo ./<benchmark_name>. It is advised to use numactl to pin the execution and the memory to a NUMA node. All available options for a benchmark can be listed with ./<benchmark_name> --help.

Additionally, the script/ directory includes multiple run_experiment_*.sh scripts that run the given benchmarks with a default configuration.

The benchmark results can be plotted using make plots, the R script files, or the plot_experiment_*.sh scripts found in script/. To use make plots or the plot_experiment_*.sh scripts, you first need to build the Docker container using ./script/build_docker.sh or make docker.

To use the R scripts directly, you need Rscript and the following R packages installed: magrittr, dplyr, ggplot2, data.table, collapse, latex2exp, tikzDevice

🧾 Third-Party Licenses

This project includes or uses code from the following third-party projects:

perf-cpp (version v0.11-dev) — LGPLv3+ https://github.com/jmuehlig/perf-cpp
page-info — MIT License https://github.com/travisdowns/page-info

A copy of the full license texts and attributions is included in the LICENSES/ folder.

🩺 Using a Modified perf-cpp Library

This project depends on the perf-cpp library (version v0.11-dev), licensed under the GNU Lesser General Public License, version 3 (LGPLv3). By default, CMake will automatically download and build perf-cpp from its official repository:

https://github.com/jmuehlig/perf-cpp

Rebuilding with a Modified perf-cpp

Under the LGPLv3, you have the right to replace perf-cpp with your own modified version and rebuild this program against it. To do so:

Fork or download perf-cpp and apply your modifications.
In your local checkout of this project, edit the CMake configuration so that it uses your modified perf-cpp source instead of fetching the official repository.
Reconfigure and rebuild this project with CMake so that it links against your modified perf-cpp library.

License Information

This repository includes and links against perf-cpp under the terms of the GNU Lesser General Public License, version 3 or later.
A copy of the LGPLv3 license is provided in this distribution (LICENSES/LGPL-3.0-LICENSE-perf-cpp.txt).

💭 Feedback

For feedback, questions or discussions please feel free to contact us.
