Graph Processing Basic Notes

Graph System

Classification	In-memory	Out-of-core
Single machine	Ligra	GraphChi
	Polymer	X-Stream
	Galois	GridGraph
		Mosaic
Distributed	Pregel	Chaos
	PowerGraph
	PowerLyra
	Gemini

NUMA Architecture

引入了 Node 和 Distance:

对于 CPU 和 Memory 这两种最宝贵的硬件资源, NUMA 用近乎严格的方式划分了所属的资源组 (Node), 而每个资源组内的 CPU 和 Memory 几乎相等
资源组的数量取决于物理 CPU 的个数
Distance 用来定义各个 Node 之间调用资源的开销, 为资源调度优化算法提供数据支持

每个进程(或线程)都会从父进程继承 NUMA 策略, 并分配有一个优先 node. 如果 NUMA 策略允许的话，进程可以调用其他 node 上的资源.

CPU Schedule

CPU Node Bind: 规定进程运行在某几个 node 之上
Physical CPU Bind: 精细地规定进程运行在哪些核上

Memory Schedule

Local Allocation: 从当前 Node 上请求分配内存
Preferred: 比较宽松地指定了一个推荐的 node 来获取内存, 如果被推荐的 node 上没有足够内存, 进程可以尝试别的 node
Memory Bind: 可以指定若干个 node,进程只能从这些指定的 node 上请求分配内存
Interleave: 规定进程从指定的若干个 node 上以 RR (Round Robin) 交织地请求分配内存

NUMA 默认的内存分配策略是优先在进程所在 CPU 的本地内存中分配, 会导致 CPU 节点之间内存分配不均衡. 当某个 CPU 节点的内存不足时, 会导致 swap 产生, 而不是从远程节点分配内存, 这就是 swap insanity 现象

Dataset

Tools

Concurrency Lib

TaskFlow

Perf Tools

FlameGraph

Hardware Performance Counter

GCC

strict-alias warnings

for strict-aliasing warnings:

use a union to represent the memory need to reinterpret
use a reinterpret_cast, cast via char * at the point where reinterpret the memory - char * are defined as being able to alias anything
use a type which has __attribute__((__may_alias__))
turn off the aliasing assumptions globally using -fno-strict-aliasing

Time Stamp Counter

definition

RDTSC

Clock Get Time

gcc *.c -o *.o ... -lrt # link with librt

#include <time.h>

struct timespec ts;
clock_gettime(CLOCK_MONOTONIC, ts);
printf("%d %d", ts.tv_sec, ts.tv_nsec);

NUMA Tool

grep -i numa /var/log/dmesg

NUMA Control

installation

sudo apt install -y numactl

usage

numactl --show
numactl --hardware
numactl --interleave=all

NUMA Status

numastat

Graph Processing Basic Notes

Graph System​

NUMA Architecture​

CPU Schedule​

Memory Schedule​

Dataset​

Tools​

Concurrency Lib​

Perf Tools​

Hardware Performance Counter​

Parallel Programming​

Other Libs​

DRAMSim2​

GCC​

strict-alias warnings​

Time Stamp Counter​

RDTSC​

Clock Get Time​

NUMA Tool​

NUMA Control​

installation​

usage​

NUMA Status​