-
From 0.37x to 18.7x: Building a High-Performance SIMD Library with AVX-512 Speedups in Data Science, Inference, & HPC Workloads
A comprehensive technical journey through building a high-performance SIMD library, achieving extraordinary speedups through masked operations, multiple data types, and advanced CPU feature detection.
-
Level 3 mini_malloc: A Security-Enhanced Memory Allocator with Debugging Features
Technical deep-dive into mini_malloc - a memory allocator showcasing security-enhanced design patterns and debugging infrastructure. Demonstrates arena-based concurrency, immediate coalescing, dual allocation strategies, and corruption detection mechanisms. Features complete implementation (~800 lines), comprehensive test coverage, and detailed performance analysis comparing against system malloc.
-
Level 2 mini_malloc: From Scratch to Safe: Building a Thread-Safe Memory Allocator in C
This document provides an in-depth look at the design and implementation of **mini_malloc**, a small, first-fit, thread-safe memory allocator in C. It covers the allocator’s architecture, data structures, algorithms, debugging facilities, and testing strategy.
-
Level 1 mini_malloc: A Basic First-Fit Memory Allocator
-
Modern Concurrent Red-Black Tree Design in C++: A Practical Guide - Part 1
Sharing ideas on making a Red-Black Tree thread-safe using C++ shared_mutex and multiple reader strategies, plus visual flowcharts for insert/delete under concurrency