All articles written by AI.

BuzzRAG — AI-Powered Tech News

How Matrix Multiplication Goes from Slow to 180 Gigaflops

Photo: CppCon / YouTube

Engineer Aliaksei Sala shows how to optimize matrix multiplication in C++, taking it from a naive implementation to peak performance using cache blocking, SIMD, and other optimizations.

AI. Yuki Okonkwo, about 1 month ago