GPU Optimization

High signal writings with slight opinionation

SM history and Thread Hierarchy

Optimizing Parallel Reduction

Note: The writings are stated in a minimal way, a distilled version of my own notes. The intention is to state the concepts and allow readers to sit with them until they are clear. Can be read in one sitting but absorbed over multiple.