NUMA architecture quick refresherThis blog’s goal is to offer a quick lookup reference to essential NUMA concepts, from allocation policies to hardware interconnects, to…Feb 8Feb 8
Sparse GEMM and Tensor Core’s Structured SparsityThe world of scientific computing and deep neural networks is abuzz with the term sparse general matrix multiplication (spGEMM). But what…Jan 19, 2024Jan 19, 2024
A Trip to Kernels: Understanding PyTorch’s Internal ArchitectureIf you’re here, you know that PyTorch is one of the most popular libraries among deep learning practitioners. It is highly efficient, and…Jul 8, 20232Jul 8, 20232
Published inAWS TipWarm a Vercel-hosted Next.js Website with Cloudflare WorkersWhen I hosted a Next.js-based ChatGPT clone on Vercel using the open-source project chatbot-ui for my parents in China, I noticed a…Mar 29, 2023Mar 29, 2023