Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to content
View drisspg's full-sized avatar

Block or report drisspg

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A tool for working with stacked PRs on github.

Python 259 7 Updated Aug 26, 2024

Helpful tools and examples for working with flex-attention

Python 363 15 Updated Aug 17, 2024

This repository contains the experimental PyTorch native float8 training UX

Python 212 20 Updated Aug 1, 2024

Deep inspection of Python objects

Python 1,534 22 Updated Sep 2, 2024

Master programming by recreating your favorite technologies from scratch.

Markdown 302,238 28,346 Updated Sep 3, 2024

🔍 A Hex Editor for Reverse Engineers, Programmers and people who value their retinas when working at 3 AM.

C++ 43,898 1,916 Updated Sep 18, 2024
Shell 236 35 Updated Jun 10, 2024

Collaborative Collection of C++ Best Practices. This online resource is part of Jason Turner's collection of C++ Best Practices resources. See README.md for more information.

8,045 880 Updated Aug 6, 2024

Count your code, quickly.

Rust 10,980 533 Updated Sep 19, 2024

Tile primitives for speedy kernels

Cuda 1,503 58 Updated Sep 28, 2024

Improved build system generator for CPython C, C++, Cython and Fortran extensions

Python 484 121 Updated Sep 23, 2024

Distribute and run LLMs with a single file.

C++ 19,333 981 Updated Sep 18, 2024

This is the Rust course used by the Android team at Google. It provides you the material to quickly teach Rust.

Rust 27,600 1,640 Updated Sep 27, 2024

Puzzles for learning Triton

Jupyter Notebook 1,008 64 Updated Sep 25, 2024

Containers for machine learning

Python 7,870 549 Updated Sep 27, 2024

QUICK: Quantization-aware Interleaving and Conflict-free Kernel for efficient LLM inference

Python 107 6 Updated Mar 6, 2024

A native PyTorch Library for large model training

Python 2,233 164 Updated Sep 27, 2024

Ring attention implementation with flash attention

Python 539 41 Updated Sep 20, 2024

Annotated version of the Mamba paper

Jupyter Notebook 445 17 Updated Feb 27, 2024

Marp for VS Code: Create slide deck written in Marp Markdown on VS Code

TypeScript 1,562 73 Updated Sep 10, 2024

Official Code for Stable Cascade

Jupyter Notebook 6,521 530 Updated Jul 25, 2024
Python 7,092 549 Updated Aug 12, 2024

CUDA Templates for Linear Algebra Subroutines

C++ 5,428 916 Updated Sep 25, 2024

Cuda extensions for PyTorch

Cuda 10 2 Updated Jul 13, 2024

FlashInfer: Kernel Library for LLM Serving

Cuda 1,178 109 Updated Sep 27, 2024

CUDA Kernel Benchmarking Library

Cuda 486 63 Updated Jun 5, 2024

The LLVM Project is a collection of modular and reusable compiler and toolchain technologies.

LLVM 28,184 11,632 Updated Sep 28, 2024

Learnings + Exercises from the PMPP book!

C++ 2 Updated Jul 2, 2024

Modern C++ Programming Course (C++03/11/14/17/20/23/26)

HTML 11,849 794 Updated Aug 26, 2024
Next