Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to content
View Ethan-TZ's full-sized avatar
🧭
Out of Memory
🧭
Out of Memory

Organizations

@RUCAIBox

Block or report Ethan-TZ

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Python 1 Updated Jan 25, 2025

Low-bit optimizers for PyTorch

Python 125 9 Updated Oct 9, 2023

Deep neural network framework for multiple GPUs

Cuda 33 15 Updated Jun 20, 2015

Accessible large language models via k-bit quantization for PyTorch.

Python 6,611 654 Updated Feb 6, 2025

Code for the ICLR 2023 paper "GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers".

Python 2,017 164 Updated Mar 27, 2024

Official implementation of ICML 2024 paper "ExCP: Extreme LLM Checkpoint Compression via Weight-Momentum Joint Shrinking".

Python 45 1 Updated Jul 12, 2024

Skyformer: Remodel Self-Attention with Gaussian Kernel and Nystr\"om Method (NeurIPS 2021)

Python 60 10 Updated Apr 19, 2022

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Python 8,171 498 Updated May 3, 2024

KarSein for CTR predict

Python 7 1 Updated Feb 5, 2025

The official implementation of Ada2Fair (RecSys'24 Short Paper).

Python 5 2 Updated Jan 21, 2025

xDCN: Combining Exponential and Linear Cross Network for Click-Through Rate Prediction

Python 40 6 Updated Jan 17, 2025

Official PyTorch implementation of Learning to (Learn at Test Time): RNNs with Expressive Hidden States

Python 1,119 70 Updated Jul 14, 2024

[KDD 2024] This is the official PyTorch implementation for the paper: "Rotative Factorization Machines"

Python 3 Updated Aug 20, 2024

Long Range Arena for Benchmarking Efficient Transformers

Python 740 84 Updated Dec 16, 2023
Python 1,528 133 Updated Apr 27, 2023

ChatRWKV is like ChatGPT but powered by RWKV (100% RNN) language model, and open source.

Python 9,446 704 Updated Jan 28, 2025

Repository hosting code for "Actions Speak Louder than Words: Trillion-Parameter Sequential Transducers for Generative Recommendations" (https://arxiv.org/abs/2402.17152).

Python 886 163 Updated Feb 7, 2025

[KDD 2024] This is the official PyTorch implementation for the paper: "Rotative Factorization Machines"

Python 3 1 Updated Aug 20, 2024
HTML 1 Updated May 17, 2024

Code for ACM RecSys 2023 paper "Turning Dross Into Gold Loss: Is BERT4Rec really better than SASRec?"

Jupyter Notebook 40 8 Updated Feb 24, 2024

A comprehensive library for implementing LLMs, including a unified training pipeline and comprehensive model evaluation.

Python 736 94 Updated Jan 7, 2025

为GPT/GLM等LLM大语言模型提供实用化交互接口,特别优化论文阅读/润色/写作体验,模块化设计,支持自定义快捷按钮&函数插件,支持Python和C++等项目剖析&自译解功能,PDF/LaTex论文翻译&总结功能,支持并行问询多种LLM模型,支持chatglm3等本地模型。接入通义千问, deepseekcoder, 讯飞星火, 文心一言, llama2, rwkv, claude2, m…

Python 67,328 8,261 Updated Feb 7, 2025

Scripts for processing the Amazon Reviews 2023 dataset; implementations and checkpoints of BLaIR: "Bridging Language and Items for Retrieval and Recommendation".

Python 161 28 Updated Nov 26, 2024
Python 31 4 Updated Sep 3, 2023

Benchmarks for classification of genomic sequences

Jupyter Notebook 128 18 Updated Feb 12, 2024

Official implementation for HyenaDNA, a long-range genomic foundation model built with Hyena

Assembly 634 88 Updated Jun 15, 2024

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 39,646 4,863 Updated Feb 6, 2025

[SIGIR 2024] This is the official PyTorch implementation for the paper: "EulerFormer: Sequential User Behavior Modeling with Complex Vector Attention".

Python 10 2 Updated Oct 1, 2024

[SIGIR 2024] This is the official PyTorch implementation for the paper: "EulerFormer: Sequential User Behavior Modeling with Complex Vector Attention".

Python 17 1 Updated Oct 5, 2024

Mamba SSM architecture

Python 13,902 1,200 Updated Jan 18, 2025
Next