Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to content
View ThisIsSoMe's full-sized avatar

Block or report ThisIsSoMe

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

An implementation of the AlphaZero algorithm for Gomoku (also called Gobang or Five in a Row)

Python 3,322 969 Updated Apr 24, 2024

[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCTS)

Python 1,144 120 Updated Nov 14, 2024

主要记录大语言大模型(LLMs) 算法(应用)工程师相关的知识及面试题

HTML 3,688 416 Updated Oct 22, 2024

Codes and Data for Scaling Relationship on Learning Mathematical Reasoning with Large Language Models

Python 218 16 Updated Sep 12, 2024

This is the repository that contains the source code for the Self-Evaluation Guided MCTS for online DPO.

Jupyter Notebook 192 22 Updated Aug 6, 2024

Code and data for "MAmmoTH: Building Math Generalist Models through Hybrid Instruction Tuning" (ICLR 2024)

Jupyter Notebook 331 47 Updated Aug 25, 2024

A series of math-specific large language models of our Qwen2 series.

Python 590 56 Updated Oct 29, 2024

DataComp for Language Models

HTML 1,155 104 Updated Oct 27, 2024

Implementation for "Step-DPO: Step-wise Preference Optimization for Long-chain Reasoning of LLMs"

Python 283 9 Updated Jul 15, 2024

LOFT: A 1 Million+ Token Long-Context Benchmark

Python 144 9 Updated Oct 28, 2024

A modular graph-based Retrieval-Augmented Generation (RAG) system

Python 19,027 1,872 Updated Nov 14, 2024

Official repository for paper "Weak-to-Strong Extrapolation Expedites Alignment"

Python 67 2 Updated Jun 7, 2024

[NeurIPS 2024] SimPO: Simple Preference Optimization with a Reference-Free Reward

Python 708 49 Updated Nov 4, 2024

The official Meta Llama 3 GitHub site

Python 27,100 3,071 Updated Aug 12, 2024

Curated tutorials and resources for Large Language Models, Text2SQL, Text2DSL、Text2API、Text2Vis and more.

1,866 141 Updated Oct 28, 2024

Home of StarCoder2!

Python 1,784 157 Updated Mar 21, 2024

Code for the curation of The Stack v2 and StarCoder2 training data

Jupyter Notebook 90 6 Updated Apr 11, 2024

Reference implementation for DPO (Direct Preference Optimization)

Python 2,165 180 Updated Aug 11, 2024

Awesome Papers related to Mamba.

1,208 63 Updated Oct 17, 2024

Question and Answer based on Anything.

Python 11,861 1,149 Updated Oct 26, 2024

Python SQL Parser and Transpiler

Python 6,731 703 Updated Nov 14, 2024

SuperCLUE-Math6:新一代中文原生多轮多步数学推理数据集的探索之旅

Python 46 3 Updated Feb 5, 2024

[ICML 2024] TrustLLM: Trustworthiness in Large Language Models

Python 465 44 Updated Sep 29, 2024
Jupyter Notebook 71 1 Updated Dec 29, 2023

⛷️ LLaMA-MoE: Building Mixture-of-Experts from LLaMA with Continual Pre-training (EMNLP 2024)

Python 882 46 Updated Jun 25, 2024

The user home repository for the Mathematics in Lean tutorial.

HTML 266 197 Updated Nov 11, 2024

Daily updated LLM papers. 每日更新 LLM 相关的论文,欢迎订阅 👏 喜欢的话动动你的小手 🌟 一个

978 40 Updated Jul 31, 2024
Next