ThisIsSoMe

Wu Kun ThisIsSoMe

11 followers · 10 following

Achievements

Lists (1)

Sort

🚀 My stack

1 repository

Stars

junxiaosong / AlphaZero_Gomoku

An implementation of the AlphaZero algorithm for Gomoku (also called Gobang or Five in a Row)

Python 3,322 969 Updated Apr 24, 2024

opendilab / LightZero

[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCTS)

Python 1,144 120 Updated Nov 14, 2024

wdndev / llm_interview_note

主要记录大语言大模型（LLMs）算法（应用）工程师相关的知识及面试题

HTML 3,688 416 Updated Oct 22, 2024

deepseek-ai / DeepSeek-Prover-V1.5

Python 224 23 Updated Aug 16, 2024

OFA-Sys / gsm8k-ScRel

Codes and Data for Scaling Relationship on Learning Mathematical Reasoning with Large Language Models

Python 218 16 Updated Sep 12, 2024

YuxiXie / MCTS-DPO

This is the repository that contains the source code for the Self-Evaluation Guided MCTS for online DPO.

Jupyter Notebook 192 22 Updated Aug 6, 2024

TIGER-AI-Lab / MAmmoTH

Code and data for "MAmmoTH: Building Math Generalist Models through Hybrid Instruction Tuning" (ICLR 2024)

Jupyter Notebook 331 47 Updated Aug 25, 2024

QwenLM / Qwen2.5-Math

A series of math-specific large language models of our Qwen2 series.

Python 590 56 Updated Oct 29, 2024

MARIO-Math-Reasoning / Super_MARIO

Python 245 16 Updated Oct 12, 2024

mlfoundations / dclm

DataComp for Language Models

HTML 1,155 104 Updated Oct 27, 2024

dvlab-research / Step-DPO

Implementation for "Step-DPO: Step-wise Preference Optimization for Long-chain Reasoning of LLMs"

Python 283 9 Updated Jul 15, 2024

google-deepmind / loft

LOFT: A 1 Million+ Token Long-Context Benchmark

Python 144 9 Updated Oct 28, 2024

microsoft / graphrag

A modular graph-based Retrieval-Augmented Generation (RAG) system

Python 19,027 1,872 Updated Nov 14, 2024

chujiezheng / LLM-Extrapolation

Official repository for paper "Weak-to-Strong Extrapolation Expedites Alignment"

Python 67 2 Updated Jun 7, 2024

princeton-nlp / SimPO

[NeurIPS 2024] SimPO: Simple Preference Optimization with a Reference-Free Reward

Python 708 49 Updated Nov 4, 2024

meta-llama / llama3

The official Meta Llama 3 GitHub site

Python 27,100 3,071 Updated Aug 12, 2024

eosphoros-ai / Awesome-Text2SQL

Curated tutorials and resources for Large Language Models, Text2SQL, Text2DSL、Text2API、Text2Vis and more.

1,866 141 Updated Oct 28, 2024

bigcode-project / starcoder2

Home of StarCoder2!

Python 1,784 157 Updated Mar 21, 2024

bigcode-project / the-stack-v2

Code for the curation of The Stack v2 and StarCoder2 training data

Jupyter Notebook 90 6 Updated Apr 11, 2024

eric-mitchell / direct-preference-optimization

Reference implementation for DPO (Direct Preference Optimization)

Python 2,165 180 Updated Aug 11, 2024

yyyujintang / Awesome-Mamba-Papers

Awesome Papers related to Mamba.

1,208 63 Updated Oct 17, 2024

netease-youdao / QAnything

Question and Answer based on Anything.

Python 11,861 1,149 Updated Oct 26, 2024

tobymao / sqlglot

Python SQL Parser and Transpiler

Python 6,731 703 Updated Nov 14, 2024

CLUEbenchmark / SuperCLUE-Math6

SuperCLUE-Math6：新一代中文原生多轮多步数学推理数据集的探索之旅

Python 46 3 Updated Feb 5, 2024

HowieHwong / TrustLLM

[ICML 2024] TrustLLM: Trustworthiness in Large Language Models

Python 465 44 Updated Sep 29, 2024

iiis-turing-llm / llm-training-calculator

Python 41 6 Updated Aug 5, 2024

pldlgb / nuggets

Jupyter Notebook 71 1 Updated Dec 29, 2023

pjlab-sys4nlp / llama-moe

⛷️ LLaMA-MoE: Building Mixture-of-Experts from LLaMA with Continual Pre-training (EMNLP 2024)

Python 882 46 Updated Jun 25, 2024

leanprover-community / mathematics_in_lean

The user home repository for the Mathematics in Lean tutorial.

HTML 266 197 Updated Nov 11, 2024

xianshang33 / llm-paper-daily

Daily updated LLM papers. 每日更新 LLM 相关的论文，欢迎订阅 👏 喜欢的话动动你的小手 🌟 一个

978 40 Updated Jul 31, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Wu Kun ThisIsSoMe

Achievements

Achievements

Block or report ThisIsSoMe

Lists (1)

🚀 My stack

Stars

junxiaosong / AlphaZero_Gomoku

opendilab / LightZero

wdndev / llm_interview_note

deepseek-ai / DeepSeek-Prover-V1.5

OFA-Sys / gsm8k-ScRel

YuxiXie / MCTS-DPO

TIGER-AI-Lab / MAmmoTH

QwenLM / Qwen2.5-Math

MARIO-Math-Reasoning / Super_MARIO

mlfoundations / dclm

dvlab-research / Step-DPO

google-deepmind / loft

microsoft / graphrag

chujiezheng / LLM-Extrapolation

princeton-nlp / SimPO

meta-llama / llama3

eosphoros-ai / Awesome-Text2SQL

bigcode-project / starcoder2

bigcode-project / the-stack-v2

eric-mitchell / direct-preference-optimization

yyyujintang / Awesome-Mamba-Papers

netease-youdao / QAnything

tobymao / sqlglot

CLUEbenchmark / SuperCLUE-Math6

HowieHwong / TrustLLM

iiis-turing-llm / llm-training-calculator

pldlgb / nuggets

pjlab-sys4nlp / llama-moe

leanprover-community / mathematics_in_lean

xianshang33 / llm-paper-daily