-
The Chinese University of Hong Kong
- Hong Kong
- https://rulegreen.github.io/
Highlights
- Pro
Stars
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
A programming framework for agentic AI 🤖 PyPi: autogen-agentchat Discord: https://aka.ms/autogen-discord Office Hour: https://aka.ms/autogen-officehour
2023年最新总结,阿里,腾讯,百度,美团,头条等技术面试题目,以及答案,专家出题人分析汇总。
Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.
DSPy: The framework for programming—not prompting—language models
ChatGLM2-6B: An Open Bilingual Chat LLM | 开源双语对话语言模型
GUI for ChatGPT API and many LLMs. Supports agents, file-based QA, GPT finetuning and query with web search. All with a neat UI.
Python Implementation of Reinforcement Learning: An Introduction
Train transformer language models with reinforcement learning.
An Autonomous LLM Agent for Complex Task Solving
A framework for few-shot evaluation of language models.
a state-of-the-art-level open visual language model | 多模态预训练模型
Utilities intended for use with Llama models.
AppAgent: Multimodal Agents as Smartphone Users, an LLM-based multimodal agent framework designed to operate smartphone apps.
[ICLR'24 spotlight] An open platform for training, serving, and evaluating large language model for tool learning.
⚡ A distributed crawler for weibo, building with celery and requests.
[COLM 2024] OpenAgents: An Open Platform for Language Agents in the Wild
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKT…
A simple tool to update bib entries with their official information (e.g., DBLP or the ACL anthology).
[NeurIPS 2023] Reflexion: Language Agents with Verbal Reinforcement Learning
Multi-Task Deep Neural Networks for Natural Language Understanding
This includes the original implementation of SELF-RAG: Learning to Retrieve, Generate and Critique through self-reflection by Akari Asai, Zeqiu Wu, Yizhong Wang, Avirup Sil, and Hannaneh Hajishirzi.
Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.
PGPortfolio: Policy Gradient Portfolio, the source code of "A Deep Reinforcement Learning Framework for the Financial Portfolio Management Problem"(https://arxiv.org/pdf/1706.10059.pdf).
🦄 State-of-the-Art Conversational AI with Transfer Learning