Location via proxy:
[ UP ]
[Report a bug]
[Manage cookies]
No cookies
No scripts
No ads
No referrer
Show this form
Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
OpenRLHF
community
https://github.com/OpenRLHF
AI & ML interests
None defined yet.
Team members
7
models
9
Sort: Recently updated
OpenRLHF/Llama-3-8b-iter-dpo-179k
Text Generation
•
Updated
Jul 28
•
7
OpenRLHF/Llama-3-8b-rm-mixture
Updated
Jul 17
•
2.13k
OpenRLHF/Llama-3-8b-rm-700k
Updated
Jul 17
•
80
OpenRLHF/Llama-3-8b-rlhf-100k
Text Generation
•
Updated
Jun 24
•
73
•
1
OpenRLHF/Llama-3-8b-sft-mixture
Text Generation
•
Updated
Jun 14
•
4.02k
OpenRLHF/Llama-2-7b-sft-model-ocra-500k
Text Generation
•
Updated
Jun 9
•
168
OpenRLHF/Llama-2-7b-rm-anthropic_hh-lmsys-oasst-webgpt
Updated
Jun 9
•
103
•
1
OpenRLHF/Llama-2-13b-rm-anthropic_hh-lmsys-oasst-webgpt
Updated
Jan 24
•
1
OpenRLHF/Llama-2-13b-sft-model-ocra-500k
Text Generation
•
Updated
Jan 5
•
8
•
1
datasets
3
Sort: Recently updated
OpenRLHF/preference_700K
Viewer
•
Updated
Jul 13
•
700k
•
2
OpenRLHF/prompt-collection-v0.1
Viewer
•
Updated
Jun 14
•
179k
•
1.98k
•
1
OpenRLHF/preference_dataset_mixture2_and_safe_pku
Viewer
•
Updated
Jun 14
•
555k
•
350
•
2