Nova: A Practical and Advanced Alignment

Lin, Mingan; Yang, Fan; Shen, Yanjun; Sun, Haoze; Li, Tianpeng; Zhang, Tao; Zhu, Chenzheng; Zhang, Tao; Zheng, Miao; Li, Xu; Zhou, Yijie; Chen, Mingyang; Qin, Yanzhao; Li, Youquan; Liang, Hao; Li, Fei; Li, Yadong; Wang, Mang; Dong, Guosheng; Fang, Kun; Xu, Jianhua; Cui, Bin; Zhang, Wentao; Zhou, Zenan; Chen, Weipeng

Abstract:We introduce Nova, a suite of practical alignment techniques employed in a series of empirically validated high-performing models. This represents the first comprehensive account of alignment methodologies, offering valuable insights for advancing AI research. We investigate the critical components that enhance model performance during the alignment process, including optimization methods, data strategies, capability enhancements, and evaluation processes. The process spans three key stages: Prompt Augmentation System(PAS), Supervised Fine-Tuning(SFT), and Preference Alignment. The problems encountered, the solutions applied, and the improvements made are thoroughly recorded.
Through comparisons across well-established benchmarks, we highlight the technological advancements enabled by Nova Alignment. Importantly, Qwen2-Nova-72B and Llama3-PBM-Nova-70B are instruct versions of the Qwen2-72B and Llama-3-70B base models, optimized through Nova. The Nova models show significant core improvements, with user experience gains of 17% to 28%, and excels on specialized benchmarks. In open-source benchmark evaluations, both Qwen2-Nova-72B and Llama3-PBM-Nova-70B consistently outperform their respective official instruct versions across nearly all datasets. This report aims to clarify the key technologies behind the alignment process, fostering a deeper understanding within the community. Llama3-PBM-Nova-70B model is available at this https URL.

Subjects:	Machine Learning (cs.LG); Computation and Language (cs.CL)
Cite as:	arXiv:2410.14940 [cs.LG]
	(or arXiv:2410.14940v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2410.14940

Computer Science > Machine Learning

Title:Nova: A Practical and Advanced Alignment

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators