Jun 20, 2021 · Based on our cost-effective pipeline, we pre-train two models: an encoder-decoder bilingual model with 11 billion parameters (CPM-2) and its corresponding mixture-of-experts (MoE) version with 198 billion parameters.
In this work, we propose a cost-effective pipeline for large-scale pre-trained language models, including pre-training with knowledge inheritance, fine-tuning ...
CPM-2 is an 11-billion-parameter pre-trained language model based on a standard Transformer architecture consisting of a bidirectional encoder and a unidirectional decoder.
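For intuition, the snippet below is a minimal sketch of such an encoder-decoder Transformer using PyTorch's built-in nn.Transformer. The layer count, hidden size, and vocabulary size are placeholder values chosen for illustration, not the actual CPM-2 configuration.

```python
# Minimal encoder-decoder sketch: a fully visible (bidirectional) encoder plus a
# causally masked (unidirectional) decoder. Sizes are illustrative, not CPM-2's.
import torch
import torch.nn as nn

class Seq2SeqLM(nn.Module):
    def __init__(self, vocab_size=30_000, d_model=512, nhead=8, num_layers=6):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, d_model)
        self.transformer = nn.Transformer(
            d_model=d_model, nhead=nhead,
            num_encoder_layers=num_layers, num_decoder_layers=num_layers,
            batch_first=True,
        )
        self.lm_head = nn.Linear(d_model, vocab_size)

    def forward(self, src_ids, tgt_ids):
        # Causal mask so each decoder position only attends to earlier positions.
        tgt_mask = self.transformer.generate_square_subsequent_mask(tgt_ids.size(1))
        hidden = self.transformer(self.embed(src_ids), self.embed(tgt_ids), tgt_mask=tgt_mask)
        return self.lm_head(hidden)

# Example: encode a source sequence and predict target tokens autoregressively.
model = Seq2SeqLM()
src = torch.randint(0, 30_000, (2, 16))   # (batch, source length)
tgt = torch.randint(0, 30_000, (2, 8))    # (batch, target length)
logits = model(src, tgt)                  # (2, 8, vocab_size)
```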
CPM is an open-source program on large-scale pre-trained models, conducted by the Beijing Academy of Artificial Intelligence and Tsinghua University ...
A unified framework named ERNIE 3.0 is proposed for pre-training large-scale knowledge-enhanced models; it fuses auto-regressive and auto-encoding networks ...
CPM is a Transformer-based autoregressive language model with 2.6 billion parameters, trained on 100 GB of Chinese data.
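As a rough sanity check on the 2.6-billion-parameter figure, here is a back-of-the-envelope estimate for a GPT-style decoder-only Transformer. The layer count, hidden size, and vocabulary size below are assumptions chosen for illustration, not the published CPM configuration.

```python
# Rough parameter-count estimate for a decoder-only Transformer.
# All configuration values are illustrative assumptions.

def transformer_param_count(n_layers, d_model, vocab_size):
    embeddings = vocab_size * d_model                # token embedding matrix
    attention_per_layer = 4 * d_model * d_model      # Q, K, V and output projections
    ffn_per_layer = 2 * d_model * (4 * d_model)      # two linear layers, 4x expansion
    return embeddings + n_layers * (attention_per_layer + ffn_per_layer)

# ~32 layers with hidden size 2560 and a ~30k vocabulary land near 2.6B parameters.
print(transformer_param_count(n_layers=32, d_model=2560, vocab_size=30_000))
# -> 2593382400, i.e. roughly 2.6 billion
```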
We have kept training and releasing large-scale PLMs in recent years, which are listed as follows. You are welcome to try them. CPM-2: Cost-Effective Pre-trained Language Models.
Apr 9, 2024 · CPM-2 (Cost-Efficient Pre-trained Language Models) pre-trains bilingual (English and Chinese) models: an 11B-parameter encoder-decoder and a 198B-parameter mixture-of-experts version.
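To illustrate how a mixture-of-experts (MoE) variant can reach far more parameters while keeping per-token compute similar, here is a minimal top-1-routing MoE feed-forward layer in PyTorch. The sizes, expert count, and routing rule are illustrative assumptions, not the CPM-2 recipe.

```python
# Minimal MoE feed-forward layer: each token is routed to one expert network,
# so total parameters grow with the number of experts while each token still
# passes through only one expert. Sizes and top-1 routing are assumptions.
import torch
import torch.nn as nn

class MoEFeedForward(nn.Module):
    def __init__(self, d_model=512, d_ff=2048, num_experts=4):
        super().__init__()
        self.router = nn.Linear(d_model, num_experts)   # scores each token per expert
        self.experts = nn.ModuleList(
            nn.Sequential(nn.Linear(d_model, d_ff), nn.GELU(), nn.Linear(d_ff, d_model))
            for _ in range(num_experts)
        )

    def forward(self, x):                       # x: (num_tokens, d_model)
        gates = self.router(x).softmax(dim=-1)  # routing probabilities
        top1 = gates.argmax(dim=-1)             # top-1 expert index per token
        out = torch.zeros_like(x)
        for i, expert in enumerate(self.experts):
            mask = top1 == i
            if mask.any():
                # Weight each expert's output by its gate value.
                out[mask] = expert(x[mask]) * gates[mask, i].unsqueeze(-1)
        return out

tokens = torch.randn(10, 512)
print(MoEFeedForward()(tokens).shape)  # torch.Size([10, 512])
```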
Large-scale pre-trained models (PTMs) such as BERT and GPT have recently achieved great success and become a milestone in the field of artificial intelligence (AI).