[B! LoRA] manboubirdのブックマーク

manboubird id:manboubird

LoRAに関するmanboubirdのブックマーク (3)

ReLoRA: High-Rank Training Through Low-Rank Updates
Despite the dominance and effectiveness of scaling, resulting in large networks with hundreds of billions of parameters, the necessity to train overparameterized models rem ains poorly understood, while training costs grow exponentially. In this paper, we explore parameter-efficient training techniques as an approach to training large neural networks. We introduce a novel method called ReLoRA, whic
manboubird 2023/07/16
paper

llm

LoRA
リンク
GitHub - shi3z/peft_pretraining
manboubird 2023/07/16
llm

training

model

llamaIndex

LoRA
リンク
これぞ革命!?ゼロから大規模言語モデルを学習できるReLORA登場(7/18追記あり)｜shi3z
導入　本当に革命的な技術なのか? 「君たちはどう生きるか」で驚いている間にすごい論文が世界の話題を掻っ攫っていた。その名も「ReLORA」簡単に言えば、「事前学習にLoRAを使う」というものである。これは本当に革命的な発見かもしれないので、僕の仮説も含めて丁寧に説明する。まず、大前提として、「LoRA」という技術について LoRAは、「Low Rank Adaptation(日本語で言うとすれば低階適応)」という技術で、これまでは主にファインチューニングに使われてきた。ファインチューニングとは、あらかじめ学習されたニューラルネットワークに対して追加で学習させ、概念を強調させたり新しく覚えさせたりする。たとえば、僕の顔でStableDiffusionをファインチューニングすれば、僕みたいな顔の絵がどんどん出てくる。言語モデルにおけるLoRAも同様で、新しい概念や「こういうやりとり
manboubird 2023/07/16
LoRA

model

generativeAi

llm

paper
リンク
1

お知らせ

もっと読む

公式Twitter

@HatenaBookmark
リリース、障害情報などのサービスのお知らせ
@hatebu
最新の人気エントリーの配信

キーボードショートカット一覧

j次のブックマーク

k前のブックマーク

lあとで読む

eコメント一覧を開く

oページを開く

設定を変更しましたx