FUDGE: Controlled Text Generation With Future Discriminators

Tags
NLP
TLDR Paper Review
Published
Published July 22, 2021

Existing Problem

  • There are several approaches to Controlled Text Generation
  • CCLM: directly fine-tunes a general LM such as GPT-2 on a domain-specific corpus
  • PPLM: attaches a linear attribute model to GPT-2's final-layer output and applies gradient ascent to steer generation toward a specific attribute

FUDGE

  • Language Model G
    • G models the distribution P(X) over text
  • What we want is P(X|a): generation conditioned on an attribute a (conditioning)
    • The approach that models P(x_i|x_1:i-1, a) directly → CCLM
  • FUDGE instead uses the Bayesian factorization P(x_i|x_1:i-1, a) ∝ P(a|x_1:i) · P(x_i|x_1:i-1)
    • This part is almost identical to the methodology used in GeDi.
    • Prediction model → P(a|x_1:i) is a classifier that predicts, from the prefix x_1:i alone, whether the 'completed sentence' will have attribute a
    • That is, the classifier (LSTM-based) is applied to each of the Top-200 (Top-k) sampled candidates
      • No justification is given here for the computing cost of running inference 200 times
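The reweighting described above can be sketched for a single decoding step. This is a minimal illustration, not the paper's implementation: `fudge_step` and `toy_attr_logprob` are hypothetical names, the toy classifier stands in for the LSTM-based predictor, and the three-token vocabulary with its probabilities is made up for the example.

```python
import math


def fudge_step(lm_logprobs, attr_logprob_fn, prefix, k=200):
    """One FUDGE-style reweighting step (illustrative sketch).

    lm_logprobs:     dict token -> log P(x_i | x_1:i-1) from the base LM G.
    attr_logprob_fn: hypothetical callable(tokens) -> log P(a | x_1:i).
    prefix:          list of tokens generated so far (x_1:i-1).
    Returns a dict token -> P(x_i | x_1:i-1, a), renormalized over the
    top-k candidates only.
    """
    # Keep the k most likely next tokens under the base LM.
    top_k = sorted(lm_logprobs.items(), key=lambda kv: kv[1], reverse=True)[:k]

    # Bayes rule in log space:
    # log P(x_i | x_1:i-1, a) = log P(a | x_1:i) + log P(x_i | x_1:i-1) + const
    # The classifier is called once per candidate, i.e. up to k times per step.
    scores = {tok: lp + attr_logprob_fn(prefix + [tok]) for tok, lp in top_k}

    # Renormalize with a numerically stable log-sum-exp.
    m = max(scores.values())
    log_z = m + math.log(sum(math.exp(s - m) for s in scores.values()))
    return {tok: math.exp(s - log_z) for tok, s in scores.items()}


def toy_attr_logprob(tokens):
    # Toy stand-in for the attribute classifier (an assumption for this
    # example): it strongly prefers prefixes that end in "great".
    return math.log(0.9) if tokens[-1] == "great" else math.log(0.1)


# Made-up base LM distribution over the next token.
lm = {"great": math.log(0.2), "bad": math.log(0.5), "okay": math.log(0.3)}
posterior = fudge_step(lm, toy_attr_logprob, ["the", "movie", "was"], k=3)
truncated = fudge_step(lm, toy_attr_logprob, ["the", "movie", "was"], k=2)
```

In this toy setup the base LM prefers "bad", but after reweighting "great" becomes the most likely token. With `k=2`, "great" is cut before the classifier ever sees it, which shows how the choice of k trades off control against the per-step classifier cost noted above.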