A Roadmap for Robust End-to-End Alignment

Hoang, Lê Nguyên

Computer Science > Artificial Intelligence

arXiv:1809.01036 (cs)

[Submitted on 4 Sep 2018 (v1), last revised 25 Feb 2020 (this version, v4)]

Title:A Roadmap for Robust End-to-End Alignment

Authors:Lê Nguyên Hoang

View PDF

Abstract:This paper discussed the {\it robust alignment} problem, that is, the problem of aligning the goals of algorithms with human preferences. It presented a general roadmap to tackle this issue. Interestingly, this roadmap identifies 5 critical steps, as well as many relevant aspects of these 5 steps. In other words, we have presented a large number of hopefully more tractable subproblems that readers are highly encouraged to tackle. Hopefully, this combination allows to better highlight the most pressing problems, how every expertise can be best used to, and how combining the solutions to subproblems might add up to solve robust alignment.

Comments:	21 pages, 2 figures
Subjects:	Artificial Intelligence (cs.AI)
Cite as:	arXiv:1809.01036 [cs.AI]
	(or arXiv:1809.01036v4 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.1809.01036

Submission history

From: Lê Nguyên Hoang [view email]
[v1] Tue, 4 Sep 2018 15:19:44 UTC (577 KB)
[v2] Sun, 21 Oct 2018 11:01:41 UTC (695 KB)
[v3] Mon, 25 Feb 2019 09:32:09 UTC (834 KB)
[v4] Tue, 25 Feb 2020 08:45:45 UTC (296 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.AI

< prev | next >

new | recent | 2018-09

Change to browse by:

References & Citations

DBLP - CS Bibliography

listing | bibtex

Lê Nguyên Hoang

export BibTeX citation

Computer Science > Artificial Intelligence

Title:A Roadmap for Robust End-to-End Alignment

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Artificial Intelligence

Title:A Roadmap for Robust End-to-End Alignment

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators