Longterm Wiki

Alignment Forum - Newcomer's Guide to Technical AI Safety Field

blog

Author

zeshen

Credibility Rating

Good (3/5)

Good quality. Reputable source with community review or editorial standards, but less rigorous than peer-reviewed venues.

Rating inherited from publication venue: Alignment Forum

A useful onboarding resource for those new to AI safety, published on the Alignment Forum, which aggregates serious technical and conceptual work in the field; best read early in one's engagement with AI safety research.

Metadata

Importance: 65/100 · blog post · educational

Summary

A comprehensive introductory guide for those new to technical AI safety research, mapping the field's key problems, research agendas, and approaches. It orients newcomers by explaining major research directions and key organizations, and by showing how the different subfields relate to one another. The guide serves as a starting point for understanding what technical AI safety research involves and how to engage with it.

Key Points

  • Provides an overview of the major technical AI safety research directions including alignment, interpretability, and robustness
  • Explains the motivations behind AI safety research and why researchers believe current approaches may be insufficient
  • Maps key organizations and research groups working on technical AI safety problems
  • Helps newcomers understand the relationships between different subfields and research agendas
  • Offers guidance on how to get involved and where to focus learning efforts depending on background and interests

Cited by 1 page

| Page | Type | Quality |
| --- | --- | --- |
| Elicit (AI Research Tool) | Organization | 63.0 |

Cached Content Preview

HTTP 200 · Fetched Mar 15, 2026 · 32 KB
[A newcomer’s guide to the technical AI safety field](https://www.alignmentforum.org/posts/5rsa37pBjo4Cf9fkE/a-newcomer-s-guide-to-the-technical-ai-safety-field#)

12 min read

Contents:

  • [Motivations](https://www.alignmentforum.org/posts/5rsa37pBjo4Cf9fkE/a-newcomer-s-guide-to-the-technical-ai-safety-field#Motivations)
  • [Some terminology](https://www.alignmentforum.org/posts/5rsa37pBjo4Cf9fkE/a-newcomer-s-guide-to-the-technical-ai-safety-field#Some_terminology)
  • [Brief history of AI and AI safety](https://www.alignmentforum.org/posts/5rsa37pBjo4Cf9fkE/a-newcomer-s-guide-to-the-technical-ai-safety-field#Brief_history_of_AI_and_AI_safety)
  • [Different paradigms](https://www.alignmentforum.org/posts/5rsa37pBjo4Cf9fkE/a-newcomer-s-guide-to-the-technical-ai-safety-field#Different_paradigms)
  • [AI risks](https://www.alignmentforum.org/posts/5rsa37pBjo4Cf9fkE/a-newcomer-s-guide-to-the-technical-ai-safety-field#AI_risks)
  • [Threat models](https://www.alignmentforum.org/posts/5rsa37pBjo4Cf9fkE/a-newcomer-s-guide-to-the-technical-ai-safety-field#Threat_models)
  • [Seed AI](https://www.alignmentforum.org/posts/5rsa37pBjo4Cf9fkE/a-newcomer-s-guide-to-the-technical-ai-safety-field#Seed_AI)
  • [Prosaic AI](https://www.alignmentforum.org/posts/5rsa37pBjo4Cf9fkE/a-newcomer-s-guide-to-the-technical-ai-safety-field#Prosaic_AI)
  • [Brain-like AI](https://www.alignmentforum.org/posts/5rsa37pBjo4Cf9fkE/a-newcomer-s-guide-to-the-technical-ai-safety-field#Brain_like_AI)
  • [Polarity](https://www.alignmentforum.org/posts/5rsa37pBjo4Cf9fkE/a-newcomer-s-guide-to-the-technical-ai-safety-field#Polarity)
  • [Research type](https://www.alignmentforum.org/posts/5rsa37pBjo4Cf9fkE/a-newcomer-s-guide-to-the-technical-ai-safety-field#Research_type)
  • [Conceptual](https://www.alignmentforum.org/posts/5rsa37pBjo4Cf9fkE/a-newcomer-s-guide-to-the-technical-ai-safety-field#Conceptual)
  • [Theoretical](https://www.alignmentforum.org/posts/5rsa37pBjo4Cf9fkE/a-newcomer-s-guide-to-the-technical-ai-safety-field#Theoretical)
  • [Applied](https://www.alignmentforum.org/posts/5rsa37pBjo4Cf9fkE/a-newcomer-s-guide-to-the-technical-ai-safety-field#Applied)
  • [Conclusion](https://www.alignmentforum.org/posts/5rsa37pBjo4Cf9fkE/a-newcomer-s-guide-to-the-technical-ai-safety-field#Conclusion)
  • [Useful resources](https://www.alignmentforum.org/posts/5rsa37pBjo4Cf9fkE/a-newcomer-s-guide-to-the-technical-ai-safety-field#Useful_resources)

[AI Alignment Intro Materials](https://www.alignmentforum.org/w/ai-alignment-intro-materials)[AI Alignment Fieldbuilding](https://www.alignmentforum.org/w/ai-alignment-fieldbuilding)[Has Diagram](https://www.alignmentforum.org/w/has-diagram)[Refine](https://www.alignmentforum.org/w/refine)[AI](https://www.alignmentforum.org/w/ai)[Community](https://www.alignmentforum.org/w/community)

... (truncated, 32 KB total)
Resource ID: 157551fb2e364a2b | Stable ID: ZTlhZmU4OD