Welcome to the AI Safety Concept Map! This is an attempt to visualize the conceptual landscape of AI existential safety (as of June 2024) in order to help orient newcomers to the field. The current diagram is largely based on the content of the AI Safety Fundamentals Alignment Course run by BlueDot Impact.

AI Safety Map

The diagram organizes different approaches to AI safety as items which are grouped into relevant themes. The size of the circle an item is in is roughly correlated to how mature the item/approach is. Arrows are intended to indicate how the different themes broadly interact with one another. A breakdown of the items and themes visualized above alongside links to learn more is given below.

This diagram is mostly a sense-making effort from an interested amateur with <100hrs of exposure to the field of AI safety but I hope you still find it helpful! If you have any feedback on anything, checkout the “About” page or comment on AI Safety Concept Map Sheet (which is also the source of the info below).

Posts

subscribe via RSS