Unit 04

Project ideation

Units 1-3 give you the landscape. This unit helps you decide where effort might compound: implementing a small piece of a published mechanism, sharpening an open question, or testing an assumption the field keeps repeating.

The goal is not to write a grand proposal. The goal is to make your theory of change, smallest experiment, and failure conditions explicit enough that the project can actually begin.

Theory of change

Verification work can become technically interesting without becoming useful. Before committing, identify which institution, decision, or policy lever your result would plug into.

Reading on theory of change

Links to come

Curated set of theory-of-change resources will go here. Expect two to four short pieces that walk through what theory of change means for technical AI safety and verification work, and how to write one for your own project.

Course author note: links to be added.

Scoping a project

Work through these prompts in a notebook or with a collaborator. Aim for one sentence per prompt. The point is to force choices, not to produce polished prose.

  • Which mechanism family from Unit 3 are you most drawn to, and why is your existing technical background useful there.
  • Pick one specific paper from that family. What is the smallest extension of it that would teach you something the field does not already know.
  • If your project succeeded, what would change in the world. Who would act on the result. Who is the named person or institution you can picture using it.
  • What is the cheapest experiment that would tell you whether the project is worth doing at all.
  • What is your biggest unknown right now, and which other reading or which conversation would resolve it.
  • What would make you abandon the project. Pre-commit to the failure conditions before you start.

Where to take this further

A few concrete routes if you want to move from reading into applied work.

  • The Hardware Assurance Programme is the most direct route from this course to applied project work. Six days in Cambridge with residents from FLI, MIRI, Intel, ARM, and others working on the open problems above.
  • The AI Security Forum's RFD list is the closest thing to a public problem queue. Pick one, scope it down, and start.
  • If you are based at a university, identify the closest faculty member (hardware security, cryptography, or AI policy) and ask if they would supervise even an informal reading group on one of the agendas in Unit 2.
  • If you write up your project sketch and want feedback, the Cambridge AI Safety Hub team is happy to read drafts. Email hello@caish.org.