Incentivising Competitive Evals

Andrey Breslav, August 29, 2023

todo

Goals:

every frontier model undergoes evaluation

models failing safety criteria are not deployed

evaluation techniques are systematically improved

Overall approach:

incentivise creation of many businesses providing evaluations
incentivise these businesses to find issues and improve their techniques
make sure that developers can still work on capabilities as long as those capabilities are not actually harmful

Roles:

model developer invests resources into developing the model and covers evaluation costs