Toward understanding and preventing misalignment generalization

2 points by amrrs 2 weeks ago | 0 comments