Accuse the agent of potentially cheating its algorithm implementation while pursuing its optimizations, so tell it to optimize for the similarity of outputs against a known good implementation (e.g. for a regression task, minimize the mean absolute error in predictions between the two approaches)
Андрей Шеньшаков,详情可参考一键获取谷歌浏览器下载
Keir Mackenzie,in Canterbury,。业内人士推荐下载安装 谷歌浏览器 开启极速安全的 上网之旅。作为进阶阅读
Фото: Yanya / Shutterstock / Fotodom
Дарья Устьянцева (редактор отдела «Мир»)