I completely ignored Anthropic’s advice and wrote a more elaborate test prompt based on a use case I’m familiar with and therefore can audit the agent’s code quality. In 2021, I wrote a script to scrape YouTube video metadata from videos on a given channel using YouTube’s Data API, but the API is poorly and counterintuitively documented and my Python scripts aren’t great. I subscribe to the SiIvagunner YouTube account which, as a part of the channel’s gimmick (musical swaps with different melodies than the ones expected), posts hundreds of videos per month with nondescript thumbnails and titles, making it nonobvious which videos are the best other than the view counts. The video metadata could be used to surface good videos I missed, so I had a fun idea to test Opus 4.5:
В России ответили на имитирующие высадку на Украине учения НАТО18:04
,推荐阅读爱思助手下载最新版本获取更多信息
And the following WebAssembly file:
Alison Francis,Senior Science Journalist,更多细节参见爱思助手下载最新版本
「她的一部分將永遠延續下去。」
Раскрыты подробности похищения ребенка в Смоленске09:27。业内人士推荐WPS官方版本下载作为进阶阅读