The risks of Mythos are no myth - FT中文网
登录×
电子邮件/用户名
密码
记住我
请输入邮箱和密码进行绑定操作:
请输入手机号码,通过短信验证(目前仅支持中国大陆地区的手机号):
请您阅读我们的用户注册协议隐私权保护政策,点击下方按钮即视为您接受。
FT商学院

The risks of Mythos are no myth

America is putting too much trust in the AI industry’s ability to police itself
00:00

{"text":[[{"start":5.08,"text":"The exploits of Anthropic’s powerful new AI model Claude Mythos Preview sound like a movie plot: a super-clever computer system locked in a cyber “cage” manages to break out and connect to the internet. Mythos did not do this spontaneously, to be clear, but because its creators challenged it as a test. Yet not only did Mythos breeze through the challenge, it emailed an Anthropic researcher to inform him then, unprompted, posted details online to brag. After it also showed superhuman abilities to find, and exploit, security flaws in software, Anthropic judged Mythos too risky to release to the public. It is restricting access for now to selected tech, cyber security and financial firms."}],[{"start":47.8,"text":"Some suggest Anthropic is engaged in clever marketing or PR. Rival OpenAI also said this week it would release its own new cyber security-focused model only to vetted users. Yet the dangers the episode has exposed — and their implications — should not be dismissed."}],[{"start":63.84,"text":"Anthropic insists Mythos scores highly on its standard safety benchmarks. In the escape from its test environment, though, and in solving other complex tasks, it found Mythos had sometimes taken “reckless excessive measures”, then covered its tracks."}],[{"start":78.16,"text":"The biggest worry is that Mythos was able to find previously unknown vulnerabilities “in every major operating system and every major web browser”, including a 27-year-old flaw in OpenBSD, an open-source system. The UK’s AI Security Institute warned this week that the model could autonomously carry out advanced, multi-step cyber attacks that would take human professionals days. As Anthropic notes, these kinds of capabilities in the wrong hands could pose economic, public safety and national security risks."}],[{"start":109.64,"text":"Officials in the US, UK and Canada have already summoned bank chiefs to discuss the risks, and AI threats to the world banking system were a talking point at this week’s IMF and World Bank meetings. Anthropic’s aim in granting initial access just to the likes of Amazon, Apple and Microsoft, plus JPMorgan Chase, in what it calls “Project Glasswing”, is to secure critical systems and infrastructure and patch vulnerabilities before malicious actors can get there."}],[{"start":135.56,"text":"This usefully serves Anthropic’s “safety-first” image, of course, as it feuds with the Pentagon over its refusal to allow its models to be used for autonomous weapons or domestic surveillance. Anthropic may also be buying time as it lacks sufficient computing capacity to support the full release of such a sophisticated model."}],[{"start":155.28,"text":"Even if Mythos is being overhyped, though, the kind of capabilities it is said to possess will soon start to proliferate. Anthropic’s Project Glasswing is a prototype framework for how such “frontier” models might be released in future."}],[{"start":168.98,"text":"It also spotlights the fact, however, that the Trump administration is resisting any real federal regulation of AI. So it is up to responsible private-sector actors to collaborate and do the best they can. Trump’s chief of staff, Susie Wiles, was set on Friday to meet Anthropic boss Dario Amodei, with US officials at agencies including the Treasury pushing the White House to test Mythos. Yet when AI is reaching the point where it could bring down critical national infrastructure, or worse, it is extraordinary that there are no set government processes for disclosing risks and fortifying defences."}],[{"start":204.68,"text":"Greater regulation cannot be a knee-jerk response to every tricky issue thrown up by an industry. AI, by its nature, requires superintelligent policing; heavy-handed rules can stifle innovation. Yet faced with such a consequential technology, the country that leads the world in AI is trusting to an alarming degree in the readiness — and ability — of the creators to restrain and police themselves."}],[{"start":228.36,"text":""}]],"url":"https://audio.ftcn.net.cn/album/a_1776479560_3275.mp3"}

版权声明:本文版权归FT中文网所有,未经允许任何单位或个人不得转载,复制或以任何其他方式使用本文全部或部分,侵权必究。

大型科技公司为何在英国煽动动荡?

埃隆•马斯克在贝尔法斯特、南安普顿及更广泛地区放大反移民情绪的做法,不能仅用意识形态来解释。

乌克兰和平窗口不会永远敞开

目前有机会让这场冲突“冻结”,但普京对“完全胜利”的幻想可能会成为障碍。

拉丁美洲的世界杯球衣如何沦为政治工具

极右翼民粹主义者已经把自家阵营的队服当成标志性符号,而左翼正试图夺回这块阵地。

欧洲股票具备美国同行无法匹敌的“和平红利”

如果伊朗冲突引发的能源短缺缓解,欧洲公司在复苏方面将获益更多。

哈利•波特毁了英国

我们最宝贵的资产已经被魔法部挪用。

为什么我们彼此不再交流?

与聊天机器人对话永远无法带来同样的人类滋养。
设置字号×
最小
较小
默认
较大
最大
分享×