Conned by a chatbot - FT中文网
登录×
电子邮件/用户名
密码
记住我
请输入邮箱和密码进行绑定操作:
请输入手机号码,通过短信验证(目前仅支持中国大陆地区的手机号):
请您阅读我们的用户注册协议隐私权保护政策,点击下方按钮即视为您接受。
卧底经济学家

Conned by a chatbot

Like tricksters, LLMs have perfected the art of plausibility
00:00

{"text":[[{"start":4.65,"text":"Marathon day. An early train into London, then an unfamiliar journey across a race-disrupted city from Paddington to Blackheath, all in good time for the start of the race. I was nervous, of course, but was cheered by the sight of another bib-wearing runner — more experienced at marathons, less familiar with London."}],[{"start":24.4,"text":"Me: “How do you plan to get to the start line?”"}],[{"start":27.599999999999998,"text":"He: “I’ve asked ChatGPT. It says Elizabeth Line to Liverpool Street, then the train to Blackheath.”"}],[{"start":34.849999999999994,"text":"That didn’t sound right. Was there a train from Liverpool Street to Blackheath? Google Maps and Citymapper suggested getting to Blackheath from Charing Cross or Waterloo."}],[{"start":44.74999999999999,"text":"Me: “Are you sure? I’d suggest the Circle or Bakerloo to Charing Cross.”"}],[{"start":49.849999999999994,"text":"He frowned for a moment and pulled out his phone. “No, ChatGPT says that ‘The Circle Line is not a good choice on marathon day. It will be too crowded. There are too many stops and too many steps. It’s a route for tourists, not for runners.’”"}],[{"start":65.3,"text":"I checked Google Maps. Sure enough, there is no train from Liverpool Street to Blackheath. ChatGPT’s recommendation would leave him stranded, trying to catch a bus over the marathon route, then trying to get on to the train from Charing Cross at a busy London Bridge. I told him that sounded like a bad idea. He frowned again and typed another query into his phone. “Oh, you’re right. ChatGPT says, ‘Correction: take the Elizabeth Line straight to London Bridge.’”"}],[{"start":95.6,"text":"Me: “The Elizabeth Line doesn’t go to London Bridge.”"}],[{"start":99.85,"text":"You’ve heard tales of artificial intelligence hallucinations before, but it’s not the AI that fascinates here: it’s the human. "}],[{"start":107.5,"text":"The route-finding algorithm on Google Maps is a minor miracle. It will solve a complex optimisation problem across multiple modes of transport, taking into account real-time congestion or delays, and it’s been available on smartphones and browsers for years. It is a proven, practical example of AI in action. So on marathon day, when the stakes are high and the clock is ticking, why would anyone turn instead to a fancy word-guessing machine such as ChatGPT?"}],[{"start":137.5,"text":"Perhaps it’s that ChatGPT seems so human. It served up an uncanny impersonation of a friendly and knowledgeable local guide. The Circle Line? Pfft, it’s fine for tourists but you’re a marathon runner: think about all those steps! (It’s true, the creaky old Circle Line does have steps.) "}],[{"start":155.85,"text":"Part of the bot patter reminded me of clickbait ads: INSURANCE COMPANIES HATE THIS LOOPHOLE! ChatGPT wasn’t just giving a route, but giving a rationale, even explaining why we shouldn’t listen to the lamestream advice of Google Maps. This is the approach of a confidence trickster."}],[{"start":174.15,"text":"In the introduction to her book The Confidence Game, psychologist Maria Konnikova explains: “The true con artist doesn’t force us to do anything: he makes us complicit in our own undoing . . . we believe because we want to.” One difference between the con artist and the large language model (LLM) is that the con artist knows the truth and is trying to conceal it. One similarity between the con artist and the LLM is that both of them have perfected seeming plausible."}],[{"start":204.35,"text":"A recent paper in Nature finds that when LLMs are trained to be warm and friendly, they also produce dramatically less accurate answers, “promoting conspiracy theories, providing inaccurate factual information and offering incorrect medical advice”. That sounds bad. I’d suggest that the reality is worse: the sycophantic AI not only produces mistakes, it persuades us to believe them. "}],[{"start":227.85,"text":"In 1950 Alan Turing, the mathematician and visionary of the computer age, famously proposed an “imitation game” in which a human judge would communicate through a teleprompter with a human and a computer. The computer’s job was to imitate human conversation convincingly enough to persuade the judge. "}],[{"start":246.29999999999998,"text":"Turing’s test remains intriguing, but there is a longstanding difficulty: the fallibility of the judge. A primitive 1960s chatbot, Eliza, responded like a parody of a therapist (“How does that make you feel?” “Why do you feel sad?” “Please go on.”). People lapped it up; it’s nice to feel listened to. A 1980s chatbot, MGonz, just fired off insults and was perfectly plausible, partly because insults are simple to deliver and mostly because they prompt rage rather than reflection in the human recipient. And Robert Epstein, an expert in the Turing Test, has written entertainingly about how he was fooled into a four-month correspondence with a sexy Russian lady who was, in fact, a 2006-era chatbot. None of these bots had a thousandth of the sophistication of a modern LLM, but they didn’t need it: when humans are sad, angry or amorous, we aren’t very sophisticated judges, either."}],[{"start":307.2,"text":"We are all going to find ourselves in strange variations of the Turing Test in years to come, and I wonder if we are up to it. And not just us, but those with power over us. As Cory Doctorow, author of Enshittification, is fond of observing: you won’t be replaced because an AI can do your job, you’ll be replaced because an AI salesman convinces your boss that it can. If my journey to the marathon start line is any guide, that salesman will have an easy job."}],[{"start":334.34999999999997,"text":"The capabilities of modern AI are impressive. But what determines whether we use it is not the capability, but the impressiveness. They are correlated but they are not the same thing."}],[{"start":344.9,"text":"There’s a tale about the French poet Jacques Prévert seeing a fellow begging for change on the streets of Venice with a sign that read “Blind man without a pension”."}],[{"start":354.25,"text":"Prévert stopped to chat to him; not many people were moved to contribute, and Prévert offered to write a new sign."}],[{"start":360.55,"text":"The next day, he returned to find the man overjoyed. “It’s incredible; I’ve never received so much money in my life.” "}],[{"start":367.6,"text":"Prévert had written: “Spring is coming, but I won’t see it.” "}],[{"start":371.20000000000005,"text":"The new sign contained no news — in fact, it was less informative than the old. But it told a story. Google Maps was the first sign: it told me where to get my train. ChatGPT was the second sign: it told my companion not just where to go, but how to feel about taking such a clever route."}],[{"start":390.20000000000005,"text":"I left him at Paddington, urging him not to try to take the non-existent Elizabeth Line train to London Bridge. I am not sure I was as convincing as ChatGPT."}],[{"start":401.15000000000003,"text":"Tim Harford ran the London Marathon in support of the Teenage Cancer Trust: tinyurl.com/HarfordMarathon  "}],[{"start":410.1,"text":"Find out about our latest stories first — follow FT Weekend Magazine on X and FT Weekend on Instagram"}],[{"start":421.45000000000005,"text":""}]],"url":"https://audio.ftcn.net.cn/album/a_1779022147_1460.mp3"}

版权声明:本文版权归FT中文网所有,未经允许任何单位或个人不得转载,复制或以任何其他方式使用本文全部或部分,侵权必究。

现代战争的血腥一如往昔

科技的进步并没有减少俄乌战争中的伤亡,武装无人机和AI正把前线变成险恶的杀戮地带,惨烈程度堪比一战。

帕拉贝利斯医药公司于与再生元达成交易次日披露IPO计划,上市热潮升温

成立已有十年且资金雄厚、从格雷格•维尔丁在哈佛实验室孵化出的“不可成药”生物技术公司——帕拉贝利斯医药公司,正寻求成为今年第12家进行首次公开募股的药物研发企业
16小时前

英伟达部署900亿美元助推AI繁荣

黄仁勋正成为依赖其芯片的AI相关公司的最大资助者之一。这些支出涉及逾145家公司,从AI模型开发商、云服务提供商到基础设施供应商不一而足。

Lex专栏:股市投资者信心爆棚,但现金见底

鉴于标普500指数高度依赖以人工智能为驱动的公司,股市出现小问题和大问题的可能性都很大。

FT社评:埃博拉疫情暴露全球应对大流行病准备不足

援助资金减少以及特朗普政府对全球公共卫生理念的敌意,正危及我们所有人。

“四大”急聘AI专业人才,岗位数量盖过传统审计师

全球最大的几家会计师事务所正竞相适应颠覆性的技术变革。
设置字号×
最小
较小
默认
较大
最大
分享×