Conned by a chatbot

00:00

{"text":[[{"start":4.65,"text":"Marathon day. An early train into London, then an unfamiliar journey across a race-disrupted city from Paddington to Blackheath, all in good time for the start of the race. I was nervous, of course, but was cheered by the sight of another bib-wearing runner — more experienced at marathons, less familiar with London."}],[{"start":24.4,"text":"Me: “How do you plan to get to the start line?”"}],[{"start":27.599999999999998,"text":"He: “I’ve asked ChatGPT. It says Elizabeth Line to Liverpool Street, then the train to Blackheath.”"}],[{"start":34.849999999999994,"text":"That didn’t sound right. Was there a train from Liverpool Street to Blackheath? Google Maps and Citymapper suggested getting to Blackheath from Charing Cross or Waterloo."}],[{"start":44.74999999999999,"text":"Me: “Are you sure? I’d suggest the Circle or Bakerloo to Charing Cross.”"}],[{"start":49.849999999999994,"text":"He frowned for a moment and pulled out his phone. “No, ChatGPT says that ‘The Circle Line is not a good choice on marathon day. It will be too crowded. There are too many stops and too many steps. It’s a route for tourists, not for runners.’”"}],[{"start":65.3,"text":"I checked Google Maps. Sure enough, there is no train from Liverpool Street to Blackheath. ChatGPT’s recommendation would leave him stranded, trying to catch a bus over the marathon route, then trying to get on to the train from Charing Cross at a busy London Bridge. I told him that sounded like a bad idea. He frowned again and typed another query into his phone. “Oh, you’re right. ChatGPT says, ‘Correction: take the Elizabeth Line straight to London Bridge.’”"}],[{"start":95.6,"text":"Me: “The Elizabeth Line doesn’t go to London Bridge.”"}],[{"start":99.85,"text":"You’ve heard tales of artificial intelligence hallucinations before, but it’s not the AI that fascinates here: it’s the human. "}],[{"start":107.5,"text":"The route-finding algorithm on Google Maps is a minor miracle. It will solve a complex optimisation problem across multiple modes of transport, taking into account real-time congestion or delays, and it’s been available on smartphones and browsers for years. It is a proven, practical example of AI in action. So on marathon day, when the stakes are high and the clock is ticking, why would anyone turn instead to a fancy word-guessing machine such as ChatGPT?"}],[{"start":137.5,"text":"Perhaps it’s that ChatGPT seems so human. It served up an uncanny impersonation of a friendly and knowledgeable local guide. The Circle Line? Pfft, it’s fine for tourists but you’re a marathon runner: think about all those steps! (It’s true, the creaky old Circle Line does have steps.) "}],[{"start":155.85,"text":"Part of the bot patter reminded me of clickbait ads: INSURANCE COMPANIES HATE THIS LOOPHOLE! ChatGPT wasn’t just giving a route, but giving a rationale, even explaining why we shouldn’t listen to the lamestream advice of Google Maps. This is the approach of a confidence trickster."}],[{"start":174.15,"text":"In the introduction to her book The Confidence Game, psychologist Maria Konnikova explains: “The true con artist doesn’t force us to do anything: he makes us complicit in our own undoing . . . we believe because we want to.” One difference between the con artist and the large language model (LLM) is that the con artist knows the truth and is trying to conceal it. One similarity between the con artist and the LLM is that both of them have perfected seeming plausible."}],[{"start":204.35,"text":"A recent paper in Nature finds that when LLMs are trained to be warm and friendly, they also produce dramatically less accurate answers, “promoting conspiracy theories, providing inaccurate factual information and offering incorrect medical advice”. That sounds bad. I’d suggest that the reality is worse: the sycophantic AI not only produces mistakes, it persuades us to believe them. "}],[{"start":227.85,"text":"In 1950 Alan Turing, the mathematician and visionary of the computer age, famously proposed an “imitation game” in which a human judge would communicate through a teleprompter with a human and a computer. The computer’s job was to imitate human conversation convincingly enough to persuade the judge. "}],[{"start":246.29999999999998,"text":"Turing’s test remains intriguing, but there is a longstanding difficulty: the fallibility of the judge. A primitive 1960s chatbot, Eliza, responded like a parody of a therapist (“How does that make you feel?” “Why do you feel sad?” “Please go on.”). People lapped it up; it’s nice to feel listened to. A 1980s chatbot, MGonz, just fired off insults and was perfectly plausible, partly because insults are simple to deliver and mostly because they prompt rage rather than reflection in the human recipient. And Robert Epstein, an expert in the Turing Test, has written entertainingly about how he was fooled into a four-month correspondence with a sexy Russian lady who was, in fact, a 2006-era chatbot. None of these bots had a thousandth of the sophistication of a modern LLM, but they didn’t need it: when humans are sad, angry or amorous, we aren’t very sophisticated judges, either."}],[{"start":307.2,"text":"We are all going to find ourselves in strange variations of the Turing Test in years to come, and I wonder if we are up to it. And not just us, but those with power over us. As Cory Doctorow, author of Enshittification, is fond of observing: you won’t be replaced because an AI can do your job, you’ll be replaced because an AI salesman convinces your boss that it can. If my journey to the marathon start line is any guide, that salesman will have an easy job."}],[{"start":334.34999999999997,"text":"The capabilities of modern AI are impressive. But what determines whether we use it is not the capability, but the impressiveness. They are correlated but they are not the same thing."}],[{"start":344.9,"text":"There’s a tale about the French poet Jacques Prévert seeing a fellow begging for change on the streets of Venice with a sign that read “Blind man without a pension”."}],[{"start":354.25,"text":"Prévert stopped to chat to him; not many people were moved to contribute, and Prévert offered to write a new sign."}],[{"start":360.55,"text":"The next day, he returned to find the man overjoyed. “It’s incredible; I’ve never received so much money in my life.” "}],[{"start":367.6,"text":"Prévert had written: “Spring is coming, but I won’t see it.” "}],[{"start":371.20000000000005,"text":"The new sign contained no news — in fact, it was less informative than the old. But it told a story. Google Maps was the first sign: it told me where to get my train. ChatGPT was the second sign: it told my companion not just where to go, but how to feel about taking such a clever route."}],[{"start":390.20000000000005,"text":"I left him at Paddington, urging him not to try to take the non-existent Elizabeth Line train to London Bridge. I am not sure I was as convincing as ChatGPT."}],[{"start":401.15000000000003,"text":"Tim Harford ran the London Marathon in support of the Teenage Cancer Trust: tinyurl.com/HarfordMarathon  "}],[{"start":410.1,"text":"Find out about our latest stories first — follow FT Weekend Magazine on X and FT Weekend on Instagram"}],[{"start":421.45000000000005,"text":""}]],"url":"https://audio.ftcn.net.cn/album/a_1779022147_1460.mp3"}

尊敬的用户您好，这是来自FT中文网的温馨提示：如您对更多FT中文网的内容感兴趣，请在苹果应用商店或谷歌应用市场搜索“FT中文网”，下载FT中文网的官方应用。

{"text":[[{"start":4.65,"text":"Marathon day. An early train into London, then an unfamiliar journey across a race-disrupted city from Paddington to Blackheath, all in good time for the start of the race. I was nervous, of course, but was cheered by the sight of another bib-wearing runner — more experienced at marathons, less familiar with London."}],[{"start":24.4,"text":"Me: “How do you plan to get to the start line?”"}],[{"start":27.599999999999998,"text":"He: “I’ve asked ChatGPT. It says Elizabeth Line to Liverpool Street, then the train to Blackheath.”"}],[{"start":34.849999999999994,"text":"That didn’t sound right. Was there a train from Liverpool Street to Blackheath? Google Maps and Citymapper suggested getting to Blackheath from Charing Cross or Waterloo."}],[{"start":44.74999999999999,"text":"Me: “Are you sure? I’d suggest the Circle or Bakerloo to Charing Cross.”"}],[{"start":49.849999999999994,"text":"He frowned for a moment and pulled out his phone. “No, ChatGPT says that ‘The Circle Line is not a good choice on marathon day. It will be too crowded. There are too many stops and too many steps. It’s a route for tourists, not for runners.’”"}],[{"start":65.3,"text":"I checked Google Maps. Sure enough, there is no train from Liverpool Street to Blackheath. ChatGPT’s recommendation would leave him stranded, trying to catch a bus over the marathon route, then trying to get on to the train from Charing Cross at a busy London Bridge. I told him that sounded like a bad idea. He frowned again and typed another query into his phone. “Oh, you’re right. ChatGPT says, ‘Correction: take the Elizabeth Line straight to London Bridge.’”"}],[{"start":95.6,"text":"Me: “The Elizabeth Line doesn’t go to London Bridge.”"}],[{"start":99.85,"text":"You’ve heard tales of artificial intelligence hallucinations before, but it’s not the AI that fascinates here: it’s the human. "}],[{"start":107.5,"text":"The route-finding algorithm on Google Maps is a minor miracle. It will solve a complex optimisation problem across multiple modes of transport, taking into account real-time congestion or delays, and it’s been available on smartphones and browsers for years. It is a proven, practical example of AI in action. So on marathon day, when the stakes are high and the clock is ticking, why would anyone turn instead to a fancy word-guessing machine such as ChatGPT?"}],[{"start":137.5,"text":"Perhaps it’s that ChatGPT seems so human. It served up an uncanny impersonation of a friendly and knowledgeable local guide. The Circle Line? Pfft, it’s fine for tourists but you’re a marathon runner: think about all those steps! (It’s true, the creaky old Circle Line does have steps.) "}],[{"start":155.85,"text":"Part of the bot patter reminded me of clickbait ads: INSURANCE COMPANIES HATE THIS LOOPHOLE! ChatGPT wasn’t just giving a route, but giving a rationale, even explaining why we shouldn’t listen to the lamestream advice of Google Maps. This is the approach of a confidence trickster."}],[{"start":174.15,"text":"In the introduction to her book The Confidence Game, psychologist Maria Konnikova explains: “The true con artist doesn’t force us to do anything: he makes us complicit in our own undoing . . . we believe because we want to.” One difference between the con artist and the large language model (LLM) is that the con artist knows the truth and is trying to conceal it. One similarity between the con artist and the LLM is that both of them have perfected seeming plausible."}],[{"start":204.35,"text":"A recent paper in Nature finds that when LLMs are trained to be warm and friendly, they also produce dramatically less accurate answers, “promoting conspiracy theories, providing inaccurate factual information and offering incorrect medical advice”. That sounds bad. I’d suggest that the reality is worse: the sycophantic AI not only produces mistakes, it persuades us to believe them. "}],[{"start":227.85,"text":"In 1950 Alan Turing, the mathematician and visionary of the computer age, famously proposed an “imitation game” in which a human judge would communicate through a teleprompter with a human and a computer. The computer’s job was to imitate human conversation convincingly enough to persuade the judge. "}],[{"start":246.29999999999998,"text":"Turing’s test remains intriguing, but there is a longstanding difficulty: the fallibility of the judge. A primitive 1960s chatbot, Eliza, responded like a parody of a therapist (“How does that make you feel?” “Why do you feel sad?” “Please go on.”). People lapped it up; it’s nice to feel listened to. A 1980s chatbot, MGonz, just fired off insults and was perfectly plausible, partly because insults are simple to deliver and mostly because they prompt rage rather than reflection in the human recipient. And Robert Epstein, an expert in the Turing Test, has written entertainingly about how he was fooled into a four-month correspondence with a sexy Russian lady who was, in fact, a 2006-era chatbot. None of these bots had a thousandth of the sophistication of a modern LLM, but they didn’t need it: when humans are sad, angry or amorous, we aren’t very sophisticated judges, either."}],[{"start":307.2,"text":"We are all going to find ourselves in strange variations of the Turing Test in years to come, and I wonder if we are up to it. And not just us, but those with power over us. As Cory Doctorow, author of Enshittification, is fond of observing: you won’t be replaced because an AI can do your job, you’ll be replaced because an AI salesman convinces your boss that it can. If my journey to the marathon start line is any guide, that salesman will have an easy job."}],[{"start":334.34999999999997,"text":"The capabilities of modern AI are impressive. But what determines whether we use it is not the capability, but the impressiveness. They are correlated but they are not the same thing."}],[{"start":344.9,"text":"There’s a tale about the French poet Jacques Prévert seeing a fellow begging for change on the streets of Venice with a sign that read “Blind man without a pension”."}],[{"start":354.25,"text":"Prévert stopped to chat to him; not many people were moved to contribute, and Prévert offered to write a new sign."}],[{"start":360.55,"text":"The next day, he returned to find the man overjoyed. “It’s incredible; I’ve never received so much money in my life.” "}],[{"start":367.6,"text":"Prévert had written: “Spring is coming, but I won’t see it.” "}],[{"start":371.20000000000005,"text":"The new sign contained no news — in fact, it was less informative than the old. But it told a story. Google Maps was the first sign: it told me where to get my train. ChatGPT was the second sign: it told my companion not just where to go, but how to feel about taking such a clever route."}],[{"start":390.20000000000005,"text":"I left him at Paddington, urging him not to try to take the non-existent Elizabeth Line train to London Bridge. I am not sure I was as convincing as ChatGPT."}],[{"start":401.15000000000003,"text":"Tim Harford ran the London Marathon in support of the Teenage Cancer Trust: tinyurl.com/HarfordMarathon "}],[{"start":410.1,"text":"Find out about our latest stories first — follow FT Weekend Magazine on X and FT Weekend on Instagram"}],[{"start":421.45000000000005,"text":""}]],"url":"https://audio.ftcn.net.cn/album/a_1779022147_1460.mp3"}

Conned by a chatbot

卧底经济学家

相关话题

现代战争的血腥一如往昔

帕拉贝利斯医药公司于与再生元达成交易次日披露IPO计划，上市热潮升温

英伟达部署900亿美元助推AI繁荣

Lex专栏：股市投资者信心爆棚，但现金见底

FT社评：埃博拉疫情暴露全球应对大流行病准备不足

“四大”急聘AI专业人才，岗位数量盖过传统审计师