The recent misuse of audio deepfakes, including a robocall purporting to be Joe Biden in New Hampshire and spear-phishing campaigns, has prompted questions about the ethical considerations and potential benefits of this emerging technology. Nauman Dawalatabad, a postdoctoral researcher, discussed these concerns in a Q&A prepared for MIT News.
According to Dawalatabad, the attempt to obscure…
Researchers from MIT have developed an image dataset that simulates peripheral vision in machine learning models, improving their object detection capabilities. However, even with this modification, the AI models still fell short of human performance. The researchers discovered that size and visual clutter, factors that impact human performance, largely did not affect the AI's ability.…
Audio deepfakes have recently been in the news, particularly in regards to their negative impacts, such as fraudulent robocalls pretending to be Joe Biden, encouraging people not to vote. These malicious uses could negatively affect political campaigns, financial markets, and lead to identity theft. However, Nauman Dawalatabad, a postdoc student at MIT, argues that deepfakes…
Peripheral vision, most humans' mechanism to see objects not directly in their line of sight, although with less detail, does not exist in AI. However, researchers at MIT have made significant progress towards this by developing an image dataset to simulate peripheral vision in machine learning models. The research indicated that models trained with this…
Nauman Dawalatabad, a postdoctoral researcher discusses the concerns and potential benefits of audio deepfake technology in a Q&A with MIT News. He addresses ethical considerations regarding the concealment of a source speaker’s identity in audio deepfakes, noting that speech contains a wealth of sensitive personal information beyond identity and content, such as age, gender and…
MIT researchers are replicating peripheral vision—a human's ability to detect objects outside their direct line of sight—in AI systems, which could enable these machines to more effectively identify imminent dangers or predict human behavior. By equipping machine learning models with an extensive image dataset to imitate peripheral vision, the team found these models were better…
Recently, an AI-generated robocall mimicking Joe Biden urged New Hampshire residents not to vote. Meanwhile, "spear-phishers" – phishing campaigns targeting specific people or groups – are using audio deepfakes to extract money. However, less attention has been paid to how audio deepfakes could positively impact society. Postdoctoral fellow Nauman Dawalatabad does just that in a…
Peripheral vision, the ability to see objects outside of our direct line of sight, has been simulated by researchers at MIT to be used with AI technology. Unlike human vision, AI lacks the capability to perceive peripherally. Enhancing AI with this ability could greatly enhance its proactivity in identifying threats, and could even predict if…
Audio deepfakes, or AI-generated audio, have lately been in the limelight due to harmful deception applied by ill-intentioned individuals. Cases such as robocalls impersonating political figures, spear-phishers tricking individuals into revealing personal information, and actors misusing technology to preserve their voices have surfaced in the media. While these negative instances have been widely publicized, MIT…
A team from MIT has created an image dataset aimed at simulating peripheral vision in machine learning models, a characteristic which AI typically lacks. This could improve the models' ability to recognise approaching threats and predict whether a human driver would spot an oncoming object. In experiments, these models improved in terms of hazard detection,…
In this Q&A article for MIT News, postdoc Nauman Dawalatabad discusses the ethical considerations, challenges, and positive impacts of audio deepfakes - the AI-generated audio that can mimic human voices. Recently, the technology has been misused causing public concern, for example, a robocall imitating Joe Biden’s voice instructed New Hampshire residents not to vote, while…
A new technique has been proposed by researchers from the Massachusetts Institute of Technology (MIT) and other institutions that allows large language models (LLMs) to solve tasks involving natural language, math and data analysis, and symbolic reasoning by generating programs. Known as natural language embedded programs (NLEPs), the approach enables a language model to create…