{"id":198354,"date":"2025-06-19T23:16:09","date_gmt":"2025-06-19T23:16:09","guid":{"rendered":"https:\/\/www.europesays.com\/uk\/198354\/"},"modified":"2025-06-19T23:16:09","modified_gmt":"2025-06-19T23:16:09","slug":"boffins-devise-voice-altering-tech-to-jam-vishing-ploys-the-register","status":"publish","type":"post","link":"https:\/\/www.europesays.com\/uk\/198354\/","title":{"rendered":"Boffins devise voice-altering tech to jam &#8216;vishing&#8217; ploys \u2022 The Register"},"content":{"rendered":"<p>Researchers based in Israel and India have developed a defense against automated call scams.<\/p>\n<p>ASRJam is a speech recognition jamming system that uses a sound modification algorithm called EchoGuard to apply natural audio perturbations to the voice of a person speaking on the phone. It&#8217;s capable of subtly distorting human speech in a way that baffles most speech recognition systems but not human listeners.<\/p>\n<p>The tech is needed because recent advances in machine learning, text-to-speech (TTS), and automatic speech recognition (ASR) have made it quite easy to automatically make phone calls with the intent to scam or defraud.<\/p>\n<p>These &#8220;vishing&#8221; attacks \u2013 like email-based phishing but using voice instead of text \u2013 see criminals and scammers use TTS to create a realistic-sounding voice that speaks words they hope will lure victims. If the recipient of a call responds, the crook&#8217;s ASR system attempts to convert their vocal response to text, so the back-end model can decipher what was said, devise a response, and conduct a conversation long enough to elicit sensitive information or prompt the victim to take an action.<\/p>\n<p>Vishing increased 442 percent between the first and second half of 2024, according to CrowdStrike&#8217;s <a target=\"_blank\" rel=\"nofollow noopener\" href=\"https:\/\/www.crowdstrike.com\/en-us\/global-threat-report\/\">2025 Global Threat Report<\/a>. During the first half of that year, the US Federal Trade Commission <a target=\"_blank\" href=\"https:\/\/www.theregister.com\/2024\/02\/08\/sorry_scammers_the_fcc_says\/\" rel=\"noopener\">said<\/a> that using AI-generated voices for phone calls is illegal.<\/p>\n<p>As Crystal Morin, former intelligence analyst for the US Air Force and cybersecurity strategist at infosec vendor Sysdig, <a target=\"_blank\" href=\"https:\/\/www.theregister.com\/2024\/12\/29\/llm_supply_chain_attacks\/\" rel=\"noopener\">told The Register<\/a> in December 2024, voice-based phishing is becoming harder to detect as AI models get better.<\/p>\n<p>Freddie Grabovski (Ben-Gurion University of the Negev), Gilad Gressel (Amrita Vishwa Vidyapeetham), and Yisroel Mirsky (Ben-Gurion University of the Negev) have come up with a defense against vishing, described in <a target=\"_blank\" rel=\"nofollow noopener\" href=\"https:\/\/arxiv.org\/abs\/2506.11125\">a pre-print paper<\/a> titled &#8220;ASRJam: Human-Friendly AI Speech Jamming to Prevent Automated Phone Scams.&#8221;<\/p>\n<p>They argue that the ASR component of the scammers&#8217; setups represents the weakest link.<\/p>\n<p>&#8220;Our key insight is that by disrupting ASR performance, we can break the attack chain,&#8221; they explain in their paper. &#8220;To this end, we propose a proactive defense framework based on universal adversarial perturbations, carefully crafted noise added to the audio signal that confuses ASR systems while leaving human comprehension intact.&#8221;<\/p>\n<p>The researchers say they believe they&#8217;re the first to propose a proactive defense against automated voice scams that&#8217;s practical enough to deploy.<\/p>\n<p>ASRJam defends against vishing by running the EchoGuard algorithm in real time on end-user devices. The tool is invisible to attackers, making it more difficult to circumvent.<\/p>\n<p>EchoGuard is also universal \u2013 it works, to varying degrees, against any AI model. It is also zero-query, meaning it doesn&#8217;t require sample ASR output to generate an audio perturbation capable of breaking the ASR model.<\/p>\n<p>The authors say that while other ASR jamming techniques have been proposed over the past few years (<a target=\"_blank\" rel=\"nofollow noopener\" href=\"https:\/\/ieeexplore.ieee.org\/document\/10145466\">AdvDDoS<\/a>, <a target=\"_blank\" rel=\"nofollow noopener\" href=\"https:\/\/arxiv.org\/abs\/1910.05262\">Kenansville<\/a>, and <a target=\"_blank\" rel=\"nofollow noopener\" href=\"https:\/\/www.usenix.org\/conference\/usenixsecurity23\/presentation\/wu-xinghui\">Kenku<\/a>), &#8220;none are suitable for interactive scenarios; their perturbations, though often intelligible, are perceptually harsh and impractical for interactive scenarios.&#8221;<\/p>\n<p>ASRJam is better, they argue, because EchoGuard modifies the voice in three ways: reverberation, microphone oscillation, and transient acoustic attenuation.<\/p>\n<p>By altering sound reflection characteristics, simulating microphone positioning changes, and subtle sound shortening, the researchers claim their method &#8220;strikes the best balance between clarity and pleasantness,&#8221; based on a survey they conducted with an unspecified number of participants.<\/p>\n<p>They&#8217;ve published <a target=\"_blank\" rel=\"nofollow noopener\" href=\"https:\/\/sites.google.com\/view\/impulse-response-asr-attack\/home\">a website<\/a> that includes an original speech sample and copies that have been processed with EchoGuard and other algorithms for comparison.<\/p>\n<p>The researchers evaluated ASRJam\/EchoGuard and the other techniques against three public datasets (Tedlium, SPGISpeech, and LibriSpeech) and six ASR models (DeepSpeech, Wav2Vec2, Vosk, Whisper, SpeechBrain, and IBM Watson).<\/p>\n<p>&#8220;Across the board, EchoGuard consistently outperforms all baseline jammers,&#8221; the authors state in their paper. &#8220;Our method achieves the highest attack success rate on every ASR system tested, across all datasets, with only one minor exception: <a target=\"_blank\" rel=\"nofollow noopener\" href=\"https:\/\/speechbrain.github.io\/\">SpeechBrain<\/a> (SB), where it is slightly outperformed by the others.&#8221;<\/p>\n<p>The authors say they consider this acceptable since SpeechBrain isn&#8217;t common in real-world deployments and its performance isn&#8217;t great for general ASR systems.<\/p>\n<p>They also note that all the automatic speech recog jamming techniques tested underperform against OpenAI&#8217;s Whisper model, which they suggest is better at filtering out adversarial noise because developers trained it on a particularly large set of data that included a lot of noisy samples.<\/p>\n<p>Nonetheless, EchoGuard does better than the other jammers against Whisper.<\/p>\n<p>&#8220;Importantly, while the absolute attack success rate on Whisper may seem modest (e.g., 0.14 on LibriSpeech), this still implies that 1 in 6 transcriptions is significantly corrupted, a degradation level that could be sufficient to disrupt scam conversations, especially in the context of interactive dialogue where misrecognition of key terms or intents can derail an LLM\u2019s generation,&#8221; they claim.<\/p>\n<p>Lead researcher Grabovski told The Register that he believes future work will improve how ASRJam and EchoGuard perform against Whisper.<\/p>\n<p>&#8220;ASRJam is currently a research project, but we&#8217;re actively working on improvements with the goal of commercializing it in the near future,&#8221; he said. \u00ae<\/p>\n","protected":false},"excerpt":{"rendered":"Researchers based in Israel and India have developed a defense against automated call scams. ASRJam is a speech&hellip;\n","protected":false},"author":2,"featured_media":198355,"comment_status":"","ping_status":"","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[3163],"tags":[323,1942,53,16,15],"class_list":{"0":"post-198354","1":"post","2":"type-post","3":"status-publish","4":"format-standard","5":"has-post-thumbnail","7":"category-artificial-intelligence","8":"tag-ai","9":"tag-artificial-intelligence","10":"tag-technology","11":"tag-uk","12":"tag-united-kingdom"},"share_on_mastodon":{"url":"https:\/\/pubeurope.com\/@uk\/114712581457267736","error":""},"_links":{"self":[{"href":"https:\/\/www.europesays.com\/uk\/wp-json\/wp\/v2\/posts\/198354","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.europesays.com\/uk\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.europesays.com\/uk\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.europesays.com\/uk\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/www.europesays.com\/uk\/wp-json\/wp\/v2\/comments?post=198354"}],"version-history":[{"count":0,"href":"https:\/\/www.europesays.com\/uk\/wp-json\/wp\/v2\/posts\/198354\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.europesays.com\/uk\/wp-json\/wp\/v2\/media\/198355"}],"wp:attachment":[{"href":"https:\/\/www.europesays.com\/uk\/wp-json\/wp\/v2\/media?parent=198354"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.europesays.com\/uk\/wp-json\/wp\/v2\/categories?post=198354"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.europesays.com\/uk\/wp-json\/wp\/v2\/tags?post=198354"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}