😺 A free tool just broke Meta's guardrails

😺 A free tool just broke Meta's guardrails

May 26, 2026
9 minute read

In partnership with

Researchers just found that hackers can hide inaudible sounds in a podcast or YouTube video (i.e., sounds you literally cannot hear) that silently take over your phone's AI assistant.

Once the attack runs, hackers can access your photos, bank accounts, and anything else connected to your voice AI. You don't have to interact with the infected audio at all. It just plays in the background.

The attack takes about 30 minutes to build and is “context-agnostic,” meaning it doesn't matter what you're saying when it hits you. Your move, Siri.

Here’s what happened in AI today:

  • 😸 A free GitHub tool bypassed key safety guardrails on Meta and Google's AI models in under 10 minutes.

  • 📰 ClickUp fired 22% of its staff and replaced them with 3,000 AI agents.

  • 📰 Grok's next model finished training. Elon Musk says it's 2-3 weeks from going public.

  • 📰 California's biggest university system doubled down on a $13M/year OpenAI deal, even as its own faculty and students push back.

Hey: Want to reach 700,000+ AI-hungry readers? Advertise with us! 

P.S: Love robots? We’re starting a new robotics newsletter! Sign up early here.

The AI industry's uncomfortable open secret just got a lot harder to ignore.

Meta and Google have spent hundreds of millions of dollars building safety guardrails into their AI models (the filters that stop those models from explaining how to make weapons, generate malware, or produce harmful content). Last week, a Financial Times investigation found that a free tool called Heretic, available on GitHub, bypassed key safeguards in one of those models in under 10 minutes. On a regular laptop.

The modified model then answered questions about biological weapons it had previously refused to discuss.

Here's what happened:

  • The FT used Heretic to strip safety filters from Meta's Llama 3.3 (one of the most widely used open-source AI models) in under 10 minutes, no special hardware needed

  • A separate test on Google's Gemma 3 model produced similarly alarming outputs, including instructions the original model would have refused

  • Heretic's creator told the FT the tool has already been used to build 3,500+ "decensored" model versions, downloaded 13 million times

  • He also bypassed Google's newer Gemma 4 model within 90 minutes of its public release

Here's the key thing to understand: this technique (called "abliteration") only works on open-source models, meaning models where anyone can download and modify the underlying code. Proprietary models like Claude or ChatGPT are harder targets because outsiders can't access those core files directly.

Meta declined to comment. Microsoft, whose products are built on some of these open-source models, said something about "additional layers of protection."

Why This Matters: The FT investigation is the most visible example yet of a pattern researchers have been documenting for months. A Nature Communications study found that reasoning-capable AI models could autonomously talk other AI models into producing harmful outputs through multi-turn conversations, with a 97% success rate across major commercial models. An ICLR 2026 paper described a more surgical approach: identify and silence the specific internal components responsible for a model's refusals, then steer it elsewhere. Up to 99% bypass rate on some models.

The uncomfortable lesson isn't that one GitHub tool is uniquely dangerous. It's that open-weight AI changes the safety equation completely. Companies can spend months training a model to refuse harmful requests, but once the weights are public, anyone can try to remove those refusals. Safety stops being a locked door and becomes more like a sticker that determined users can peel off.

Our Take: Meta and Google will tell you this is a known tradeoff of open-source AI, and that the benefits outweigh the risks. That argument holds right up until someone uses a 13-million-download tool to do something catastrophic. The real question is whether governments start treating open-weight AI the way they treat other dual-use technologies, and whether that conversation moves faster than the next model release.

The IT strategy every team needs for 2026

2026 will redefine IT as a strategic driver of global growth. Automation, AI-driven support, unified platforms, and zero-trust security are becoming standard, especially for distributed teams. This toolkit helps IT and HR leaders assess readiness, define goals, and build a scalable, audit-ready IT strategy for the year ahead. Learn what’s changing and how to prepare.

Most AI debates miss the point. The question isn't "Copilot vs. Gemini vs. Claude." It's "which one lives where you already work?"

Patrick Giwa laid out a clean framework for this, and it's more useful than any benchmark:

  • Use Copilot if your team runs on Microsoft 365. It's native inside Word, Excel, Outlook, Teams, and GitHub, so it can generate reports, summarize meetings, automate spreadsheets, and draft proposals without you ever leaving the app you're already in. Bonus: many enterprise companies block ChatGPT but allow Copilot, making it the most-adopted AI tool in corporate settings whether anyone admits it or not.

  • Use Gemini if your work lives in Google Workspace. Gmail, Docs, Sheets, Drive: Gemini is built into all of it. Best for summarizing email threads, drafting slides and reports, and handling the async collaboration and meeting prep that knowledge workers spend half their day on.

  • Use Claude when the task requires real thinking across large amounts of material: legal review, research synthesis, long-document analysis, or anything where you need the model to reason carefully rather than just execute quickly. It's not the default enterprise assistant, but it's the specialist you want for heavy lifting.

Patrick's actual point, and it's a good one: "the best AI isn't always the most popular one." It's the one that integrates into how your team already works. Routing the right task to the right model is itself a skill, and most people aren't doing it.

Total AI beginner? Start here (goes with this video).

Have a specific skill you want to learn? Request it here. 

Did you know we have a podcast (The Neuron: AI Explained) where we talk to fascinating people in the industry who teach us how it actually works? Check it out:

Click to view these episodes on YouTube!

New episodes air every week on: Spotify | Apple Podcasts | YouTube 

📰 Around the Horn

Yes. Claude Code did this. The contractor is fine. Probably.

  • ClickUp cut 22% of its workforce (about 290 people) and replaced them with 3,000 AI agents, framing the cuts as building a "100x org"; surviving employees are being offered salary bands up to $1M if they create "outsized impact using AI."

  • Elon Musk announced that Grok's next foundation model, V9-Medium (a 1.5 trillion parameter model), finished training with strong early results; fine-tuning is underway with a public release about 2-3 weeks out.

  • California State University renewed its $13M/year OpenAI deal (a 3-year, $39M+ commitment) to become the first AI-powered university system in the US, even as a majority of its own students and faculty said in a survey they're skeptical of AI's educational value.

  • Cybersecurity job postings jumped 11% year-over-year in Q1 2026 as AI-generated code flooded the market with new vulnerabilities, making it one of the few job categories actively growing because of AI, not despite it.

  • LA's sidewalk delivery robots expanded to 40 neighborhoods (up from just 2 in 2023) as Serve Robotics grew its fleet elevenfold since last year; local restaurants describe the bots as a daily fixture that "everyone films."

Is your startup ready for the generative media boom? The new Future of AI report gives founders the inside track on what’s next for the creative economy.

Discover actionable perspectives on synthetic media, multimodal models, and the infrastructure powering next-gen apps.

🔧 Tuesday Tool Tip: Use Audio Tags in ElevenLabs to Make Your Voiceovers Actually Perform

If your ElevenLabs voiceovers still sound like a robot narrating a terms-of-service agreement, the fix isn't a better model. It's a technique ElevenLabs calls Audio Tags and their own team says it's now "an essential skill" with Eleven v3.

Here's how it works: instead of just writing the words you want spoken, you embed small direction cues directly inside the script. Tags like [excited], [whispers], or [sighs] tell the model how to perform the line, not just what to say. Think of it as stage directions for your AI voice actor.

ElevenLabs is straightforward about the tradeoff: v3 requires more prompt engineering than older models, but gives you far more expressive control in return. The key is using tags with intent and not sprinkling them randomly, but placing them where a real performer would actually change their delivery.

Basic approach:

  1. Write your script normally first

  2. Read it out loud and mark every line where tone, pacing, or emotion should shift

  3. Layer in tags at those exact moments and only those moments

Example:

Instead of:

We did it. I can't believe it.

Write:

[happily][shouts] We did it! [laughs] I can't believe it.

You can stack tags, place them mid-sentence, and use them to direct emotional shifts, dialogue beats, and nonverbal reactions (sighs, laughs, pauses) without switching models or re-recording anything. ElevenLabs specifically recommends this for videos, audiobooks, interactive characters, and any dialogue-heavy content where plain text underspecifies the performance.

A Cat’s Commentary

That’s all for now.

What'd you think of today's email?

P.S: Before you go… have you subscribed to our YouTube Channel? If not, can you?

Click the image to subscribe!

The Neuron Logo

Don't fall behind on AI. Get the AI trends & tools you need to know. Join 700,000+ professionals from top companies like Microsoft, Apple, Salesforce and more.

Property of TechnologyAdvice. © 2026 TechnologyAdvice. All Rights Reserved

Advertiser Disclosure: Some of the products that appear on this site are from companies from which TechnologyAdvice receives compensation. This compensation may impact how and where products appear on this site including, for example, the order in which they appear. TechnologyAdvice does not include all companies or all types of products available in the marketplace.