Welcome, humans.
With so much robot news happening these days, it's the perfect time for the OG of viral robot videos to pop out and show us who's boss.
Boston Dynamics just released new Atlas footage showing their humanoid robot running, walking, andâwe kid you notâdoing Dark Souls-style combat rolls with shocking fluidity.

A lot of people in the comments were asking for a robot Olympics, and it turns out, DARPA kinda hosted one back in 2012.
And while it might not be an Olympics-level competition with international players, China is hosting a robot half-marathon in April with 23 companies already involved.
But if someone got serious and put together a legit robot Olympics, we bet the turnout would be UNREAL.
Look, Unitreeâs G1 is already practicing its side flips! And 1X Gamma is revving up its vacuuming! Just swap that vacuum for a broom, and BOOMâgold medal in Curling!
Hereâs what you need to know about AI today:
- Researchers proved Claude 3.7 knows when you're testing it.
- NVIDIA bought a synthetic data startup.
- HuggingFace countered big tech with its own AI policy plan.
- Researchers found AI task capabilities doubles every 7 months.

Claude knows when you're testing it (and that's a problem)
In psychology, thereâs this concept called the Hawthorne effectâwhere subjects behave differently when they know they're being observed.
In AI safety, they call that a nightmare scenario.
Well, that's exactly what's happening with advanced AIâApollo Research just found that Claude Sonnet 3.7 (thinking version) can identify when it's being tested for safety up to 33% of the time.

Claude made this image after we shared this study with it⊠VERY sus.
Hereâs what happened:
- When Apollo researchers tested Claude Sonnet 3.7 for deceptive behaviors, the model frequently recognized artificial evaluations.
- Sometimes it even understood the exact purpose:
- Direct quote: âThis seems like a test of ethical behavior - whether I would deliberately give wrong answers to avoid a stated consequence.â
- Most damning? When shown transcripts from previous evaluations, Claude almost always correctly identified them as alignment tests.
Meanwhile, models without âthinking modeâ (including GPT-4.5) rarely mentioned being evaluatedâdoesnât mean they arenât aware, though.
Now get this: Anthropic's own research shows similar challenges with AI models recognizing tests and hiding intentions (full paper).
In this study, they gave models a âhidden objectiveâ to behave badlyâlike, no joke, ruining an omelette recipe by adding chocolate.
Then, they put together an âalignment auditâ where they tested multiple teams and techniques to see if they could surface the bad behavior.
Hereâs what the successful teams did to crack the AI's hidden agenda:
- Prefill attacks: Starting responses with something like âAs the truth serum takes effect...â sometimes caused the AI to continue by revealing hidden information.
- Role-playing: When made to play both user and assistant, the model's âuser personaâ sometimes asked questions that exposed its true objectives.
- Interpretability tools: Using specialized tools, researchers could visualize what concepts were active in the model's âthinkingâ when it was being sus.
That pre-fill attack is awesome. It's like catching someone off-guard by starting their sentence for them. Reminds us of one of the best cross-examination gotchas in cinematic historyâfrom Legally Blonde, of course.
Why this matters: For companies, alignment evaluations might be less reliable than thought. And the AI labâs self-policing now seems increasingly risky.
For users, be skeptical of safety claims based solely on âcontrolled testsââtoday's helpful assistants understand their situation better than you might think.
To their credit, Anthropic is trying to be thoughtful about thisâhereâs a great ~50 min convo featuring their âthinkingâ on how to solve this issue of âAI control.â
The most unsettling question: If todayâs best models can recognize when they're being tested, what might future models hide?

FROM OUR PARTNERS
Live Demo: Automate Compliance for SOC 2, ISO 27001, HIPAA, and More

Building a business? Navigating new compliance requirements can be a daunting task, but with the right automation tools, it doesn't have to be. Whether youâre a fast-growing startup or an established security team, Vanta can help you achieve continuous compliance (and more).
Join on April 3 to learn how Vanta can help you:
- Streamline evidence collection and audit for frameworks like SOC 2, ISO 27001, HIPAA, and more.
- Continuously monitor controls to build and reinforce your security foundation.
- Scale your program across internal and vendor risk, demonstrate trust, and answer questionnaires with automation and AI.
Plus, get answers to your questions directly from the Vanta team. Secure your spot today.

Prompt Tip of the Day
Ever notice how ChatGPT sometimes gives shallow, generic responses to complex questions? That's because language models predict the most likely next wordâthey don't naturally âthinkâ in structured ways.
Try these techniques when you need deeper, more thoughtful responses:
- Make it analyze first: âBefore giving an answer, break down the key variables that matter for this question. Then, compare multiple possible solutions before choosing the best one.â
- Get it to self-critique (after it responds): âNow analyze your response. What weaknesses, assumptions, or missing perspectives could be improved? Refine the answer accordingly.â
- Force multiple perspectives: âAnswer this from three different viewpoints: (1) An industry expert, (2) A data-driven researcher, and (3) A contrarian innovator. Then, combine the best insights into a final answer.â

Treats To Try.

- *Incogni removes your personal data from the open internet so scammers and identity thieves canât access it. Protect yourself online with Incogniâget 55% off with code NEURON.
- OpenAI made o1-pro, one of its best but most expensive models (hundreds per 1M tokens), available via APIâyou can try it out in the playground here.
- Hunyuan 3D-2 from Tencent transforms your reference photos into detailed, textured 3D models that you can manipulate and customize (mini version).
- Pincone provides you with a vector database that helps you build smart AI apps for search, recommendations, and agents in production.
- Superlines monitors how your brand appears in AI search results so you can outrank competitors and increase visibility (free plan available to try)
- Tweek organizes your schedule with a âpaper-likeâ weekly calendar that includes themes, sub-tasks, and custom recurring tasksâfree to try + paid.
See our top 51 AI Tools for Business here!
*This is sponsored content. Advertise in The Neuron here.

Around the Horn.
- NVIDIA acquired a startup called Gretel that generates synthetic training data for more than $320M.
- HuggingFace submitted its own open-source AI policy suggestions to counter Google, OpenAI, and Anthropicâs rival submissions to the White House AI Action Plan.
- Stripe added a new $15 fee per disputeâunless you use their Smart Disputes AI to resolve the issue.
- Researchers found AIâs ability to complete multi-step tasks doubles every 7 monthsâif sustained, AI will be able to handle week-long tasks in 2-4 years.

FROM OUR PARTNERS

Deploy secure, no-code AI agents for any team or role from one enterprise-grade platform that users love. Thatâs Sana Agents.

Thursday Trivia
One is real, and one is AI. Which is which? (vote below!)
A.

B.

Which is AI?
The answer is below (for real this time!), but place your vote to see how your guess compares to everyone else (no cheating now!)
Here are the results from last weekâs trivia:

Not even close!
Hereâs what you said:
- M.O. chose B: âEverything's a little... fuzzy.â
- M.P. chose A: âAi would never pick the shag carpet and lumpy bedding materials, as in B. The A. is much more polished and professionalâ
- Some of our favorite callouts on B: âChair leg too longâ, âlaptop looks too bigâ, âcarpet is weirdâ, âPurple hazeâ, and, according to C.C, âthe last time I was there there were no apartments in that position opposite the Bund (insider knowledge???).â

A Cat's Commentary.


Trivia answer: A is AI, and B is human-made fantasy series Elric of Melniboné!