Welcome, humans.
It looks like small businesses are getting deepfaked now: Meet "Ethos," Austin's fake #1 restaurant. All AI-generated⊠even the croissant hippo:
The most criminal part about this is the fact that croissant Hippo isnât real⊠would 100% buy.
Reddit sleuths investigated and found that the only part of the website that works is the merch store. Weâd buy a Hippo Croissant shirt⊠accidental genius business idea?
Even though the restaurant has 72K followers and tons of engagement, Redditors are thinking those are probably AI, too.
Hereâs what you need to know about AI today:
- Apple researchers published a paper debunking AIâs ability to reason.
- 40% of the $79B in âcloudâ funding went to genAI startups.
- Boston Dynamics and Toyota teamed up on robot AI.
- Parents sued a school over AI cheating accusation.
Apple researchers want you to know that language models canât actually âreason.â
Apple's AI team just dropped a truth bomb on the world of large language models. In a new paper, they've shown that popular AI chatbots like ChatGPT aren't the math whizzes we thought they were.
By the contraryâthey struggle with basic reasoning that most humans take for granted.
The researchers developed a new benchmark called GSM-Symbolic to put these models through their paces. Here's what they found:
- Changing numerical values in questions caused significant performance drops across all models tested.
- As questions got more intricate (adding more clauses), accuracy plummetedâŠand variance skyrocketed.
- Adding a single irrelevant sentence to a math problem caused accuracy to nosedive by up to 65%.
To illustrate this, consider their "kiwiâ problem:
When asked to count kiwis, many top models stumbled (including GPT and Claude), subtracting five âsmallerâ kiwis when their size shouldn't have mattered at all. It was a random detail that needed to be filtered out.
Therefore, the study concluded that current language models aren't truly reasoning. Instead, theyâre performing âsophisticated pattern matching" that falls apart under scrutiny.â Duh, thatâs why theyâre called âpredictive modelsâ, Apple!
For his part, Ilya, formerly of OpenAI, says predicting the next word does lead to understanding. So could more predictions lead to more understanding, like reasoning? Thatâs the idea behind o1, anyway (full Ilya interview here btw, great listen).
Our take: If Apple doesnât think AI can reason, why is it about to go all in on AI on its devices? Aha! HEREâs whyâŠ
FROM OUR PARTNERS
Automate your sales follow-up emails with AI
Did you know you can automate your follow-up game and close more deals with Attention?
While youâre drowning in follow-ups and data entry, your competitors are leveraging Attention's AI to save a ton of time.
Attention lets you:
- Send personalized follow-up emails within minutes of ending a call.
- Get cross-call insights to understand why you're winning (or losing) deals.
- Receive live coaching during calls to handle objections like a pro.
The result? Attentionâs users are LOVING the time saved:
- âGame-changer for our productivity. Thanks to Attention, our reps save 20-30 minutes per call with automated CRM data entryâ - Yan Kessler, VP Sales at Aspire Technologies.
Around the Horn.
Good day for hippo content, TBH. Try it here on iOS + read more here.
- Accel released a new report that found 40% of all VC funding for cloud startups (around $79B) went to those that focused on genAI.
- LatticeFlow's new LLM Checker tool tested the top AI models for how well they comply with the EU AI Actâit found most did well overall, but some struggled with bias and cybersecurity issues.
- Boston Dynamics and Toyota Research Institute announced a collaboration to develop AI for the electric Atlas humanoid robot.
- Parents sued a school after their son was accused of cheating by using AI on his assignment. Their argument? There were no âestablished rules, policies or proceduresâ about how AI can or canât be used, or how staff should handle it.
Treats To Try.
- *You need actionable, real-time insights to close more deals and boost CX. Gladia just made that dream come true with Gladia Real-Time. Specialized in state-of-the-art speech AI APIs, Gladia Real-Time helps you unlock instant insights from any voice conversation. Click here to try for free.
- Dropbox launched Dash for Business which lets you find and manage content across all your work apps, tabs, and files.
- Focus Buddy updates your to-do list in real-time, checks in to help you overcome procrastination, and provides weekly insights on your work habits.
- FAQ Widget creates FAQ popups for your website that answer visitor questions to help increase sales.
- Tattoon is an app that lets you demo what a tattoo would look like on your body to see if you like it.
- Check out this person who self-hosted Llama 3.2 on their home server, and their step by step guide for you to replicate.
- Mistral released a new model called Les Ministraux thatâs built to run locally on phones and laptopsâthe 8B version is available to try today for research purposes, with a 128K context window.
- Just for fun: Political Debate Simulator lets you pick any model you want to have U.S. Presidential candidates Kamala Harris and Donald Trump debate each other on any topicâask them to debate who ate the last slice of pizza!
See our top 51 AI Tools for Business here!
*This is sponsored content. Advertise in The Neuron here.
Is what youâre building future-proof? Find out at Spectra 2024 on October 23rd
Hear from edge computing leaders like TensorFlow pioneer Pete Warden, Edge Impulse co-founders Zach Shelby and Jan Jongboom, Qualcomm AI Hub head Siddhika Nevrekar, and Particle CEO Zach Supalla to gain insider insights into tomorrowâs smart tech and equip yourself with the necessary tools to create the next wave of intelligent products.
Register now â 100% free and virtual!
Thursday Trivia.
One is a real snapshot from the mall in 1989, and one is AI. Which is which?
A.
B.
Which is which?
The answer is below, but place your vote to see how your guess compares to everyone else (no cheating now!)
A.
B.