Welcome, humans.
OpenAI’s exec team (including Sam) did a Reddit AMA that’s worth a skim. Some highlights: the company is working on a new DALL-E update which Altman said “will be worth the wait”, and AI agents that can perform tasks autonomously and message users first will be “a big theme in 2025.”
ALSO: Camera mode for ChatGPT is in development, o1 may soon be able to see images and sing, and they're planning to implement a hands-free way to end voice conversations. And then there was this little nugget…
Here’s what you need to know about AI today:
- We break down the best AI for major tasks (and why).
- T-Mobile invested $100M in OpenAI's customer service AI.
- Apple enabled ChatGPT Plus upgrades in iOS settings.
- OpenAI hired Meta's AR / VR hardware leader.
The Best AI models for every business task (yes, we did the research)
Nobody in AI wanted to launch anything new yesterday (why not, was something going on?), so we figured it would be a good time to do a recap on all the top AI models and which one is best for which use case.
We dug into the latest benchmark data, pricing, and real-world performance metrics to create the definitive guide to choosing the right model for your needs. Let's break down which models excel at what:
For Coding & Development: Claude 3.5 Sonnet (new)
- Consistently outperforms GPT-4o in real-world coding tasks.
- Better at understanding complex codebases.
- More reliable at generating working code.
- Costs less for similar tasks.
For Data Analysis & Processing: Gemini 1.5 Pro
- 2M token context window (largest available).
- Superior at handling large datasets.
- Built-in data visualization capabilities.
- Strong integration with Google's ecosystem.
For Content Creation: Claude 3.5 Sonnet (new)
- Best-in-class creative writing capabilities.
- More consistent tone and style.
- Better understanding of context.
- Lower hallucination rate.
For Visual Tasks: GPT-4o
- Most accurate image analysis.
- Better at complex visual reasoning.
- Strong multimodal capabilities.
- More reliable at following visual instructions.
The Budget-Friendly Options:
If you're watching costs (who isn't?), here are the best value picks.
- Gemini 1.5 Flash: $0.35/1M tokens
- Ministral 3B: $0.04/1M tokens
- GPT-3.5 Turbo: $0.50/1M tokens
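To see what those per-token prices mean for an actual bill, here is a minimal Python sketch. It assumes the prices listed above are flat blended rates (real APIs usually charge input and output tokens separately), and the 50M-tokens-per-month workload is a made-up number for illustration.

```python
# Prices per 1M tokens (USD), copied from the list above.
# Assumption: treated as one flat blended rate per model.
PRICE_PER_MILLION = {
    "Gemini 1.5 Flash": 0.35,
    "Ministral 3B": 0.04,
    "GPT-3.5 Turbo": 0.50,
}


def monthly_cost(model: str, tokens_per_month: int) -> float:
    """Estimated monthly spend in USD for a given token volume."""
    return PRICE_PER_MILLION[model] * tokens_per_month / 1_000_000


if __name__ == "__main__":
    # Hypothetical workload: 50M tokens per month.
    for model in PRICE_PER_MILLION:
        print(f"{model}: ${monthly_cost(model, 50_000_000):.2f}")
```

At that hypothetical volume, the gap between the cheapest and priciest option here is about $23 a month, so for small workloads the quality difference may matter more than the price.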
Speed Champions
Need blazing fast responses? These models lead in output speed:
- Llama 3.2 1B: 555 tokens/second.
- Gemini 1.5 Flash: 311 tokens/second.
- GPT-4 Turbo: 125 tokens/second.
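Tokens per second is easier to feel as wait time. This rough sketch converts the speeds above into seconds per response. It assumes a steady generation rate and ignores network latency and time to first token; the 1,000-token response length is an arbitrary example.

```python
# Output speeds (tokens/second), copied from the list above.
TOKENS_PER_SECOND = {
    "Llama 3.2 1B": 555,
    "Gemini 1.5 Flash": 311,
    "GPT-4 Turbo": 125,
}


def seconds_to_generate(model: str, output_tokens: int) -> float:
    """Approximate generation time, assuming a constant output rate."""
    return output_tokens / TOKENS_PER_SECOND[model]


if __name__ == "__main__":
    # Hypothetical response length: 1,000 output tokens.
    for model in TOKENS_PER_SECOND:
        print(f"{model}: {seconds_to_generate(model, 1000):.2f} s")
```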
And as you know, o1-mini and o1-preview are best for complicated reasoning tasks (in that order), like solving tough math problems. While o1-preview tops most benchmarks, its high cost means it's overkill for many business tasks.
For most companies and business needs, a mix of models will provide the best balance of performance and cost—use specialized models for specific tasks rather than trying to find one model to rule them all.
For example: We use Sonnet to help write (it’s hands down the best), and bounce between Perplexity and ChatGPT Search for search tasks. For projects involving lots of context, we use Claude Projects, but are seriously considering trying Gemini for them soon. If you want to get real fancy, you could take your first draft from Claude to ChatGPT Canvas and edit there.
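If you want to wire the "mix of models" idea into your own tooling, one simple approach is a lookup table that routes each task type to the pick from this guide. The sketch below is only an illustration: the task category names and the budget fallback are our assumptions, and it returns a model name rather than calling any real API.

```python
# Task type -> model, using the picks from this guide.
# Assumption: the category keys are hypothetical labels, not an official taxonomy.
MODEL_FOR_TASK = {
    "coding": "Claude 3.5 Sonnet (new)",
    "data_analysis": "Gemini 1.5 Pro",
    "content": "Claude 3.5 Sonnet (new)",
    "visual": "GPT-4o",
    "reasoning": "o1-mini",
}

# Assumption: fall back to a budget model when the task type is unknown.
DEFAULT_MODEL = "Gemini 1.5 Flash"


def pick_model(task_type: str) -> str:
    """Return the recommended model name for a task type."""
    return MODEL_FOR_TASK.get(task_type.strip().lower(), DEFAULT_MODEL)


if __name__ == "__main__":
    for task in ("coding", "visual", "translation"):
        print(f"{task}: {pick_model(task)}")
```

Keeping the routing in a single table means a better model can be swapped in by changing one line.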
FROM OUR PARTNERS
Leading marketing teams are transforming their workflows with this AWS genAI guide
We all want to use AI to be more efficient, but your marketing and sales workflow is likely wasting hours of your day, even with the help of ChatGPT.
Thankfully, AWS released a powerful new eBook: “Keys to revolutionize marketing and sales with generative AI.”
Inside, you’ll find real-world use cases to help you:
- Augment your human skills by automating repetitive tasks.
- Enhance customer experiences with personalized content.
- Boost productivity and streamline business operations through automation.
Download your copy right here.
Around the Horn.
- OpenAI signed a new $100M deal with T-Mobile to build a customer service chatbot that remembers customer details, slated for release next year.
- Apple will let you upgrade to ChatGPT Plus directly from your iPhone Settings when ChatGPT integration gets released as part of iOS 18.2 in December.
- Google is launching an AI hub in Saudi Arabia to research new AI models and “Saudi-specific AI applications.”
- Meta’s former AR/VR hardware leader announced she will join OpenAI, suggesting the company will make a serious push into physical devices.
Treats To Try.
- *In danger of losing deals to lower-priced competitors? DIVACS gives you the tools to prove your higher price tag delivers exponentially more value. Request a demo and try it out yourself here.
- TherapyAI built a tool to help you manage election-related stress.
- OpenHands is an assistant for coding with zero setup required, and Siter.io turns your Figma designs into live websites without any code needed.
- MockFlow sketches and visualizes website or app designs via wireframes, turning your UI ideas into simple drawings that your team can iterate on.
- ScalerX.ai creates custom chatbots for your Telegram channels that can sell products, answer questions, and handle support requests for you.
- Crisp organizes your messy customer chats into a shared team inbox to help you respond faster.
- Melies turns your movie ideas into animated films by generating scripts, characters, and scenes for you.
- Nifty combines your tasks, docs, chats, and roadmaps into one workspace so your team stops switching between apps.
See our top 51 AI Tools for Business here!
*This is sponsored content. Advertise in The Neuron here.
FROM OUR PARTNERS
Unlock the full potential of your workday with cutting-edge AI strategies and actionable insights, empowering you to achieve unparalleled excellence in the future of work. Download the free guide today!
Prompt tip of the week
Need some fresh ChatGPT prompts? “God of Prompt” has a massive list of 500+ conversation starters that'll help you do everything from writing blog posts to planning your next vacation.
The best part? These aren't your typical “write me a story” prompts; they're structured to get you specific, useful outputs. Think: turning research into papers, brainstorming marketing strategies, or building custom workout plans. Check ’em out!