- The Neuron
- Posts
- đș Gemini Ultra vs. ChatGPT-4
đș Gemini Ultra vs. ChatGPT-4
PLUS: Meta resists the "AI boys club".
Welcome, humans.
Unlike most AI research labs, Metaâs is primarily comprised of women, including 60% of its leadership team. Exciting times!
Among other benefits, this could mean that Meta's AI models end up excelling in EQ and multitasking. :)
Hereâs what you need to know about AI today:
Bard is now Gemini, and Google launched its ChatGPT-4 competitor.
ChatGPT-4 outperforms Gemini Ultra in most work-related tasks.
OpenAI's next move might be GPT-4.5...or something entirely new.
Tech execs say they have to spend money to make money on AI.
Introducing Gemini Ultraâhow it stacks up against ChatGPT-4. đ
Gemini
After 10 months of anticipation, 1 faked demo, and 100s of hype-y TechCrunch pieces, Googleâs rival to ChatGPT-4 is here: Gemini Ultra.
Also, Bard has gotten a name upgrade to âGeminiâ.
Hereâs a breakdown for the name-challenged (like us):
Gemini Pro 1.0 (previously Bard) is akin to ChatGPT-3.5, making the best free chatbot on the market Claude 2.1âyes, you heard that right.
Now, Google claims that Gemini Ultra is the best available chatbot, outperforming ChatGPT-4 on benchmarks. So, we put the two head-to-head across various work-related tasks to see whatâs what.
*TLDR: Gemini Ultra is a big upgrade from Bard, but is still mildly worse than GPT-4.
Where ChatGPT-4 wins:
Faster responses.
Superior logical reasoning, creativity, & handling complex verbal tasks.
Produces code with fewer errors.
Maintains context better across long convos.
Hallucinates and gets off topic less.
Can handle PDFs, data analysis, & math tasks better.
Better at image generation & understanding.
Less censoredâearly reports suggest Gemini avoids controversial topics entirely.
Gemini ALSO mixed up the man/woman categories
Where Gemini wins:
Better at translating languages.
Tries harder to answer your question & initiates follow-ups.
Enhanced search capabilities, integrating Google search results.
Better integrations âYouTube, Maps, & soon Workspace apps (Gmail, Docs, Sheets, etc).
You can test Ultra for 2 months at no charge here.
Where they tie:
Which should you use?
Our guidance mirrors our previous Bard vs. ChatGPT piece: For work tasks, ChatGPT-4 is preferable, but turn to Bard for fact-based questions.
(Perplexity *seems* to edge out Gemini in search capabilitiesâŠstay tuned for a detailed comparison.).
If your operations heavily rely on Google Workspace, consider trying Gemini Ultra during the two-month free trial to assess its utility in everyday work tasks.
*we'll update our advice as we continue exploring Gemini Ultra.
FROM OUR PARTNERS
Attend this AI conference and accelerate your business.
Join us at the Imagine AI Live conference, March 27-28, at the new Fontainebleau Las Vegas, and discover how AI is transforming the business landscape.
Learn from industry experts, and network with like-minded professionals.
Discover how AI can speed up your enterprise, expand your business model, and improve your value proposition.
Spots are filling up fast! Don't miss out on this chance to future-proof your business.
Slow & steady doesnât win the race, at least for nowâŠ
Why it matters: YesâGemini Ultra means Google is closing the gap with OpenAI, but itâs still behind by almost 12 months.
We donât see much changing. Maybe Google can convince some Workplace users to pay for Ultra, but ChatGPT+ power users arenât going anywhere.
Whatâs next for AI?
Probably an upgrade to ChatGPT-4.5 sometime in 2024.
In the long term, though, we expect all these chatbots to converge on the forthcoming wave of AI: agents.
agents visualization from Abacus
Agents are the type of language Microsoft has been using with Copilot, Amazon with Bedrock, and Google with Workspace.
The Information reported on Wednesday that OpenAI is developing two types of AI agents that can automate entire workflows, not just individual tasks.
Hereâs what this could look like:
An AI agent takes over your device â migrates data from documents to spreadsheets â prepares the expense report â inputs it into accounting software.
(Currently, ChatGPT can assist with pieces of this workflow, provided it's prompted well).
We have a lottttt more to say on agents. Stay tuned!
Intelligent Insights.
Feast your eyes on our top picks of the week!
This AI learnt language by seeing the world through a babyâs eyes (link).
Tech execs are telling investors they have to spend money to make money on AI (link).
Mamoon Hamid and Ilya Fushman of Kleiner Perkins: âMore than 80%â of pitches now involve AI (link).
OpenAIâs Secret Weapon Is Sam Altmanâs 33-Year-Old Lieutenant (link).
Friday Faculty.
ML/AI Engineer, Infrastructure at Figma (SF or NYC).
Child Safety Enforcement Specialist at OpenAI (SF).
43 open jobs at Stack AV (Pittsburgh/remote).
Technical Architect at Databricks (Denver).
34 open positions at Lambda (Bay Area).
Search from over 10,000 AI roles on The Neuronâs Jobs Board.
A Cat's Commentary.
Thatâs all for today, for more AI treats, check out our website. Get your brand in front of 330,000+ professionals here. See you cool cats on Twitter: @nonmayorpete & @noahedelman02 |
|