Everyday AI
Posts
Claude 3.5 Sonnet Updates: AI can use computers now?

Claude 3.5 Sonnet Updates: AI can use computers now?

Apple Intelligence beta is live, Google and Qualcomm partner for AI-powered cars, Midjourney's web image editing tool and more!

Everyday AI
October 23, 2024

👉 Subscribe Here | 🗣 Hire Us To Speak | 🤝 Partner with Us | 🤖 Grow with GenAI

Outsmart The Future

Today in Everyday AI
9 minute read

🎙 Daily Podcast Episode: Anthropic just released its Claude 3.5 update and all we can say is daaangg. We show you what’s new and why this will change the future of work. Give it a listen.

🕵️‍♂️ Fresh Finds: Google makes AI text watermark available, OpenAI adds first Chief Economist and Character.AI & Google faces a lawsuit. Read on for Fresh Finds.

🗞 Byte Sized Daily AI News: Apple Intelligence beta is live, Google and Qualcomm partner for AI-powered cars and Midjourney unveils web image editing tool. For that and more, read on for Byte Sized News.

🚀 AI In 5: We’re showing you a secret ChatGPT hack to read images inside PDFs. See it here

🧠 Learn & Leveraging AI: Anthropic Claude 3.5 comes with some new amazing features. We break down everything you need to know about this update and how it’ll change the way you work. Keep reading for that!

↩️ Don’t miss out: Did you miss our last newsletter? We talked about the Claude 3.5 update, Advanced Voice mode lands in EU, OpenAI and Microsoft partner for local news. Check it here!

Claude 3.5 Sonnet Updates - AI can use computers now? 🧑‍💻️

AI can use computers now?

Yup.

With Claude 3.5 Sonnet updates, Anthropic's LLM now has access to 'Computer Use.'

Is this new mode going to change how we use LLMs? And what else is noteworthy with Claude's new updates in 3.5?

We go over it all.

Join the conversation and ask Jordan questions on Anthropic Claude here.

Also on the pod today:

• Claude Model Benchmarks 📊
• New Computer Use Feature 💻
• Potential for Business Applications 🏢

It’ll be worth your 52 minutes:

Listen on our site:

Click to listen

Subscribe and listen on your favorite podcast platform

Listen on:

Spotify | Apple Podcasts |
Google Podcasts | Amazon Music |

Here’s our favorite AI finds from across the web:

New AI Tool Spotlight – Topview creates marketing videos with GPT-4o & AI avatars, AMA is an AI marketing assistant and Beloga is a personal AI knowledge amplifier.

Google – Google is making SynthID Text, its technology that lets developers watermark and detect text generated by generative AI models, generally available.

OpenAI – Dr. Ronnie Chatterji has been named OpenAI’s first Chief Economist.

NVIDIA - NVIDIA’s CEO says that the design flaw that Blackwell AI chips had are now fixed.

He also spoke on the EU lagging behind the U.S. and China in AI investments.

Trending in AI – A new lawsuit is blaming Character.AI and Google for the death of a 14-year-old boy.

Microsoft – Microsoft Photos is getting a new AI super resolution feature that lets you upscale low-quality photos.

AI Governance - The U.S. AI Safety Institute is at risk of being dismantled if Congress doesn’t authorize it.

AI Models – Ideogram has launched infinite Canvas for manipulating and combining generated images.

AI in Media – Over 11,500 creative professionals have signed an open letter demanding the prohibition of using human-created art for AI training without permission.

Read This – Liquid AI is redesigning the neural network.

AI Security - Researchers have found a 'Deceptive Delight' method to jailbreak AI models.

1. Google and Qualcomm Team Up for AI-Powered Cars 🚙

Qualcomm and Google are set to transform your driving experience with their upcoming "digital cockpit," integrating Qualcomm's Snapdragon technology with Google's Android Automotive OS. This partnership aims to introduce features like intuitive voice assistants and real-time updates that could make your car not just a vehicle, but an extension of your digital life, allowing seamless connections with your devices and even autonomous parking capabilities.

With Mercedes Benz and Li Auto as initial partners, drivers can expect an enhanced journey that anticipates needs, such as automatically finding parking while they enjoy their dinner plans.

2. Apple Unveils AI Image Editing with Watermarks 🖼️

In a bid to preserve the authenticity of photography, Apple has introduced a new "Clean Up" feature in iOS 18.1 that allows users to remove unwanted objects from images while clearly marking them as modified. Craig Federighi, Apple’s software chief, emphasized the importance of maintaining trust in photographic content amidst rising concerns about AI's potential for deception.

Unlike competitors like Google and Samsung, which enable more extensive AI enhancements, Apple is taking a cautious approach, focusing on subtle edits that don’t alter the fundamental meaning of an image.

3. RunwayML Unveils Act-One: A Game Changer for Facial Animation 👱

RunwayML has just launched Act-One, a groundbreaking AI model that revolutionizes facial animation by enabling the transfer of an actor's performance directly to animated characters using only video and voice recordings. This innovative technology allows for the creation of realistic animations without complex equipment, requiring just a smartphone to capture subtle details.

Beyond enhancing animated films and games, Act-One also facilitates the portrayal of multiple characters by a single actor in a single scene, opening new avenues for storytelling

4. Midjourney Unveils New Web Image Editing Tool 🛠️

Midjourney is set to roll out an enhanced web tool next week, allowing users to edit uploaded images with its generative AI, including a feature to retexture objects based on captions. This move comes amid rising concerns over AI-edited images, as platforms grapple with how to label content generated or modified by AI.

In a bid to prevent misuse, the tool will initially be available to a select group of users, backed by increased human moderation and advanced AI oversight.

5. Canva Unveils Dream Lab with Leonardo.AI 💭

Canva has just launched its new Dream Lab, powered by Leonardo.AI, which the company acquired three months ago. This innovative hub enhances the platform's capabilities by allowing users to generate stunning visuals in over 15 styles, including 3D renders and illustrations, significantly boosting design possibilities.

With an expanded content library featuring Artlist's Premium Video Library and additional photos from Pocstock, Canva is set to redefine graphic design for creators everywhere.

6. Stability AI Unveils Stable Diffusion 3.5 Update 🤯

Stability AI has just launched Stable Diffusion 3.5, a significant upgrade aimed at reclaiming its edge in the competitive text-to-image generative AI landscape. This latest iteration introduces multiple customizable models, including an 8 billion parameter version promising superior quality and prompt adherence, as well as a faster, distilled variant.

Notably, the update incorporates advanced techniques like Query-Key Normalization and enhancements to the MMDiT-X architecture, boosting both image quality and multi-resolution capabilities.

7. Apple Unveils Exciting AI Features Ahead of iOS 18.1 Release 🍎

Apple has officially announced a beta version of its new Apple Intelligence features, including the highly anticipated integration with ChatGPT, which is set to launch publicly next week alongside iOS 18.1. This rollout aims to enhance user experience on newer devices, with innovative tools like Genmoji for creating custom emojis, Image Playground for AI-generated images, and Image Wand for effortlessly removing distractions from photos.

Notably, Siri will now seek ChatGPT’s assistance for more complex inquiries, allowing users to tap into advanced AI insights without needing an OpenAI account.

Secret ChatGPT trick to read images inside of PDFs

Click Image To Play Video 👆

Can ChatGPT analyze images within a PDF?

Most AI experts would say no.

BUT we found a secret hack to make it possible! We show you how it works.

Check out today's AI in 5.

🦾How You Can Leverage:

Anthropic has entered the (AI) chat.

While many people are eyeing Anthropic’s updates to Sonnet 3.5 and the forthcoming Sonnet 3.5 updates, we didn’t pay them TOO much attention.

Because we think two simple words could change how we interact with technology in the future.

Computer Use.

What is it?

Put simply, it’s talking to Anthropic’s updated LLM, in natural language, and then the ‘Computer Use’ mode …. Uses a computer.

Yes. An AI can navigate on a virtual desktop, launch programs, type, click and executive actions.

Just like a human.

That’s the new LLM-powered feature that Anthropic just revealed, and we gave the new model updates AND the Computer Use mode a deeper dive on today’s show.

So, what’s worth paying attention to and what’s just fluff and hype?

Glad you (rhetorically) asked.

Let’s break down key insights from today’s show.

3, 2, 111111111……

1 – New model, who dis? 🤩

Alright this one’s a bit confusing.

But a highlight of Anthropic’s recent release was updating two of its marquee models.

In short, Claude 3 launched with three varieties:

March 2024

Haiku 3 — the smallest and least capable, but fast and cheap (via the API)

Sonnet 3 — the middle of the pack

Opus 3 — the most powerful and most expensive model to use (via the API)

June 2024

Sonnet 3.5 — Only the middle model got a shiny update.

October 2024

Connect 3.5 (New) — Not sure what’s wrong with 3.6, but 3.5 New is available now for the front-end and API.

Haiku 3.5 — Haiku gets an update, but not yet available.

Opus……. Poor Opus. Still rocking V3

What it means:

OK, model soup. We know.

More on the performance and benchmark below. While we didn’t focus TOO much today on these new updates and performance, we DID do a quick/live rundown on our YouTube channel yesterday.

Go check that out.

2 – To o1, or not to o1? 🤔

Alright…. Here’s the benchmarks for us dorks!

With the new updates to Sonnet 3.5 (new) and Haiku 3.5, Anthropic also dropped the prerequisite benchmark chart.

Oh, what’s that at the bottom?

Anthropic straight up said, ‘Na, we’re not gonna compare ourselves to OpenAI’s o1 reasoning model.

* Our evaluation tables exclude OpenAl's o1 model family as they depend on extensive pre-response computation time, unlike typical models. This fundamental difference makes performance comparisons difficult.

Sure, that’s fair.

Buuuuuuutttt….. while Anthropic on one hand says it’s not gonna compare itself to OpenAI’s Strawberry o1 model, it then cherry picks an instance to …. YES! Compare itself to o1 when it’s convenient?

Huh.

What it means:

We went into MUCH greater depth on this in today’s show, including some under-the-hood changes in how Sonnet 3.5 seems to be using a bit of reasoning/Chain of Thought just like… OpenAI’s o1.

But if you want an apples-to-apples comparison of how o1 and Sonnet 3.5 (new) REALLY stack up against each other, here ya go.

3 – Computer Use: Will we all use it? 🧑‍💻️

Alright, now we can get to the main event?

Bet.

Like we said, the new ‘Computer Use’ module is now available in beta, and is used via the API.

(Also in Amazon’s and Google’s platforms.)

So while we hope this makes its way to a downloadable version for the masses, right now it’s a bit developer-centric.

Also, we gotta give huge props to Anthropic.

Even though the ‘Computer Use’ is far from perfect, Anthropic just shipped it.

No hype.

No waitlists.

Just shipped it.

And admitted that it was flawed and error-prone.

Regardless of your thoughts on that GTM strategy, we’d say this approach is a breath of fresh-ish air.

What it means:

Hot dang y’all.

While the beta of Computer Use is super buggy and incomplete while running on a slimmed-down virtual machine and is chock full of guardrails, it does show huge promise.

The fact that today you (and your company) can literally give a LLM natural text commands and it can autonomously use a computer is kinda bonkers.

Transfer data between a PDF and spreadsheet? Yup.

Help research and plan your next business trip on its own? Sure.

Automagically do most of your computer-based work with little human oversight? Not yet.

Here’s the hot take: Computer Use is very beta.

And while the floor is pretty low, the ceiling is CRAZY high on this one.

What are your thoughts?

⌚

Numbers to watch

$20 Million

Granola, an AI meeting notepad, has raised $20 million.

Now This …

Let us know your thoughts!

Vote to see live results

How do you currently search for things?

Reply

or to participate.

Claude 3.5 Sonnet Updates: AI can use computers now?

Apple Intelligence beta is live, Google and Qualcomm partner for AI-powered cars, Midjourney's web image editing tool and more!

Outsmart The Future

Today in Everyday AI9 minute read

Claude 3.5 Sonnet Updates - AI can use computers now? 🧑‍💻️

Also on the pod today:

Subscribe and listen on your favorite podcast platform

Listen on:

Spotify | Apple Podcasts | Google Podcasts | Amazon Music |

Secret ChatGPT trick to read images inside of PDFs

🦾How You Can Leverage:

$20 Million

Now This …

How do you currently search for things?

Reply

Today in Everyday AI
9 minute read

Spotify | Apple Podcasts |
Google Podcasts | Amazon Music |