- Everyday AI
- Posts
- 7 common LLM mistakes and how to avoid them
7 common LLM mistakes and how to avoid them
Amazon Alexa gets Claude upgrade, Apple and NVIDIA look to invest in OpenAI, Dell shares rise and more!
š Subscribe Here | š£ Hire Us To Speak | š¤ Partner with Us | š¤ Grow with GenAI
Sup yāall š
Here in the good ol U-S of A, we have a federal holiday on Monday. So, weāll see ya back in action on Tuesday with AI News that Matters.
Hope you can join us.
Oh, weāre doing a little giveaway. All you gotta do is VOTE YES if youāre interested in winning a free 90-minute GenAI consult.
(For your own learning or for your biz. You choose!)
Do you want a free, 1-on-1, 90-minute LLM training session?Vote YES to be entered, and check our announcement next week which we'll be making exclusively in our newsletter. |
Weāll announce the winner exclusively in our newsletter next week, so make sure you read/watch the for announcement.
āļø
Jordan
(Letās connect. Just tell me who ya are! lol)
Outsmart The Future
Today in Everyday AI
8 minute read
š Daily Podcast Episode: Weāve trained thousands of business leaders on using Large Language Models, and we see the same mistakes. Over and over and over again. So, weāre tackling the 7 most common LLM mistakes and how to avoid them. Give it a read or listen.
šµļøāāļø Fresh Finds: FDA to consolidate AI efforts, Big techās secret to acquire AI unicorns, Tom Hanks warns of fake AI ads and more. Read on for Fresh Finds.
š Byte Sized Daily AI News: NVIDIA and Apple look to invest in OpenAI, Amazon Alexa gets a Claude AI makeover, Dell shares rise and more. Read on for Byte Sized News.
š AI In 5: Can AI help you go viral? See if this AI video tool can help catapult you to viral vertical video fame. See it here
š§ Learn & Leverage AI: Pitfalls. Pitfalls everywhere. Weāve taken hundreds of hours of LLM training and have written an easy guide to help you avoid common LLM pitfalls. Keep reading for that!
ā©ļø Donāt miss out: Did you miss our last newsletter? We talked about Google Gems explained, Big tech and US Gov. partner and California AI bill advances. Check it here!
Stop making these 7 Large Language Model miļ»æstakes š
You wouldn't ride a unicycle on a highway. š³
Sure, that's technically a way you can travel.
ā³ But that doesn't mean pedaling a unicycle is an acceptable way to travel from point A to point B.
ā³ That's how people are using Large Language Models.
ā³ There's millions using LLMs like riding a unicycle on an interstate.
Don't worry.
We'll set the record straight and help you trade in that unicycle for a friggin Bentley.
(Or like a 2009 Toyota Prius hybrid. Whatever's your speed.)
On todayās show, we showed you how to Stop making these 7 Large Language Model mistakes.
Also on the pod today:
ā¢ Avoiding common LLM mistakes āļø
ā¢ Staying up to date with GenAI š°
ā¢ Preparing for the future of work š®
Itāll be worth your 43 minutes:
Listen on our site:
Subscribe and listen on your favorite podcast platform
Listen on:
Hereās our favorite AI finds from across the web:
New AI Tool Spotlight ā AFFiNE AI uses AI to help you better draw, write and present, Storyville gives you (or maybe your kids) personalized bedtime stories with the help of AI and Flownote uses AI to transcribe your meetings into concise summaries.
Trending in AI ā Tom Hanks has issued a statement on Instagram to warn about fake AI video ads of himself circulating social media.
AI in Medical ā The FDAās drug center is consolidating its AI efforts under one council.
Big Tech ā Hereās how big tech companies are acquiring AI companies without having to buy them.
LLMs ā Cohere has updated its Command R series enterprise AI models.
Weāre releasing improved versions of the Command R series, our enterprise-grade AI models optimized for business use cases.
You can access them on our API, @awscloud Sagemaker, and additional platforms soon.
ā cohere (@cohere)
2:09 PM ā¢ Aug 30, 2024
Read This ā The BBC is starting to use AI to generate subtitles.
AI in Politics ā Japanās military is planning to spend on AI and automation to combat its recruitment crisis.
AI Image Models ā Leonardo AI has released an API to its foundational model, Phoenix.
1. NVIDIA and Apple Consider Investment in OpenAI š
Whoa.
NVIDIA and Apple are reportedly in discussions to contribute to OpenAIās upcoming fundraising round, which could elevate the ChatGPT creatorās valuation to an astonishing $100 billion. This news follows Bloomberg's initial report and comes as OpenAI seeks new capital to combat a projected $5 billion loss by year-end while expanding its AI training and staffing efforts.
Additionally, Microsoft, which already holds a 49% stake in OpenAI, may also participate in this fundraising, highlighting the growing interest from major tech players.
2. Amazon's Alexa Set for Upgrade with Anthropic's Claude AI š£
Amazon is gearing up to enhance its voice assistant with the upcoming āRemarkable Alexa,ā which will be powered by Anthropicās Claude AI after previous versions struggled to meet user expectations. With a substantial $4 billion investment in Anthropic, the new assistant is expected to debut in mid-October, featuring improvements like daily AI-generated news summaries and a child-friendly chatbot.
However, users should prepare for a subscription model, potentially costing between $5 to $10 per month
3. Dell Technologies Gains Momentum with AI Server Demand š
Dell Technologies saw a 4% increase in shares following a strong demand for its AI-powered servers. The partnership with NVIDIA has paid off, leading to a 38% rise in revenue from Dell's infrastructure solutions group, totaling $25.03 billion, while their AI pipeline is now estimated between $11 billion and $13 billion.
Analysts are optimistic, with most rating Dell as a "buy" and raising their price targets, despite the stock being down 36% from its all-time high in May.
4. Googleās Approach to Disease Detection Through Sound Signals š
Google is making strides in healthcare by utilizing sound signals to predict early signs of diseases, including tuberculosis. The tech giant has trained its AI model with 300 million audio samples of coughs and labored breathing, collaborating with Indian startup Salcit Technologies to potentially deploy this technology on smartphones in underserved areas.
This initiative could significantly improve early disease detection in high-risk populations, showcasing Google's commitment to revolutionizing healthcare accessibility.
5. Oprah Hosts Star-Studded Special on AIās Impact šŗ
Oprah Winfrey is set to host a compelling ABC special titled āAI Future Us,ā premiering on September 12 at 8 PM ET. This event will feature notable figures, including OpenAI CEO Sam Altman and Bill Gates, as they explore the transformative effects of artificial intelligence on everyday life.
The program promises insightful discussions and demonstrations on AI's potential impact on jobs and society, with additional appearances by content creator Marques Brownlee and technology advocate Tristan Harris.
AI-powered shortcut Shortcut to create viral vertical videos?
Ever wondered how to create those viral vertical videos that dominate social media feeds?
Well todayās AI in 5 is for you!
Weāre breaking down Spikes Studio, an AI-generated video creator that takes your long form video and creates recommended viral clips in vertical format.
Check out today's AI in 5.
Weāve literally taught thousands of business leaders how to prompt inside large language models.
And for the past year, weāve seen the same mistakes.
Debunked the same myths.
And prioritized the same truths.
So, we thought it was time for a dedicated episode going over the 7 most common LLM mistakes that people make.
So, letās get to it. š
Mistake 7 ā Not understanding a LLMās knowledge cutoff š§
Forget what the companies are trying to tell you in their marketing. Even if a model is āconnected to the internetā itās not always up to date.
In short:
Models are trained on data.
Data is scraped from the internet. (Whether thatās legal or not will be decided in the coming years)
Humans train the models based on that data.
But the process between steps 2 and 3?
Thereās an expiration date, of sorts. The model training process can take many months, in which case the modelās training data (and knowledge cutoff) only get more and more stale.
Do this instead:
The LMSYS Chatbot Arena has a pretty up-to-date list of what each popular modelās knowledge cutoff is.
Side note ā it incorrectly lists the GPT-3.5 knowledge cutoff as September 2021 whereas itās actually January 2022. (The rest all look good!)
Need to know more about what a knowledge cutoff is, how it impacts LLMs and how they work?
Mistake 6 ā Not investigating internet connectivity š
If Big Tech pinky promises that their model is connected to the internet, that means knowledge cutoffs donāt matter, right?
And that we can also feel confident in any modelās output?
Wrrroooooong.
The approach (and consistency) of how different models talk to the internet varies.
And sometimes, itās downright awful. (Weāre looking at you, Google!)
Do this instead:
Nothing beats first-hand experience of trying different time-sensitive queries, and observing how different āinternet-connectedā LLMs act.
Or, you can sit on the couch and watch as we did the heavy lifting for you.
This episode compares with real-world tests how LLMs interact with the internet. This single episode is gonna save you time, improve your accuracy, and cut down on those dang hallucinations.
Mistake 5 ā Not managing your memory āļø
LLMs arenāt infinitely smart.
Just like us (and goldfish) they can only remember so many things.
And while models like Claude-3 and Gemini have stolen the show in terms of big memories and long context windows, theyāre not always accurate.
And the GPT models still lag a bit behind compared to Anthropic and Googleās big brained models.
Do this:
Itās like how the Star Wars text scrolls up and out of the screen ā LLMs can only retain so much information at a time.
Wanna dork out and go all-in on understanding tokenization and memory? If you really wanna up your LLM game, this is essential reading/watching.
Mistake 4 ā Paying attention to screenshots š„ļø
Guess what a screenshot from a large language model means?
Absolutely nothing.
You can tell a model to parrot anything you want then share that screenshot online.
Iām rich!
Do this:
Screenshots are a dime a dozen.
āAI expertsā are trying to share screenshots showing their super duper AI skills.
AI skeptics are trying to share their screenshots showing how dumb LLMs are.
All those things mean?
Those people donāt understand how models work.
If you really wanna show your work, you can always just share the chat URL, like this.
(Yeah, Jordan really didnāt win the lottery after all.)
Mistake 3 ā Thinking that LLMs are deterministic š«
AI chats arenāt like search engines.
You can put 1 prompt in 100 times and get 100 very different answers.
Or 50 different answers.
Or 2 slightly different answers.
Large Language Models are generative by nature, which means their next-token prediction abilities are meant to be generative.
A little random. A bit unpredictable.
Do this:
Go into the OpenAI playground, and play around with Top-P, temperature and more. Weād love to walk you through this, step-by-step, but going through the process on your own really helps you understand how generative models actually work.
(Reply to this email if youād be interested in an episode on the OpenAI Playground.)

Mistake 2 ā Thinking copy and paste prompts work š¾
If you see someone shilling copy-and-paste prompts promising to solve all of your problems, run for the hills.
Donāt pay attention to Billy Boys like this.

Hereās the truth ā prompts donāt really do anything.
Sure, they can get you from an F to a C- pretty quick, but thatās about all copy-and-pasting prompts is good for. Going from hot garbage to lukewarm trash.
Do this:
If youāre feeling reaaaaalllllly spicy, go read this 43-page research paper on chain-of-thought prompting. (We have.)
Or, you can just look at this graph and agree with math, science and logic: copy-and-paste prompts give you poor outputs and proper prompt engineering wins out every single time.

(If youāre not a graphs person, this says that āfew shotā prompting always outperforms zero-shot prompting. In other words, having a conversation with an LLM and giving it examples and working with it like an expert will always give you better results than copy-paste prompts.)
Mistake 1 ā Not understanding LLMs are the future of work š®
This aināt a hot take.
Google ā all in on LLMs.
Amazon ā all in on LLMs.
Microsoft ā all in on LLMs.
Meta ā all in on LLMs.
Apple ā (reportedly) all in on LLMs.
Thatās 5 of the 6 largest companies in the U.S. (and the other is NVIDIA ā the one literally powering the GenAI and LLM revolution.)
If you think AI is a fad or something thatās gonna come and go, weāll try to be nice when we say this: youāre very wrong.
Do this:
Hereās a fun little trick we started talking about last year. Instead of using the term āGenerative AIā or āLarge Language Model,ā start using the term Internet.
Would you use the internet to help you get a job? (Yes)
Would you use the Internet to do your work? (Yes)
Would you use the Internet to grow your business? (Yes)
Thatās just how the world works now.
The future of our personal and professional lives is based around LLM and Generative AI technology.
So the next time youāre thinking: āShould we use a LLM for this?ā then just swap out and use the word internet.
Or just know the answer is almost always āYes.ā
ā
Numbers to watch
$320 Million
Generative AI coding startup Magic lands $320M investment from Eric Schmidt, Atlassian and others.
(Or your fave LLM like Claude, Gemini, Copilot, etc)
Reply