815 words
4 minutes

What Kind of Data Does AI Use

Cover image for What Kind of Data Does AI Use

Hey, you brilliant tech babes (and kings — I see you too 👀)!

You already know how AI learns and where it shows up in real life. But there’s one more thing you absolutely need to know to truly get AI:

It’s all about the data, darling.
Like, literally. No data = no AI.

Let’s break it down, step by step — from the types of data to why they matter, and what can go wrong if your data’s a mess (spoiler: everything).

So grab your iced latte and let’s talk dirty — data dirty.


Structured vs. Unstructured — What’s the Tea?#

Structured Data = Neat girl energy#

Think spreadsheets, columns, drop-down menus. Basically, info that fits in tidy rows and is super easy to organize.

Real-life example:
Your online store tracks shoppers by name, age, and how much they spent last month.

  • Name: Emma
  • Age: 28
  • Purchase: $89.99
  • VIP status: Yes

Perfect for AI to eat up and analyze.
This kind of data is the go-to for supervised learning. It’s simple, predictable, and super model-friendly.

Unstructured Data = Total chaos… but gold#

This is all the messy stuff AI needs to decode:

  • Tweets
  • DMs
  • Selfies
  • Voice notes
  • TikToks
  • Review texts
  • Screenshots of what your friend sent you at 2am

This data is harder to process — but it’s also where the juicy, human stuff lives.
With the right tools (like NLP or computer vision), AI can feel what people are saying, seeing, and reacting to.


The Data Types AI Actually Works With#

Here’s what shows up in real-world projects — broken down like a cute cheat sheet:

Data TypeLooks Like…ExampleWhat AI Does With It
NumericalNumbers, prices, scores5, 29.99, 90%Normalize, scale, compare
CategoricalLabels, options, categories”Beginner”, “NY”, “Tote bag”Convert to machine-friendly codes
TextReviews, messages, bios“Ugh, I loved it so much!”Understand mood, topic, emotion
ImageSelfies, product pics, screenshotsPhoto of a red dressDetect shapes, colors, objects
AudioVoice memos, customer callsVoice note: “Sooo… mad or just tired?”Turn into text or audio features
VideoInsta Reels, tutorials, dashcamsMakeup GRWM TikTokAnalyze image + sound together

⚙️ What Makes Data “Model-Ready”?#

You don’t just throw your raw data into an AI model and hope for the best — that’s like showing up to a glam shoot with bed hair and mismatched socks.

Your data needs a serious prep routine:

  1. Cleaning — remove duplicates, typos, missing stuff
  2. Formatting — make sure prices are numbers, dates are dates, and text is readable
  3. Labeling — if your model needs answers, you have to tell it what’s what (e.g. “This is a cat. That’s a croissant.”)
  4. Balancing — avoid over-representing one side (like 90% apples and 10% oranges — we’re not doing fruit bias today)

Why Bad Data = Bad AI#

Let’s be real: even the smartest AI won’t save your butt if your data’s a mess.
Here’s what happens if you skip the prep:

  • Your model makes wrong predictions
  • It becomes biased (like recommending only men’s products if it’s seen mostly male data)
  • It just… breaks. And blames you silently.

Example:

Imagine you want your AI to recognize whether a voice note is angry or chill.
But most of your training data is people being excited.
Now when someone calmly says “I’m fine,” the model might think they’re thrilled — or worse, lying.

That’s not AI’s fault. That’s ✨data drama✨.


So… What Should You Know as a Beginner?#

Even if you’re not building models (yet), just knowing this gives you a major edge:

  • Data is the fuel AI runs on
  • Clean, balanced data = accurate, smart AI
  • Unstructured data is harder, but so much more real and emotional
  • You can totally start working with it — one TikTok caption or product review at a time

Final Thoughts from a Data-Loving IT Girl#

People think AI is all about code and algorithms. But girl, no.
AI is 80% data, 20% logic — and 100% what you feed it.

So if you’re learning AI, working in tech, or just curious about how Netflix knows you want to watch K-dramas at 2am… it all comes down to the data.

Clean it. Understand it. Respect it.

Because even your future AI model needs a solid skincare routine before it glows.


Tools I Personally Trust#

If you want to make your digital life a little calmer - here are two tools I use every day:

🛸 Proton VPN - A trusted VPN that secures your Wi-Fi, hides your IP, and blocks trackers. Even in that no-password café Wi-Fi, you’re safe.

🔑 Proton Pass - A password manager with on-device encryption. Passwords, logins, 2FA - always with you, and only for you.

These are partner links - you won’t pay a cent more, but you’ll be supporting DevOps.Pink. Thank you - it really means a lot 💖


Social Channels#

🎬 YouTube
🐦 X (Twitter)
🎨 Instagram
🐘 Mastodon
🧵 Threads
🎸 Facebook
🦋 Bluesky
🎥 TikTok
💻 LinkedIn
📣 daily.dev Squad
✈️ Telegram
🐈 GitHub


Community of IT Experts#

👾 Discord


Refill My Coffee Supplies#

💖 PayPal
🏆 Patreon
🥤 BuyMeaCoffee
🍪 Ko-fi
Telegram Boost


Is this content AI-generated?

Absolutely not! Every article is written by me, driven by a genuine passion for Docker and backed by decades of experience in IT. I do use AI tools to polish grammar and enhance clarity, but the ideas, strategies, and technical insights are entirely my own. While this might occasionally trigger AI detection tools, rest assured—the knowledge and experience behind the content are 100% real and personal.

What Kind of Data Does AI Use
https://www.devops.pink/what-kind-of-data-does-ai-use/
Author
Tatiana Mikhaleva
Published at
2025-04-14
License
CC BY-NC-SA 4.0