Domain 1: Core Data Concepts

Your First Look at Data

Data comes in three flavours: structured, semi-structured, and unstructured. Understanding the difference is the first step to working with data in the cloud.

What is data?

Simple explanation

Data is just information written down so a computer can use it.

Think of a supermarket. The price sticker on a can of beans is data. The photo on the label is data. The handwritten note from your flatmate saying “we need more beans” is also data.

All three are information, but they’re organised very differently. That difference matters because it changes how computers store, search, and analyse them.

The three types of data

Structured data

Structured data fits neatly into rows and columns, like a spreadsheet or a database table. Every row follows the same pattern, and every column has a defined type (text, number, date).

Aisha’s example: The campus food ordering app stores orders in a table:

| OrderID | Student | Item | Price | Date |
| --- | --- | --- | --- | --- |
| 1001 | Aisha | Chicken wrap | 8.50 | 2026-04-20 |
| 1002 | Liam | Coffee | 4.00 | 2026-04-20 |

Every order has the same fields. You can sort by price, filter by date, or count orders per student β€” because the structure is predictable.

Key features:

  • Fixed schema (the column names and types are defined in advance)
  • Stored in relational databases or tabular files (CSV, Excel)
  • Easy to query using languages like SQL
  • Works best when every record has the same fields
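To make "easy to query" concrete, here is a minimal sketch using Python's built-in sqlite3 module. The table name `orders` and the snake_case column names are assumptions chosen for illustration; the rows are the two orders from Aisha's example.

```python
import sqlite3

# In-memory database, so nothing is written to disk.
conn = sqlite3.connect(":memory:")

# Fixed schema: column names and types are declared before any data arrives.
# Table and column names are illustrative, not taken from a real app.
conn.execute(
    """
    CREATE TABLE orders (
        order_id   INTEGER PRIMARY KEY,
        student    TEXT NOT NULL,
        item       TEXT NOT NULL,
        price      REAL NOT NULL,
        order_date TEXT NOT NULL
    )
    """
)
conn.executemany(
    "INSERT INTO orders VALUES (?, ?, ?, ?, ?)",
    [
        (1001, "Aisha", "Chicken wrap", 8.50, "2026-04-20"),
        (1002, "Liam", "Coffee", 4.00, "2026-04-20"),
    ],
)

# Sort by price, cheapest first.
by_price = conn.execute(
    "SELECT item, price FROM orders ORDER BY price"
).fetchall()

# Filter by date.
on_date = conn.execute(
    "SELECT order_id FROM orders WHERE order_date = ? ORDER BY order_id",
    ("2026-04-20",),
).fetchall()

# Count orders per student.
per_student = conn.execute(
    "SELECT student, COUNT(*) FROM orders GROUP BY student ORDER BY student"
).fetchall()

print(by_price)
print(on_date)
print(per_student)
```

All three queries work because every row has the same columns; none of them needs to check whether a field exists first.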

Semi-structured data

Semi-structured data has some organisation, but it’s flexible. Not every record needs the same fields. The data describes itself using tags, keys, or markers.

Aisha’s example: Her app also stores user profiles, but some students fill in more details than others:

// Student 1: minimal profile
{ "name": "Aisha", "email": "aisha@uni.ac.nz" }

// Student 2: detailed profile
{
  "name": "Liam", "email": "liam@uni.ac.nz",
  "phone": "021-555-0199", "dietary": "vegetarian"
}

Liam has extra fields that Aisha doesn’t. That’s fine, because the data is flexible. Each record carries its own labels (keys like “name” and “dietary”), so the system knows what each value means.

Key features:

  • Flexible schema: records can have different fields
  • Self-describing: tags or keys label each value
  • Common formats: JSON, XML, YAML
  • Stored in document databases (like Azure Cosmos DB) or other NoSQL stores
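As a rough sketch of what "flexible schema" means in practice, the snippet below reads the two student profiles with Python's built-in json module. The profiles are written as complete JSON objects inside a list so they parse; the fallback text "not specified" is an assumption made for this example.

```python
import json

# The two profiles from Aisha's example as a JSON array of objects.
raw_profiles = """
[
  {"name": "Aisha", "email": "aisha@uni.ac.nz"},
  {"name": "Liam", "email": "liam@uni.ac.nz",
   "phone": "021-555-0199", "dietary": "vegetarian"}
]
"""

profiles = json.loads(raw_profiles)

for profile in profiles:
    # .get() returns a default instead of raising KeyError when a key is absent.
    dietary = profile.get("dietary", "not specified")
    print(f"{profile['name']}: fields={sorted(profile)}, dietary={dietary}")

# Each record reports its own set of keys; there is no shared column list.
field_sets = [set(p) for p in profiles]
```

Because the keys travel with each record, the reading code has to allow for missing fields. That is the trade-off for not declaring a schema up front.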

Unstructured data

Unstructured data has no predefined format. It’s the photos, videos, audio files, PDFs, and emails that don’t fit into rows or columns.

Aisha’s example: Students upload photos of their meals for reviews. Each photo is a blob of pixels: no columns, no keys, no rows. A computer can store it and display it, but it can’t “query” the photo the way it queries a table.

Key features:

  • No schema: no rows, columns, or tags
  • Examples: images, videos, audio, PDFs, Word documents
  • Stored in file storage or blob storage (like Azure Blob Storage)
  • Requires specialised tools (AI, search engines) to analyse content
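A small sketch of what a storage system can know about an unstructured file without understanding its content. The bytes below are a hypothetical stand-in for an uploaded photo, not a real image.

```python
import hashlib

# Hypothetical stand-in for an uploaded photo: 16 arbitrary bytes.
photo_bytes = bytes([0xFF, 0xD8, 0xFF, 0xE0]) + b"\x00" * 12

# Storage can report the size and a checksum without interpreting the content.
size_in_bytes = len(photo_bytes)
checksum = hashlib.sha256(photo_bytes).hexdigest()

# There are no keys or columns to ask for; indexing yields raw byte values only.
first_byte = photo_bytes[0]

print(size_in_bytes)
print(first_byte)
```

Questions such as "is there a salad in this photo?" cannot be answered from size or checksum. That is where the specialised tools mentioned above come in.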

Structured vs semi-structured vs unstructured data

| Feature | Structured | Semi-structured | Unstructured |
| --- | --- | --- | --- |
| Schema | Fixed (defined upfront) | Flexible (self-describing) | None |
| Format | Tables (rows & columns) | JSON, XML, YAML | Images, video, audio, PDFs |
| Queryable? | Yes, with SQL queries | Yes, with the right tools | Not directly; needs AI or search |
| Example | Order table in a database | User profile in JSON | Photo uploaded by a customer |
| Azure storage | Azure SQL Database | Azure Cosmos DB | Azure Blob Storage |

Exam tip: how the exam tests this

The exam loves giving you a scenario and asking “what type of data is this?” Look for these clues:

  • Fixed columns, every row the same → Structured
  • Tags or keys, flexible fields → Semi-structured
  • Files with no internal organisation → Unstructured

A tricky pattern: “an email” can be semi-structured (it has To, From, Subject fields) OR unstructured (the body text). Similarly, log files can be semi-structured (if they have timestamps and severity levels in a consistent format) or unstructured (if they’re free-form text). The exam usually clarifies by asking about a specific aspect.
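The email case can be sketched with Python's built-in email package. The message below is made up for illustration; the addresses and wording are hypothetical.

```python
from email import message_from_string

# A made-up message: header lines, a blank line, then the body.
raw_email = (
    "From: aisha@uni.ac.nz\n"
    "To: support@example.com\n"
    "Subject: Missing order\n"
    "\n"
    "Hi, my chicken wrap never turned up. Can someone look into it?\n"
)

message = message_from_string(raw_email)

# Headers behave like labelled fields (the semi-structured part).
sender = message["From"]
subject = message["Subject"]

# The body is free text (the unstructured part); it has no fields inside it.
body = message.get_payload()

print(sender, "|", subject)
print(body)
```

The headers can be looked up by name, while the body comes back as one string. Extracting the fact that an order is missing from that string would take text search or language analysis.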

Why does it matter?

The type of data determines:

  1. Where you store it: relational database? Document database? Blob storage?
  2. How you query it: SQL? API calls? AI-powered search?
  3. How you process it: direct queries? ETL pipelines? Machine learning?

Choosing the wrong storage for your data type leads to wasted money, slow performance, or lost information. The rest of this course teaches you how to make those choices on Azure.

Real-world mix: most systems use all three

Real applications almost always combine all three types. Take Priya’s FreshMart grocery chain:

  • Structured: Sales transactions (item, price, quantity, date, store)
  • Semi-structured: Product metadata from suppliers (JSON files with varying attributes per category)
  • Unstructured: Security camera footage, customer feedback emails, scanned invoices

A complete data solution handles all three, which is why Azure offers different services for each.

Meet the characters

Throughout this course, you’ll follow four people working with data:

| Character | Who They Are | Data Scenarios |
| --- | --- | --- |
| 📈 Priya Sharma | Senior Data Analyst at FreshMart Groceries (50 stores) | Analysing sales trends, building dashboards, Power BI reports |
| 💼 Tom Chen | Business Ops Manager at Pacific Freight (logistics, 200 drivers) | Tracking deliveries, querying databases, optimising routes |
| 🎓 Aisha Mohammed | CS student at Auckland University (building a campus food app) | Learning data basics, choosing storage, first database decisions |
| 🏒 Jake Bennett | Solo DBA at CloudPulse (SaaS startup, 15 developers) | Managing production databases, scaling storage, choosing Azure services |

Flashcards

Question

What are the three types of data?

Answer

Structured (rows & columns, fixed schema), semi-structured (flexible schema, self-describing tags like JSON), and unstructured (no schema: images, videos, PDFs).

Question

What makes semi-structured data 'self-describing'?

Answer

Each value is paired with a label (key or tag), so the data itself explains what each field means, like JSON keys or XML tags. No separate schema definition is needed.

Question

Give one example each of structured, semi-structured, and unstructured data.

Answer

Structured: a sales table in a database. Semi-structured: a JSON product catalogue. Unstructured: a customer photo uploaded to a website.

Knowledge check

Aisha's food ordering app stores meal reviews as free-text comments with no fixed format. What type of data is this?

Tom's logistics company receives shipment data from partners in JSON format. Each partner includes different fields β€” some include weight, others include dimensions. What type of data is this?

Which of the following is the BEST description of structured data?

Next up: Data File Formats: CSV, JSON, Parquet & More. Learn how data is physically stored in files.