Domain 1 β€” Module 6 of 7 86%
6 of 27 overall
Domain 1: Core Data Concepts Free ⏱ ~10 min read

Data Roles: DBA, Engineer & Analyst

Data doesn't manage itself. Meet the three key roles that keep data flowing β€” the database administrator, data engineer, and data analyst.

Who works with data?

Simple explanation

Think of data like water in a city.

The data engineer builds the pipes and pumps β€” they design the infrastructure that moves data from where it’s created to where it’s needed.

The database administrator (DBA) maintains the reservoirs β€” they keep databases running, secure, backed up, and performing well.

The data analyst is the person who turns on the tap and uses the water β€” they query data, build reports, and turn raw numbers into insights for decision-makers.

The three data roles

Database Administrator (DBA)

The DBA is the guardian of the database. They keep it running, secure, and recoverable.

Jake IS a DBA at CloudPulse. His daily responsibilities:

ResponsibilityWhat Jake Actually Does
Manage databasesProvisions Azure SQL databases, configures settings, monitors health
SecuritySets up user permissions, configures firewall rules, enables encryption
Backup & recoveryConfigures automated backups, tests restore procedures, plans disaster recovery
Performance tuningIdentifies slow queries, creates indexes, adjusts resource allocation
AvailabilityEnsures databases are online 24/7, sets up failover groups
Patching & updatesApplies security patches, plans maintenance windows

Key tools: Azure SQL Database management tools, Azure portal, SQL Server Management Studio (SSMS)

Data Engineer

The data engineer is the builder of pipelines. They design the systems that move data from source to destination.

At FreshMart, the data engineering team builds the infrastructure that feeds Priya’s dashboards:

ResponsibilityWhat the Engineering Team Does
Build data pipelinesCreate ETL/ELT processes that extract data from POS systems, clean it, and load it into the warehouse
Design data architectureChoose between data lakes, warehouses, and lakehouses β€” plan how data flows through the organisation
Ensure data qualityValidate data, handle missing values, enforce standards across sources
Manage data platformsSet up and maintain Microsoft Fabric, Azure Databricks, or Azure Data Factory
Optimise for scaleDesign systems that handle growing data volumes without breaking
Security & complianceImplement data governance, manage access to sensitive data in pipelines

Key tools: Microsoft Fabric, Azure Data Factory, Azure Databricks, Apache Spark, Python, SQL

Data Analyst

The data analyst is the storyteller. They turn raw data into insights that drive business decisions.

Priya IS a data analyst at FreshMart. Her responsibilities:

ResponsibilityWhat Priya Actually Does
Explore dataQueries the data warehouse to understand patterns and anomalies
Build reports & dashboardsCreates Power BI reports showing sales trends, inventory levels, store performance
Data modellingDesigns star schemas, creates measures and calculated columns in Power BI
Visualise insightsChooses the right charts and visuals to communicate findings clearly
Collaborate with stakeholdersWorks with store managers to understand what data they need for decisions
Monitor KPIsTracks key metrics and sets up alerts when numbers go outside expected ranges

Key tools: Power BI, Excel, SQL, Microsoft Fabric (for querying)

Database administrator vs data engineer vs data analyst
FeatureDBAData EngineerData Analyst
FocusDatabase health & securityData pipelines & infrastructureInsights & reporting
Works withOperational databasesData lakes, warehouses, pipelinesReports, dashboards, models
Key question'Is the database running well?''How does data get from A to B?''What does the data tell us?'
Primary toolsSSMS, Azure portalFabric, Data Factory, Databricks, SparkPower BI, Excel, SQL
Our characterJake (CloudPulse)FreshMart engineering teamPriya (FreshMart)
Exam tip: role-matching questions

The exam gives you a task and asks β€œwhich role is responsible?” Quick guide:

  • β€œConfigure backup and failover” β†’ DBA
  • β€œBuild an ETL pipeline” β†’ Data Engineer
  • β€œCreate a Power BI dashboard” β†’ Data Analyst
  • β€œTune a slow database query” β†’ DBA
  • β€œDesign the data warehouse schema” β†’ Data Engineer
  • β€œPresent sales trends to management” β†’ Data Analyst

Tricky overlap: Both DBAs and data engineers work with databases, but DBAs focus on operational health while engineers focus on data movement and architecture.

Other roles you might encounter

The exam focuses on three roles, but you may see these mentioned:

  • Data Scientist: Uses statistics and machine learning to build predictive models. Overlaps with data analysts but focuses on prediction rather than description.
  • Data Steward: Manages data governance β€” ensures data quality, compliance, and proper access controls across the organisation.
  • Business Analyst: Similar to data analyst but more focused on business processes and requirements than technical data work.

Flashcards

Question

What is the primary responsibility of a database administrator (DBA)?

Click or press Enter to reveal answer

Answer

Managing the operational health of databases β€” availability, security, backup/recovery, performance tuning, and user access control.

Click to flip back

Question

What is the primary responsibility of a data engineer?

Click or press Enter to reveal answer

Answer

Designing and building data pipelines and infrastructure β€” ETL/ELT processes, data architecture, data platforms (Fabric, Databricks), and ensuring data quality at scale.

Click to flip back

Question

What is the primary responsibility of a data analyst?

Click or press Enter to reveal answer

Answer

Exploring data, building reports and dashboards (Power BI), creating data models, and communicating insights to business stakeholders.

Click to flip back

Knowledge check

Knowledge Check

FreshMart's data warehouse is running slowly. Queries that used to take 2 seconds now take 30 seconds. Who should investigate and fix the performance issue?

Knowledge Check

A new data source (IoT sensors from delivery trucks) needs to be connected to Pacific Freight's data warehouse. The data needs to be cleaned, transformed from JSON to tabular format, and loaded every hour. Who should build this?

Next up: The Azure Data Landscape β€” a map of all the Azure data services you’ll explore in the rest of this course.