Domain 3: Monitor and Optimize an Analytics Solution

Troubleshoot Streaming & Shortcuts

Identify and resolve Eventhouse, Eventstream, and OneLake shortcut errors: ingestion failures, processing lag, missing data, and connectivity issues.

Streaming and shortcut errors

Simple explanation

Think of a live TV broadcast that stutters, freezes, or shows a blank screen.

Streaming errors in Fabric are similar: events stop arriving (Eventstream errors), queries against real-time data fail (Eventhouse errors), or data behind a shortcut becomes unreachable (shortcut errors). Each has different causes and fixes.

Eventstream errors

Most Eventstream errors trace back to source connectivity or capacity limits.

| Error pattern | Cause | Resolution |
| --- | --- | --- |
| Processing lag increasing | Events arriving faster than processing capacity | Scale up Eventstream capacity, simplify transformations, add partitions to Event Hub |
| Events not arriving | Source disconnected, Event Hub key expired, consumer group full | Check source connectivity, refresh credentials, verify consumer group isn't at max readers |
| Schema validation errors | Incoming events don't match expected schema | Update schema in Eventstream, add error handling for malformed events |
| Destination write failures | Target KQL database or lakehouse is unavailable or full | Check destination status, verify permissions, check capacity |
| Duplicate events | At-least-once delivery combined with source retries | Implement deduplication at the destination (MERGE or distinct on event ID) |
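
The deduplication fix for duplicate events can be sketched in a few lines. This is a minimal illustration in plain Python, not Fabric's own mechanism: the `deduplicate` function and the `event_id` field name are assumptions made for the example, and a real destination would more likely use a MERGE or a distinct on the event ID as described above.

```python
from typing import Iterable, Iterator


def deduplicate(events: Iterable[dict], key: str = "event_id") -> Iterator[dict]:
    """Yield each event once, keeping the first occurrence of every key value.

    `key` is a hypothetical field name; use whatever uniquely identifies an event.
    """
    seen = set()
    for event in events:
        event_id = event[key]
        if event_id in seen:
            continue  # redelivery or source retry: drop it
        seen.add(event_id)
        yield event


batch = [
    {"event_id": "a1", "action": "play"},
    {"event_id": "a2", "action": "pause"},
    {"event_id": "a1", "action": "play"},  # retried delivery of the first event
]
unique = list(deduplicate(batch))
print([e["event_id"] for e in unique])  # ['a1', 'a2']
```

The `seen` set grows without bound here, so a long-running stream would need to limit it to a time window or rely on the destination's own deduplication.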

Eventhouse errors

| Error | Cause | Resolution |
| --- | --- | --- |
| Ingestion failure | Data format mismatch, column mapping error | Check ingestion mapping, verify source schema matches target table |
| Query timeout | Query scanning too much data, missing materialized views | Add time filters, create materialized views for common queries |
| Table hot limit | Too many concurrent ingestions to one table | Spread ingestion across multiple tables or increase capacity |
| Retention policy conflict | Data deleted by retention before queries expect it | Extend retention period or archive data to lakehouse before deletion |
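
A retention policy conflict comes down to simple date arithmetic: a query asks for data older than the table still holds. The sketch below makes that check explicit. It assumes retention is counted back from the current time as a fixed window, which simplifies how a real retention policy behaves, and `uncovered_window` is a hypothetical helper, not an Eventhouse API.

```python
from datetime import datetime, timedelta, timezone
from typing import Optional, Tuple


def uncovered_window(
    query_start: datetime, now: datetime, retention: timedelta
) -> Optional[Tuple[datetime, datetime]]:
    """Return the part of the query range that is older than the retention
    window, or None if the whole range is still retained."""
    oldest_retained = now - retention
    if query_start >= oldest_retained:
        return None
    return (query_start, oldest_retained)


now = datetime(2024, 1, 31, tzinfo=timezone.utc)
retention = timedelta(days=30)

# A 45-day lookback against 30 days of retention leaves a 15-day hole.
gap = uncovered_window(now - timedelta(days=45), now, retention)
print(gap[1] - gap[0])  # 15 days, 0:00:00

# A 7-day lookback is fully covered.
print(uncovered_window(now - timedelta(days=7), now, retention))  # None
```

If the check reports a gap, the options from the table apply: extend the retention period or archive the data to a lakehouse before it is deleted.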

OneLake shortcut errors

| Error | Cause | Resolution |
| --- | --- | --- |
| Access denied | Source credentials expired or user lacks source permissions | Re-authenticate the shortcut connection, verify source-side permissions |
| Source unavailable | External storage (S3, ADLS, GCS) is down or unreachable | Check source service health, verify network connectivity |
| Schema change | Source Delta table schema changed (columns added/removed) | Refresh the shortcut metadata, verify downstream queries handle new schema |
| Performance degradation | Large cross-cloud reads (S3/GCS latency) | Enable query acceleration, or consider mirroring for frequently accessed data |
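
To verify that downstream queries can handle a schema change, it helps to know exactly what changed. The following sketch compares the schema a query was written against with the schema the source reports now. The `diff_schema` function and the column names are hypothetical, and how the two schemas are obtained (for example, from Delta table metadata) is left out.

```python
from typing import Dict, List


def diff_schema(expected: Dict[str, str], current: Dict[str, str]) -> Dict[str, List[str]]:
    """Compare two {column: type} mappings and list what changed."""
    shared = expected.keys() & current.keys()
    return {
        "added": sorted(current.keys() - expected.keys()),
        "removed": sorted(expected.keys() - current.keys()),
        "retyped": sorted(c for c in shared if expected[c] != current[c]),
    }


# Schema the downstream query was written against (illustrative values).
expected = {"id": "long", "title": "string", "duration": "int"}
# Schema the shortcut source reports after the change.
current = {"id": "long", "title": "string", "duration": "long", "genre": "string"}

print(diff_schema(expected, current))
# {'added': ['genre'], 'removed': [], 'retyped': ['duration']}
```

Added columns are usually harmless to existing queries, while removed or retyped columns are the ones most likely to break them.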

Scenario: Zoe troubleshoots a streaming gap

WaveMedia's real-time dashboard shows a 10-minute gap in playback data. Zoe investigates:

  1. Eventstream health: Processing lag spiked to 8 minutes at 2:15 PM, then events stopped
  2. Source check: Event Hub shows events are still being produced (sender metrics normal)
  3. Consumer group: The Eventstream's consumer group shows "connection closed" at 2:15 PM
  4. Root cause: The Event Hub access key was rotated at 2:15 PM as part of a scheduled security rotation
  5. Fix: Update the Eventstream's Event Hub connection with the new key → events resume flowing

Lesson: Coordinate key rotations with downstream consumers. Better: use managed identity instead of keys.
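
Zoe's order of checks can be written down as a small decision function. This is a sketch of the reasoning only: the `Signals` fields and the 5-minute lag threshold are assumptions made for the example, and in practice the values would be read by hand from the Eventstream and Event Hub monitoring views.

```python
from dataclasses import dataclass


@dataclass
class Signals:
    source_producing: bool    # sender metrics on the source look normal
    consumer_connected: bool  # the Eventstream's consumer group has an open connection
    lag_minutes: float        # most recent processing lag


def triage(s: Signals, lag_threshold: float = 5.0) -> str:
    """Suggest where to look next, checking the source first and capacity last."""
    if not s.source_producing:
        return "check the source: no events are being produced"
    if not s.consumer_connected:
        return "check credentials and consumer group: the consumer is disconnected"
    if s.lag_minutes > lag_threshold:
        return "check capacity and transformations: processing is falling behind"
    return "no streaming problem detected"


# The situation at 2:15 PM: the source is healthy, but the connection closed.
print(triage(Signals(source_producing=True, consumer_connected=False, lag_minutes=8.0)))
# check credentials and consumer group: the consumer is disconnected
```

A disconnected consumer is checked before lag because, as in the scenario, a lag spike can be a symptom of a failing connection rather than a capacity problem.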


Question

What is the most common cause of Eventstream processing lag?

Answer

Events arriving faster than the Eventstream can process them, usually due to insufficient capacity, complex inline transformations, or too few Event Hub partitions. Fix: scale capacity, simplify transforms, increase partitions.

Question

What happens when a OneLake shortcut's source credentials expire?

Answer

Queries against the shortcut fail with 'access denied.' The shortcut still exists but cannot read data. Fix: re-authenticate the connection in the shortcut settings.


Knowledge Check

Zoe's Eventstream suddenly stops ingesting events. The Event Hub shows events are still being produced. What should she check first?

Knowledge Check

A KQL query against an Eventhouse table returns 'query timeout' on a table with 2 billion rows. The query has no time filter. What is the best fix?

Next up: Optimize Lakehouse Tables: Delta Tuning. Use OPTIMIZE, VACUUM, Z-ordering, and V-ordering to make your Delta tables fast.