Azure · AZ-305

Operational Complexity Underestimation — Azure Solutions Architect (AZ-305)

The answer is correct but operationally expensive. The exam prefers managed services over self-managed when both meet functional requirements.

When a Smaller Team Changes the Best Design

A multi-service solution with separate failure domains, manual coordination steps, and custom health monitoring can outperform a simpler alternative on every technical axis and still be the wrong answer. When the scenario states a small team, limited DevOps maturity, or an explicit overhead-reduction goal, the management surface area of a design tips the answer away from the richer design. A consolidated managed service that trades some feature depth for reduced operational burden scores higher than a capable but staff-intensive one. Count every service boundary in the proposed architecture; each one is a place a small team has to own.

20%of exam questions affected (40 of 200)

The Scenario

A company needs to deploy a .NET 8 REST API backend. You recommend Azure VMs in an Availability Set with a Load Balancer, VM Scale Sets for auto-scaling, and custom Azure Monitor dashboards. The correct answer is Azure App Service on a Standard tier plan. The scenario said "reduce management effort" and the workload is a standard web API with no special OS requirements. App Service gives you built-in auto-scaling, health monitoring, deployment slots, SSL termination, and managed patching. VMs require you to configure and maintain all of that yourself.

How to Spot It

  • Azure App Service, Azure Functions, and Azure Container Apps are the exam-preferred answers when scenarios mention operational simplicity. VMs and AKS are correct when the scenario explicitly needs custom OS configuration, GPU compute, or Kubernetes-specific orchestration features.
  • The operational complexity spectrum in Azure: VMs (everything is your job) > AKS (infrastructure is managed, orchestration is yours) > Container Apps (auto-scaling and infrastructure managed) > App Service (deployment and infrastructure managed) > Functions (only code is yours). The exam tests whether you pick the right level.
  • When you see "small team" or "minimize management," count the operational tasks your answer creates: patching, scaling configuration, certificate management, monitoring setup, backup configuration. If a PaaS service handles these automatically, it is the correct answer.

Decision Rules

Whether to assign an Azure Policy definition at the Management Group scope (single assignment, automatic inheritance, continuous enforcement) versus deploying Azure Blueprints to each subscription individually (per-subscription assignment, versioning overhead, and provisioning lifecycle that multiplies administrative burden across fifteen subscriptions).

Azure Management GroupsAzure PolicyAzure Blueprints

Whether the batch ingestion workload scale and team skill profile justify a self-managed Spark cluster (Databricks) or whether a managed low-code data flow service (ADF mapping data flows) satisfies all stated constraints at lower operational cost.

Azure Data FactoryAzure DatabricksAzure Data Lake Storage

Whether a managed low-code orchestration service (Azure Data Factory mapping data flows) or a self-managed Spark platform (Azure Databricks) better satisfies a batch workload of moderate scale when the team lacks cluster-management expertise and the scenario forbids operational overhead.

Azure Data FactoryAzure DatabricksAzure Data Lake Storage

Whether Azure Batch's managed node-pool lifecycle, native task scheduling, and auto-scale-to-zero capability satisfies both the workload-density-target and the operational-complexity-budget constraint better than AKS with a batch job controller or self-managed VMs.

Azure BatchAzure Kubernetes Service (AKS)Azure Virtual Machines

Whether to route platform-level resource diagnostic logs through Azure Monitor Diagnostic Settings to a Log Analytics workspace configured with tiered (interactive + archive) retention, or to conflate platform-log compliance with application telemetry by targeting Application Insights — which cannot natively ingest platform diagnostic events and forces a custom export pipeline that adds operational failure-path complexity.

Azure MonitorAzure Log AnalyticsAzure Application Insights

Whether to satisfy the 180-day platform-log retention requirement by configuring Azure Monitor diagnostic settings to route to a Log Analytics workspace with a native retention policy, versus constructing a custom archival pipeline (e.g., Event Hubs to Blob Storage) or relying solely on Application Insights—which covers only the application telemetry layer and leaves the platform layer unobserved.

Azure MonitorAzure Log AnalyticsAzure Application Insights

Whether the stated latency SLA and access pattern (read-heavy, weekly update cadence, unstructured JSON) are fully satisfied by Azure Blob Storage plus Azure CDN edge caching, or whether adding Azure Cache for Redis provides measurable latency benefit that justifies the cache invalidation complexity and operational overhead it introduces.

Azure Blob StorageAzure CDNAzure Cache for Redis

Whether Azure API Management's built-in throttling policies, response caching, and backpressure combined with Azure Service Bus for async command decoupling satisfy all throughput, latency, and delivery-guarantee constraints without adding operational surface the team cannot sustain—versus a custom Azure Cache for Redis-backed rate limiter plus Azure Event Grid dispatch that looks performant but forces the team to own cache invalidation sequencing, distributed counter consistency, and event dead-letter handling.

Azure API ManagementAzure Service BusAzure Cache for Redis

Whether a sub-one-hour RTO for region-level VM failure requires continuous replication with automated failover (Azure Site Recovery) or can be satisfied by periodic backup-restore (Azure Backup) at lower apparent cost.

Azure Site RecoveryAzure BackupRecovery Services Vault

Whether the 15-minute RPO can be satisfied by scheduled backup or export cycles, or whether it mandates continuous replication via managed services; the answer also determines whether custom orchestration is justified or whether it introduces failure-path complexity that jeopardizes the 1-hour RTO.

Azure Site RecoveryAzure SQL DatabaseRecovery Services Vault

Domain Coverage

Design Identity, Governance, and Monitoring SolutionsDesign Data Storage SolutionsDesign Business Continuity SolutionsDesign Infrastructure Solutions

Difficulty Breakdown

Hard: 24Expert: 4Medium: 12

Related Patterns