
Telemetry Module

Introduction

In this module, we'll implement comprehensive observability features in our weather application using Aspire's built-in OpenTelemetry support. We'll cover three main aspects:

  1. Structured Logging: Implementing context-rich logging that makes it easier to trace and debug requests through the system
  2. Custom Metrics: Creating and tracking application-specific metrics like request counts, durations, and cache performance
  3. Distributed Tracing: Adding request tracing to follow operations across service boundaries

You'll learn how to:

  • Create a diagnostic infrastructure for metrics and tracing
  • Implement structured logging with scopes and semantic context
  • Add custom metrics for tracking application behavior
  • Use distributed tracing to follow requests through the system
  • Test and observe the telemetry data in the Aspire dashboard
  • Integrate with external observability platforms

OpenTelemetry in Aspire ServiceDefaults

Aspire's ServiceDefaults project automatically configures OpenTelemetry for your application. When you call builder.AddServiceDefaults(), it:

  1. Configures structured logging with OpenTelemetry
  2. Sets up distributed tracing with common instrumentation:
    • ASP.NET Core
    • HTTP client calls
    • Runtime metrics
    • Service discovery
  3. Configures metrics collection and export
  4. Enables integration with the Aspire dashboard

This means you don't need to manually configure the basic OpenTelemetry infrastructure. You can focus on adding your application-specific telemetry.

Note: The custom meters and ActivitySource you add in this module are additive. ServiceDefaults has already wired up baseline OpenTelemetry for ASP.NET Core, HttpClient, and runtime metrics; you're layering your app-specific telemetry on top.
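For context, here's a minimal Program.cs sketch of where that call sits (an illustrative sketch assuming the standard Aspire starter template, where the service project references ServiceDefaults):

```csharp
var builder = WebApplication.CreateBuilder(args);

// One call wires up OpenTelemetry logging, tracing, and metrics with the
// baseline instrumentation listed above (ASP.NET Core, HttpClient, runtime),
// plus service discovery and HTTP resilience defaults.
builder.AddServiceDefaults();

var app = builder.Build();

// Maps the default health-check endpoints (e.g., /health, /alive) in development.
app.MapDefaultEndpoints();

app.Run();
```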

Implementing Custom Metrics

We'll create a diagnostic infrastructure to track specific metrics about our weather service:

  1. Create a new file NwsManagerDiagnostics.cs in the Api/Data folder.
  2. Add the following code to define custom metrics:
using System.Diagnostics;
using System.Diagnostics.Metrics;

namespace Api.Diagnostics
{
    public static class NwsManagerDiagnostics
    {
        private static readonly Meter meter = new Meter("NwsManagerMetrics", "1.0");

        public static readonly Counter<int> forecastRequestCounter =
            meter.CreateCounter<int>("forecast_requests_total", description: "Total number of forecast requests");
        public static readonly Histogram<double> forecastRequestDuration =
            meter.CreateHistogram<double>("forecast_request_duration_seconds", unit: "s", description: "Histogram of forecast request durations");
        public static readonly Counter<int> failedRequestCounter =
            meter.CreateCounter<int>("failed_requests_total", description: "Total number of failed requests");
        public static readonly Counter<int> cacheHitCounter =
            meter.CreateCounter<int>("cache_hits_total", description: "Total number of cache hits");
        public static readonly Counter<int> cacheMissCounter =
            meter.CreateCounter<int>("cache_misses_total", description: "Total number of cache misses");

        public static readonly ActivitySource activitySource = new ActivitySource("NwsManager");
    }
}

Note that the descriptions are passed via the description: named argument; the second positional parameter of CreateCounter and CreateHistogram is the unit, not the description.

Understanding the Metrics

Our custom metrics track several key aspects of the weather service:

  • Request Counts: Total number of forecast requests made
  • Request Duration: How long forecast requests take to complete
  • Error Rates: Number of failed requests
  • Cache Performance: Cache hit and miss rates
  • Distributed Tracing: Activity source for following requests
  3. Register the custom meter and activity source in Program.cs so the OpenTelemetry pipeline collects and exports them:
builder.Services.AddOpenTelemetry()
    .WithMetrics(metrics => metrics.AddMeter("NwsManagerMetrics"))
    .WithTracing(tracing => tracing.AddSource("NwsManager"));
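The AddMeter and AddSource extension methods come from the OpenTelemetry packages that ServiceDefaults already references, so Program.cs needs these usings:

```csharp
// Namespaces providing the AddMeter/AddSource extension methods
using OpenTelemetry.Metrics;
using OpenTelemetry.Trace;
```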

Implementing Telemetry in NwsManager

Now we'll update the NwsManager class to use our metrics and implement structured logging. We'll need to start by adding ILogger to the primary constructor for the NwsManager class:

   public class NwsManager(HttpClient httpClient, 
    IMemoryCache cache, 
    IWebHostEnvironment webHostEnvironment, 
    ILogger<NwsManager> logger)

Next, we'll update GetForecastByZoneAsync with several observability features. Make the following changes, taking care to preserve the existing API call logic. You can refer to the completed code for this lesson if needed.

private static int forecastCount = 0;

public async Task<Forecast[]> GetForecastByZoneAsync(string zoneId)
{
    // Create a logging scope with structured data
    using var logScope = logger.BeginScope(new Dictionary<string, object>
    {
        ["ZoneId"] = zoneId,
        ["RequestNumber"] = Interlocked.Increment(ref forecastCount)
    });

    // Record the request in our metrics
    NwsManagerDiagnostics.forecastRequestCounter.Add(1);
    var stopwatch = Stopwatch.StartNew();

    // Create a trace activity
    using var activity = NwsManagerDiagnostics.activitySource.StartActivity("GetForecastByZoneAsync");
    activity?.SetTag("zone.id", zoneId);

    logger.LogInformation("🚀 Starting forecast request for zone {ZoneId}", zoneId);

    try 
    {
        // Simulate a failure on every fifth request to test error handling
        if (forecastCount % 5 == 0)
        {
            throw new Exception("Random exception thrown by NwsManager.GetForecastByZoneAsync");
        }

        var zoneIdSegment = Uri.EscapeDataString(zoneId);
        var forecasts = await httpClient.GetFromJsonAsync<ForecastResponse>($"zones/forecast/{zoneIdSegment}/forecast", options);
        stopwatch.Stop();

    // Record the request duration, tagged with the zone
    NwsManagerDiagnostics.forecastRequestDuration.Record(
        stopwatch.Elapsed.TotalSeconds,
        new KeyValuePair<string, object?>("zone.id", zoneId));
        activity?.SetTag("request.success", true);

        var result = forecasts
            ?.Properties
            ?.Periods
            ?.Select(p => (Forecast)p)
            .ToArray() ?? [];

        logger.LogInformation(
            "📊 Retrieved forecast for zone {ZoneId} in {Duration:N0}ms with {PeriodCount} periods",
            zoneId,
            stopwatch.Elapsed.TotalMilliseconds,
            result.Length
        );

        return result;
    }
    catch (HttpRequestException ex)
    {
        stopwatch.Stop();
        NwsManagerDiagnostics.failedRequestCounter.Add(1);
        activity?.SetTag("request.success", false);
        
        logger.LogError(
            ex,
            "❌ Failed to retrieve forecast for zone {ZoneId}. Status: {StatusCode}",
            zoneId,
            ex.StatusCode
        );
        throw;
    }
    catch (Exception ex)
    {
        stopwatch.Stop();
        NwsManagerDiagnostics.failedRequestCounter.Add(1);
        activity?.SetTag("request.success", false);
        
        logger.LogError(
            ex,
            "❌ Unexpected error fetching forecast for zone {ZoneId} after {ElapsedMs}ms",
            zoneId,
            stopwatch.Elapsed.TotalMilliseconds
        );
        throw;
    }
}

This implementation shows how our custom metrics work with structured logging:

  • Request metrics track performance and error rates
  • Logging scopes group related log entries
  • Trace activities connect logs across service boundaries
  • Log messages include structured data for better analysis
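To see these instruments behave in isolation, here is a small self-contained sketch (a hypothetical console app, separate from the workshop code) that uses the BCL's MeterListener to observe measurements the same way an exporter would. The meter and counter names mirror the ones defined earlier:

```csharp
using System;
using System.Diagnostics.Metrics;

var meter = new Meter("NwsManagerMetrics", "1.0");
var forecastRequestCounter = meter.CreateCounter<int>("forecast_requests_total");

long observed = 0;

using var listener = new MeterListener();

// Subscribe to every instrument published by our meter.
listener.InstrumentPublished = (instrument, l) =>
{
    if (instrument.Meter.Name == "NwsManagerMetrics")
        l.EnableMeasurementEvents(instrument);
};

// Invoked synchronously for each Add() on an enabled int-typed instrument.
listener.SetMeasurementEventCallback<int>((instrument, value, tags, state) =>
{
    observed += value;
});

listener.Start();

forecastRequestCounter.Add(1);
forecastRequestCounter.Add(1);

Console.WriteLine(observed); // prints 2
```

The OpenTelemetry SDK registered by ServiceDefaults uses this same listener mechanism under the hood, which is why calling AddMeter("NwsManagerMetrics") is all that's needed to route these counters to the dashboard.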

Implementing Cache Metrics

Now let's enhance the GetZonesAsync method to properly track cache hit and miss metrics. Currently, the method only tracks cache misses when data is loaded from the file. We need to add cache hit tracking when data is retrieved from memory cache.

Update the GetZonesAsync method to include cache hit tracking:

public async Task<Zone[]?> GetZonesAsync()
{
    using var activity = NwsManagerDiagnostics.activitySource.StartActivity("GetZonesAsync");

    logger.LogInformation("🚀 Starting zones retrieval with {CacheExpiration} cache expiration", TimeSpan.FromHours(1));

    // Check if data exists in cache first
    if (cache.TryGetValue("zones", out Zone[]? cachedZones))
    {
        // Record cache hit
        NwsManagerDiagnostics.cacheHitCounter.Add(1);
        activity?.SetTag("cache.hit", true);
        
        logger.LogInformation("📈 Retrieved {ZoneCount} zones from cache (cache hit)", cachedZones?.Length ?? 0);
        return cachedZones;
    }

    return await cache.GetOrCreateAsync("zones", async entry =>
    {
        entry.AbsoluteExpirationRelativeToNow = TimeSpan.FromHours(1);
        
        // Record cache miss when we need to load from file
        NwsManagerDiagnostics.cacheMissCounter.Add(1);
        activity?.SetTag("cache.hit", false);

        var zonesFilePath = Path.Combine(webHostEnvironment.WebRootPath, "zones.json");
        if (!File.Exists(zonesFilePath))
        {
            logger.LogWarning("⚠️ Zones file not found at {ZonesFilePath}", zonesFilePath);
            return [];
        }

        using var zonesJson = File.OpenRead(zonesFilePath);
        var zones = await JsonSerializer.DeserializeAsync<ZonesResponse>(zonesJson, options);

        if (zones?.Features == null)
        {
            logger.LogWarning("⚠️ Failed to deserialize zones from file");
            return [];
        }

        var filteredZones = zones.Features
            .Where(f => f.Properties?.ObservationStations?.Count > 0)
            .Select(f => (Zone)f)
            .Distinct()
            .ToArray();

        logger.LogInformation(
            "📊 Retrieved {TotalZones} zones, {FilteredZones} after filtering (cache miss)",
            zones.Features.Count,
            filteredZones.Length
        );

        return filteredZones;
    });
}

Cache Metrics Explained

The enhanced implementation demonstrates both cache hit and miss scenarios:

  • Cache Hit: When cache.TryGetValue() successfully retrieves data from memory, we increment cacheHitCounter
  • Cache Miss: When data isn't in cache and GetOrCreateAsync needs to load from file, we increment cacheMissCounter
  • Activity Tags: We set cache.hit tags for distributed tracing to track cache performance across requests
  • Structured Logging: Different log messages help distinguish between cache hits and misses in the dashboard

Testing the Implementation

  1. Run the application using the Aspire dashboard
  2. Open the "Structured" tab in the dashboard
  3. Click on several different cities in the weather app
  4. Observe the telemetry data:

Structured Logs View

  • In the "Structured" tab:
    • Set different log level filters
    • Search by zone ID or other properties
    • View full structured data in "Details"
    • See how log entries are grouped by request scope

Metrics View

  • In the "Metrics" tab, observe:
    • forecast_requests_total
    • forecast_request_duration_seconds
    • failed_requests_total
    • cache_hits_total
    • cache_misses_total

Traces View

  • In the "Traces" tab:
    • Find a trace for GetForecastByZoneAsync
    • See how logs connect to traces
    • Observe the complete request flow
    • View timing and success/failure data
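A side note on the activity?. null-conditional used throughout NwsManager: ActivitySource.StartActivity returns null unless something is listening, which in the running app is the OpenTelemetry SDK configured by ServiceDefaults. A self-contained sketch (hypothetical console app) using the BCL's ActivityListener shows both cases:

```csharp
using System;
using System.Diagnostics;

var source = new ActivitySource("NwsManager");

// No listener registered yet, so StartActivity returns null —
// this is why NwsManager guards every call with activity?.
Console.WriteLine(source.StartActivity("GetForecastByZoneAsync") is null); // True

// Register a listener that samples everything from this source,
// as the OpenTelemetry SDK does once it is wired up.
using var listener = new ActivityListener
{
    ShouldListenTo = s => s.Name == "NwsManager",
    Sample = (ref ActivityCreationOptions<ActivityContext> _) =>
        ActivitySamplingResult.AllDataAndRecorded
};
ActivitySource.AddActivityListener(listener);

using var activity = source.StartActivity("GetForecastByZoneAsync");
activity?.SetTag("zone.id", "WAZ558"); // "WAZ558" is just an example zone ID

Console.WriteLine(activity?.DisplayName); // GetForecastByZoneAsync
```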

Telemetry Integrations

Aspire's OpenTelemetry infrastructure makes it easy to integrate with various observability platforms. Here are some popular options:

Cloud Provider Solutions

  • Azure Monitor Application Insights: Native integration with Azure services. Learn more
  • AWS CloudWatch: Monitoring for AWS deployments. Learn more
  • Google Cloud Operations (formerly Stackdriver): Monitoring for GCP. Learn more

Third-Party Services

  • New Relic: Full-stack observability platform. Learn more
  • Datadog: Infrastructure and application monitoring. Learn more
  • Honeycomb: Observability for distributed systems. Learn more
  • Grafana: Visualization and analytics platform. Learn more
  • Dynatrace: Application performance monitoring. Learn more

Example: Adding Azure Monitor

To send telemetry to Azure Application Insights:

Note

Getting the connection string: In the Azure Portal, create (or open an existing) Application Insights resource (Create a resource > Application Insights). On the resource Overview page, copy the Connection string value from the Essentials panel. You can also retrieve it via CLI:

az monitor app-insights component show --app <app-insights-name> --resource-group <resource-group> --query connectionString --output tsv


  1. Install the NuGet package:

    dotnet add package Azure.Monitor.OpenTelemetry.AspNetCore
  2. Update your OpenTelemetry setup (the workshop adds this in ServiceDefaults/Extensions.cs so every service inherits it):

    // In ServiceDefaults/Extensions.cs
    if (!string.IsNullOrEmpty(builder.Configuration["APPLICATIONINSIGHTS_CONNECTION_STRING"]))
    {
        builder.Services.AddOpenTelemetry()
            .UseAzureMonitor();
    }
  3. Set your connection string in configuration or environment variables:

    {
      "APPLICATIONINSIGHTS_CONNECTION_STRING": "your-connection-string"
    }

For other integrations, consult their respective documentation for .NET OpenTelemetry setup instructions.

AI-Powered Observability with Aspire MCP

Now that your application produces rich telemetry data, imagine letting an AI coding agent query those logs, traces, and metrics in real time — right from your editor. That's exactly what the Aspire MCP (Model Context Protocol) server enables.

What is Aspire MCP?

The Aspire dashboard exposes an MCP server that AI coding assistants can connect to. Once connected, your AI agent can:

  • Query live resource states — see which services are running, healthy, or failing
  • Stream console logs — ask "why is my API returning 500s?" and the agent reads the logs for you
  • Inspect distributed traces — the agent can follow a request across service boundaries, just like you do in the Traces tab
  • Run resource commands — restart a service or clear a cache without leaving your editor

Setting Up MCP

The Aspire CLI makes setup effortless:

aspire mcp init

This command:

  1. Detects your AI tools — VS Code, GitHub Copilot, Claude Code, or others
  2. Generates configuration files — e.g., .vscode/mcp.json for VS Code
  3. Optionally creates an AGENTS.md — project-specific instructions to help the AI agent understand your codebase

That's it! No API keys to manage, no endpoints to configure. The AI agent launches aspire mcp start as a subprocess and connects automatically via stdio.

Try It Out

  1. Make sure your Aspire application is running (aspire run)
  2. Run aspire mcp init and follow the prompts
  3. Open your AI assistant and ask a question like:
    • "What resources are running in my Aspire app?"
    • "Show me the recent error logs from the API"
    • "What's the p95 latency for forecast requests?"

The agent reads the same telemetry you've been building throughout this module — your custom NwsManagerMetrics, your ActivitySource traces, and your structured logs — but now through a conversational interface.

Supported AI Assistants

  • VS Code (Copilot Chat): .vscode/mcp.json (auto-generated)
  • GitHub Copilot CLI: ~/.copilot/mcp-config.json
  • Claude Code: .claude/mcp.json
  • OpenCode: detected automatically


Next: Module #7: Database Integration