OpenTelemetry support #186
base: main
Conversation
This is heavily borrowed from the OpenAI implementation. Fixes awaescher#185
@lmolkova - can you take a look?
```csharp
private static readonly Meter _chatMeter = new("OllamaSharp.ChatClient");

// TODO: add explicit histogram buckets once System.Diagnostics.DiagnosticSource 9.0 is used
private static readonly Histogram<double> _duration = _chatMeter.CreateHistogram<double>(GEN_AI_CLIENT_OPERATION_DURATION_METRIC_NAME, "s", "Measures GenAI operation duration.");
```
You may want to create suitable buckets for the histogram based on what a typical duration would be, using exponential-ish bucket sizes. E.g. https://github.com/dotnet/aspnetcore/blob/cfcb73ab7e63ad193d20566efb23b8bf93e494ba/src/Hosting/Hosting/src/Internal/HostingMetrics.cs#L30
Given that you could have a model that's only a few hundred parameters, or one that is several hundreds of billions, I'm not sure that it's really possible to determine what the typical duration would be.
@samsp-msft is this on .NET 10 only? I want to use it too.
@aaronpowell users can customize buckets; the advice is part of the spec (https://github.com/open-telemetry/semantic-conventions/blob/main/docs/gen-ai/gen-ai-metrics.md#metric-gen_aiclientoperationduration). Without it, everyone needs to customize all the time.
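For reference, that per-app customization looks something like this with the OpenTelemetry .NET SDK; a sketch assuming the meter name from this PR and the boundaries advised in the semconv doc linked above:

```csharp
using OpenTelemetry;
using OpenTelemetry.Metrics;

// Consumer-side override via a View: this is what every app has to repeat
// when the library itself doesn't carry bucket advice.
using var meterProvider = Sdk.CreateMeterProviderBuilder()
    .AddMeter("OllamaSharp.ChatClient")
    .AddView("gen_ai.client.operation.duration", new ExplicitBucketHistogramConfiguration
    {
        // Boundaries (in seconds) from the gen-ai semconv advice.
        Boundaries = new double[] { 0.01, 0.02, 0.04, 0.08, 0.16, 0.32, 0.64, 1.28, 2.56, 5.12, 10.24, 20.48, 40.96, 81.92 }
    })
    .Build();
```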
@aaronpowell - The spec seems to have good exponential buckets from 100ms to 80s. You should set those as the default buckets using the Advice option.
@lmolkova - that code is present in the .NET 9 branch, so you shouldn't have to wait for .NET 10.
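A minimal sketch of what that could look like on the library side, assuming the .NET 9 CreateHistogram overload that accepts an InstrumentAdvice (the overload the TODO above refers to):

```csharp
using System.Diagnostics.Metrics;

internal static class ChatMetrics
{
    private static readonly Meter _chatMeter = new("OllamaSharp.ChatClient");

    // Spec-advised default buckets (in seconds); consumers can still override via Views.
    private static readonly Histogram<double> _duration = _chatMeter.CreateHistogram<double>(
        "gen_ai.client.operation.duration",
        unit: "s",
        description: "Measures GenAI operation duration.",
        tags: null,
        advice: new InstrumentAdvice<double>
        {
            HistogramBucketBoundaries = new[] { 0.01, 0.02, 0.04, 0.08, 0.16, 0.32, 0.64, 1.28, 2.56, 5.12, 10.24, 20.48, 40.96, 81.92 }
        });
}
```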
OllamaSharp supports netstandard, so the implementation will be restricted to that.
Do you build just one version, or can you have version-specific ifdefs?
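If it helps, a sketch of the multi-targeting route: add net9.0 alongside netstandard2.0 in `<TargetFrameworks>` and gate the advice overload behind a preprocessor symbol. DurationAdvice here is a hypothetical static field holding the InstrumentAdvice from the sketch above:

```csharp
// Inside the same class as _chatMeter above.
#if NET9_0_OR_GREATER
    // net9.0 build: DiagnosticSource 9.0's advice overload is available.
    private static readonly Histogram<double> _duration = _chatMeter.CreateHistogram<double>(
        GEN_AI_CLIENT_OPERATION_DURATION_METRIC_NAME, "s", "Measures GenAI operation duration.",
        tags: null, advice: DurationAdvice);
#else
    // netstandard2.0 build: the advice overload doesn't exist; default buckets apply.
    private static readonly Histogram<double> _duration = _chatMeter.CreateHistogram<double>(
        GEN_AI_CLIENT_OPERATION_DURATION_METRIC_NAME, "s", "Measures GenAI operation duration.");
#endif
```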
To enable the instrumentation:

1. Set the instrumentation feature flag using one of the following options:
Is it really necessary to have a feature flag to enable telemetry? It forms yet another hurdle that developers need to jump through to get telemetry working, and frankly there are already too many of those as it is.
A simplification would be to take the same approach as .NET did with a couple of networking activities. As they are experimental, we added that to the name, so it would be clear when you enable them that they are not yet stable. E.g. https://github.com/dotnet/runtime/blob/6fe852a43152e45dc2e98bb747187d5bfe9f19e3/src/libraries/System.Net.NameResolution/src/System/Net/NameResolutionTelemetry.cs#L161
Short answer: I don't know; I was just mirroring the approach from https://github.com/openai/openai-dotnet/blob/main/docs/observability.md
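For context, the OpenAI approach is an AppContext switch checked at startup; a sketch with a hypothetical OllamaSharp switch name, mirroring OpenAI's "OpenAI.Experimental.EnableOpenTelemetry":

```csharp
// Opt in from the consuming app; the switch name is hypothetical for OllamaSharp.
AppContext.SetSwitch("OllamaSharp.Experimental.EnableOpenTelemetry", true);

// Inside the library, instrumentation is gated on the switch:
bool enabled = AppContext.TryGetSwitch("OllamaSharp.Experimental.EnableOpenTelemetry", out var value) && value;
```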
```csharp
internal class OpenTelemetryScope : IDisposable
{
    private static readonly ActivitySource _chatSource = new("OllamaSharp.ChatClient");
```
See the comment on the README: maybe rather than having a feature flag, just prepend "Experimental" to the names of the metric and activity sources, so it's explicit that the data may change.
The OpenAI SDK doesn't include that - https://github.com/openai/openai-dotnet/blob/main/src/Utility/Telemetry/OpenTelemetryScope.cs#L13-L14
Only because I never got around to merging openai/openai-dotnet#187, but it's approved, and I highly recommend using the approach @samsp-msft suggested.
Ok, I'll update based on that.
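A sketch of that rename, following the dotnet/runtime pattern (e.g. "Experimental.System.Net.NameResolution"); the prefix would be dropped once the gen-ai conventions stabilize:

```csharp
using System.Diagnostics;
using System.Diagnostics.Metrics;

internal class OpenTelemetryScope : IDisposable
{
    // The "Experimental." prefix signals the telemetry shape may still change.
    private static readonly ActivitySource _chatSource = new("Experimental.OllamaSharp.ChatClient");
    private static readonly Meter _chatMeter = new("Experimental.OllamaSharp.ChatClient");

    public void Dispose() { }
}
```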
@lmolkova can probably give the latest status on how the content of the chat should be added to telemetry and what the best knob is for the app to turn that on/off. One scenario for emitting the chat content is to be able to use a separate service to score the responses, so that you can get a sense of correctness over time as models change, etc.
```csharp
public const string GEN_AI_RESPONSE_MODEL_KEY = "gen_ai.response.model";

public const string GEN_AI_SYSTEM_KEY = "gen_ai.system";
public const string GEN_AI_SYSTEM_VALUE = "ollamasharp";
```
This property is intended to be language-agnostic. Should it be just ollama? The language-specific part is essentially inside OllamaSharp.ChatClient.
Also, if you're interested in contributing to OTel, we'd be more than happy to add ollama to the list of systems in https://github.com/open-telemetry/semantic-conventions/blob/v1.30.0/docs/gen-ai/gen-ai-spans.md
Suggested change:

```diff
- public const string GEN_AI_SYSTEM_VALUE = "ollamasharp";
+ public const string GEN_AI_SYSTEM_VALUE = "ollama";
```
That makes sense, I can change it to be just ollama. If this does end up being implemented, then I'll add it to that list.
```csharp
public const string GEN_AI_REQUEST_PRESENCE_PENALTY_KEY = "gen_ai.request.presence_penalty";
public const string GEN_AI_REQUEST_STOP_SEQUENCES_KEY = "gen_ai.request.stop_sequences";

public const string GEN_AI_PROMPT_KEY = "gen_ai.prompt";
```
We've reworked the OTel semantic conventions a bit, and we no longer support this way of reporting prompts and completions. Check out https://github.com/open-telemetry/semantic-conventions/blob/v1.30.0/docs/gen-ai/gen-ai-events.md
You can find a creative way to report those new events in https://github.com/dotnet/extensions/blob/main/src/Libraries/Microsoft.Extensions.AI/ChatCompletion/OpenTelemetryChatClient.cs (and it should get easier over time).
I'll review that implementation (I was going off the OpenAI one).
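For reference, the event-based shape looks roughly like this; a simplified sketch assuming an ILogger is at hand, with the event name taken from the semconv events doc above (Microsoft.Extensions.AI emits these as structured log records):

```csharp
using System.Collections.Generic;
using Microsoft.Extensions.Logging;

internal static class GenAiEvents
{
    // Emits a semconv "gen_ai.user.message" event as a structured log record.
    public static void LogUserMessage(ILogger logger, string content) =>
        logger.Log(
            LogLevel.Information,
            new EventId(1, "gen_ai.user.message"),
            new List<KeyValuePair<string, object?>>
            {
                new("gen_ai.system", "ollama"),
                new("content", content), // would sit behind a content-capture opt-in
            },
            exception: null,
            formatter: static (_, _) => "gen_ai.user.message");
}
```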
Fixes #185
This is heavily borrowed from the OpenAI implementation.
It focuses on tracing through the chat and chat-streaming parts of the Ollama API, adding an opt-in activity source. If the feature isn't enabled (it isn't by default), the method calls are essentially no-ops.
Here's what it looks like in the Aspire dashboard: you can see the metrics in the trace from the request and response.
Included docs on how to use it.
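For anyone who wants to try it, hooking the new source and meter up from an app would look something like this; a sketch assuming the names from the diffs above plus the OpenTelemetry and OTLP exporter packages:

```csharp
using OpenTelemetry;
using OpenTelemetry.Metrics;
using OpenTelemetry.Trace;

// Listen for the PR's ActivitySource and Meter and export via OTLP
// (e.g. to the Aspire dashboard).
using var tracerProvider = Sdk.CreateTracerProviderBuilder()
    .AddSource("OllamaSharp.ChatClient")
    .AddOtlpExporter()
    .Build();

using var meterProvider = Sdk.CreateMeterProviderBuilder()
    .AddMeter("OllamaSharp.ChatClient")
    .AddOtlpExporter()
    .Build();
```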