feat(google_genai): [MLOB-2932] add apm tracing for google-genai #13650

maxzhangdd · 2025-06-11T18:16:29Z

This PR adds support for APM tracing of Google's GenAI Python SDK. Traces currently only contain UST tags as well as provider and model (LLMObs tracing of inputs/outputs and metadata will be done in a later PR).
Traced calls:

google.genai.models.Models.generate_content
google.genai.models.Models.generate_content_stream
google.genai.models.AsyncModels.generate_content
google.genai.models.AsyncModels.generate_content_stream

Checklist

PR author has checked that all the criteria below are met
The PR description includes an overview of the change
The PR description articulates the motivation for the change
The change includes tests OR the PR description describes a testing strategy
The PR description notes risks associated with the change, if any
Newly-added code is easy to change
The change follows the library release note guidelines
The change includes or references documentation updates if necessary
Backport labels are set (if applicable)

Reviewer Checklist

Reviewer has checked that all the criteria below are met
Title is accurate
All changes are related to the pull request's stated goal
Avoids breaking API changes
Testing strategy adequately addresses listed risks
Newly-added code is easy to change
Release note makes sense to a user of the library
If necessary, author has acknowledged and discussed the performance implications of this PR as reported in the benchmarks PR comment
Backport labels are set in a manner that is consistent with the release branch maintenance policy

ddtrace/contrib/internal/google_genai/patch.py

github-actions · 2025-06-11T18:17:16Z

CODEOWNERS have been resolved as:

.riot/requirements/1de4a65.txt                                          @DataDog/apm-python
.riot/requirements/7d83e7d.txt                                          @DataDog/apm-python
.riot/requirements/97b1ae2.txt                                          @DataDog/apm-python
.riot/requirements/ce785c0.txt                                          @DataDog/apm-python
.riot/requirements/f5e518d.txt                                          @DataDog/apm-python
ddtrace/contrib/_google_genai.py                                        @DataDog/ml-observability
ddtrace/contrib/internal/google_genai/_utils.py                         @DataDog/ml-observability
ddtrace/contrib/internal/google_genai/patch.py                          @DataDog/ml-observability
ddtrace/llmobs/_integrations/google_genai.py                            @DataDog/ml-observability
releasenotes/notes/google_genai_apm_tracing-a88d4a4dada947d6.yaml       @DataDog/apm-python
tests/contrib/google_genai/__init__.py                                  @DataDog/apm-core-python @DataDog/apm-idm-python
tests/contrib/google_genai/cassettes/v1/generate_content.yaml           @DataDog/apm-core-python @DataDog/apm-idm-python
tests/contrib/google_genai/cassettes/v1/generate_content_stream.yaml    @DataDog/apm-core-python @DataDog/apm-idm-python
tests/contrib/google_genai/conftest.py                                  @DataDog/apm-core-python @DataDog/apm-idm-python
tests/contrib/google_genai/test_google_genai.py                         @DataDog/apm-core-python @DataDog/apm-idm-python
tests/contrib/google_genai/test_google_genai_patch.py                   @DataDog/apm-core-python @DataDog/apm-idm-python
tests/contrib/google_genai/utils.py                                     @DataDog/apm-core-python @DataDog/apm-idm-python
tests/snapshots/tests.contrib.google_genai.test_google_genai.test_google_genai_generate_content.json  @DataDog/apm-python
tests/snapshots/tests.contrib.google_genai.test_google_genai.test_google_genai_generate_content_error.json  @DataDog/apm-python
tests/snapshots/tests.contrib.google_genai.test_google_genai.test_google_genai_generate_content_stream.json  @DataDog/apm-python
tests/snapshots/tests.contrib.google_genai.test_google_genai.test_google_genai_generate_content_stream_error.json  @DataDog/apm-python
tests/snapshots/tests.contrib.google_genai.test_google_genai.test_google_genai_vertex_generate_content.json  @DataDog/apm-python
.github/CODEOWNERS                                                      @DataDog/python-guild @DataDog/apm-core-python
ddtrace/_monkey.py                                                      @DataDog/apm-core-python
ddtrace/contrib/integration_registry/registry.yaml                      @DataDog/apm-core-python @DataDog/apm-idm-python
ddtrace/llmobs/_integrations/__init__.py                                @DataDog/ml-observability
ddtrace/settings/_config.py                                             @DataDog/apm-core-python
docs/integrations.rst                                                   @DataDog/python-guild
docs/spelling_wordlist.txt                                              @DataDog/python-guild
riotfile.py                                                             @DataDog/apm-python
supported_versions_output.json                                          @DataDog/apm-core-python
supported_versions_table.csv                                            @DataDog/apm-core-python
tests/llmobs/suitespec.yml                                              @DataDog/ml-observability

github-actions · 2025-06-11T18:39:58Z

Bootstrap import analysis

Comparison of import times between this PR and base.

Summary

The average import time from this PR is: 275 ± 3 ms.

The average import time from base is: 280 ± 4 ms.

The import time difference between this PR and base is: -4.6 ± 0.2 ms.

Import time breakdown

The following import paths have shrunk:

ddtrace.auto 2.229 ms (0.81%)

ddtrace.bootstrap.sitecustomize 1.548 ms (0.56%)

ddtrace.bootstrap.preload 1.548 ms (0.56%)

ddtrace.internal.remoteconfig.client 0.683 ms (0.25%)

ddtrace 0.682 ms (0.25%)

ddtrace.internal._unpatched 0.032 ms (0.01%)

json 0.032 ms (0.01%)

json.decoder 0.032 ms (0.01%)

re 0.032 ms (0.01%)

enum 0.032 ms (0.01%)

types 0.032 ms (0.01%)

pr-commenter · 2025-06-13T18:01:18Z

Benchmarks

Benchmark execution time: 2025-06-17 23:33:48

Comparing candidate commit 23ab599 in PR branch maxzhang/google-genai-integration with baseline commit 5592908 in branch main.

Found 0 performance improvements and 3 performance regressions! Performance is the same for 558 metrics, 3 unstable metrics.

scenario:iastaspectsospath-ospathbasename_aspect

🟥 execution_time [+741.751ns; +850.827ns] or [+17.595%; +20.182%]

scenario:iastaspectsospath-ospathjoin_aspect

🟥 execution_time [+819.918ns; +957.207ns] or [+13.361%; +15.599%]

scenario:iastaspectsospath-ospathnormcase_aspect

🟥 execution_time [+306.928ns; +389.669ns] or [+8.855%; +11.242%]

ddtrace/contrib/_google_genai.py

Yun-Kim · 2025-06-17T18:46:47Z

ddtrace/contrib/internal/google_genai/_utils.py

+# https://cloud.google.com/vertex-ai/generative-ai/docs/model-garden/quickstart
+# for vertex, it seems like the best way to associate provider name with each call is based on the model name prefix


I checked this link and it doesn't seem to show the model names in the below context. Is this the correct link?

Suggested change

# https://cloud.google.com/vertex-ai/generative-ai/docs/model-garden/quickstart

# for vertex, it seems like the best way to associate provider name with each call is based on the model name prefix

# https://cloud.google.com/vertex-ai/generative-ai/docs/model-garden/quickstart

# VertexAI: the best way to associate provider name with each call is checking the model name prefix

I wasn't able to find a definitive source for providers.

Unlike what we initially thought, its hard to get the provider from the full path since it is not required and users can simply provide a model name. : https://github.com/googleapis/python-genai/blob/main/google/genai/models.py#L6005

So this code is a bit more best-effort. Gemini only exports google provided models whereas vertex lists supported models on the left side of the provided link. I just manually mapped supported models to providers.

Let me know if you have any suggestions on how to improve this part.

ddtrace/contrib/internal/google_genai/_utils.py

tests/contrib/google_genai/test_google_genai.py

Yun-Kim · 2025-06-17T19:01:49Z

tests/contrib/google_genai/test_google_genai.py

+
+@pytest.mark.snapshot(token="tests.contrib.google_genai.test_google_genai.test_google_genai_generate_content_async")
+async def test_google_genai_generate_content_async(google_genai_vcr, genai):
+    with google_genai_vcr.use_cassette("generate_content_async.yaml"):


It looks like the request/response from sync and async generate_content methods are the same. Can we just reuse the same snapshot and cassette files? For the sake of minimizing test files we need to maintain
i.e.

@pytest.mark.snapshot(token="tests.contrib.google_genai.test_google_genai.test_google_genai_generate_content") ... with google_genai_vcr.use_cassette("generate_content.yaml"):

Was able to deduplicate cassettes and snapshots. but for snapshots, I had to add an ignore on resource to prevent this:

span mismatch on 'resource': got 'AsyncModels.generate_content_stream' which does not match expected 'Models.generate_content_stream'..

is this a good idea?

tests/contrib/google_genai/test_google_genai.py

tests/contrib/google_genai/utils.py

Co-authored-by: Yun Kim <[email protected]>

datadog-datadog-prod-us1 bot reviewed Jun 11, 2025

View reviewed changes

ddtrace/contrib/internal/google_genai/patch.py Outdated Show resolved Hide resolved

maxzhangdd changed the title ~~WIP~~ feat(google_genai): [MLOB-2932] add apm tracing for google-genai Jun 13, 2025

maxzhangdd closed this Jun 13, 2025

maxzhangdd reopened this Jun 13, 2025

maxzhangdd added 23 commits June 16, 2025 12:30

skeleton for google gen ai integration

0f7eb0f

more skeleton, added to settings/_config.py

fc11773

get span attributes from request

4bc8e81

add skeleton methods to GoogleGenAI Integration

b6774bf

incremental progress on tagging request response

245dd33

updated tag_response

88a6346

added llmobs_set_tags to traced_generate

dbe7406

changed normalize_contents, progress on tag_response and tag_request

3c0bffd

simplify code to just apm tracing, also added provider extraction

d8abc0c

change CODEOWNERS

37cda72

add tests

4183303

add test for extract_provider_and_model_name_genai

95365d6

linting, add ignore arg on snapshot

c8963b6

fix bare except

aa3eaab

more linting checks

f6549b4

modified suitespec

545c3a5

add google_genai component

a9e197b

change google_genai component

72ae21b

fix min_compatible_versions, changed config, changed doc

fd72fa7

lint fix

b9d5720

ruff

977ac43

add supported versions

7f6e417

lint

87952f2

maxzhangdd added 11 commits June 16, 2025 16:03

fix comments, more async changes

72b6109

testing for async and async streaming

505be8b

linting + add a missed cassette

7b8564a

linting

caa7c24

add test for vertex

cd2ed4f

set env vars

d4e3bf7

ignore metadata requests

1dc8558

linting

7b448e0

stop ignoring auths, blank out secrets

ae82d6f

testing changes for vertex

a9aee36

try mocking gce to prevent remote credential check

1a1e7cf

Yun-Kim reviewed Jun 17, 2025

View reviewed changes

maxzhangdd and others added 18 commits June 17, 2025 15:18

linting

25e7c73

Update ddtrace/contrib/_google_genai.py

5a8b350

Co-authored-by: Yun Kim <[email protected]>

Update ddtrace/contrib/internal/google_genai/_utils.py

f0a8d0e

Co-authored-by: Yun Kim <[email protected]>

Update ddtrace/contrib/internal/google_genai/_utils.py

ed9d83d

Co-authored-by: Yun Kim <[email protected]>

Update ddtrace/contrib/internal/google_genai/_utils.py

370dcf5

Co-authored-by: Yun Kim <[email protected]>

Update tests/contrib/google_genai/test_google_genai.py

e0bbe1f

Co-authored-by: Yun Kim <[email protected]>

Update ddtrace/contrib/internal/google_genai/patch.py

c70af6e

Co-authored-by: Yun Kim <[email protected]>

Update ddtrace/contrib/internal/google_genai/patch.py

cfd96ff

Co-authored-by: Yun Kim <[email protected]>

Update tests/contrib/google_genai/conftest.py

c9694cc

Co-authored-by: Yun Kim <[email protected]>

Update tests/contrib/google_genai/conftest.py

eb33c02

Co-authored-by: Yun Kim <[email protected]>

Update tests/contrib/google_genai/conftest.py

f16b9b5

Co-authored-by: Yun Kim <[email protected]>

resolving comments

2aef56b

change riotfile, deduplicate cassettes and snapshots

ade4b18

linting

5457a43

Merge branch 'main' into maxzhang/google-genai-integration

9410a72

linting

ff784fa

fix spelling, supported versions

2504406

change supported_verions

23ab599

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat(google_genai): [MLOB-2932] add apm tracing for google-genai #13650

feat(google_genai): [MLOB-2932] add apm tracing for google-genai #13650

maxzhangdd commented Jun 11, 2025 •

edited

Loading

Uh oh!

Uh oh!

github-actions bot commented Jun 11, 2025 •

edited

Loading

Uh oh!

github-actions bot commented Jun 11, 2025 •

edited

Loading

Uh oh!

pr-commenter bot commented Jun 13, 2025 •

edited

Loading

Uh oh!

Uh oh!

Yun-Kim Jun 17, 2025

Uh oh!

maxzhangdd Jun 17, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Yun-Kim Jun 17, 2025

Uh oh!

maxzhangdd Jun 17, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

		# https://cloud.google.com/vertex-ai/generative-ai/docs/model-garden/quickstart
		# for vertex, it seems like the best way to associate provider name with each call is based on the model name prefix

feat(google_genai): [MLOB-2932] add apm tracing for google-genai #13650

Are you sure you want to change the base?

feat(google_genai): [MLOB-2932] add apm tracing for google-genai #13650

Conversation

maxzhangdd commented Jun 11, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Checklist

Reviewer Checklist

Uh oh!

Uh oh!

github-actions bot commented Jun 11, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

github-actions bot commented Jun 11, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Bootstrap import analysis

Summary

Import time breakdown

Uh oh!

pr-commenter bot commented Jun 13, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Benchmarks

scenario:iastaspectsospath-ospathbasename_aspect

scenario:iastaspectsospath-ospathjoin_aspect

scenario:iastaspectsospath-ospathnormcase_aspect

Uh oh!

Uh oh!

Yun-Kim Jun 17, 2025

Choose a reason for hiding this comment

Uh oh!

maxzhangdd Jun 17, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Yun-Kim Jun 17, 2025

Choose a reason for hiding this comment

Uh oh!

maxzhangdd Jun 17, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

maxzhangdd commented Jun 11, 2025 •

edited

Loading

github-actions bot commented Jun 11, 2025 •

edited

Loading

github-actions bot commented Jun 11, 2025 •

edited

Loading

pr-commenter bot commented Jun 13, 2025 •

edited

Loading