fix(core)!: include multimodal blocks in `get_buffer_string` prefix format by mdrxy · Pull Request #38174 · langchain-ai/langchain

Mason Daugherty (mdrxy) · 2026-06-15T19:32:15Z

Breaking change - v2 candidate

get_buffer_string now includes image/audio/video block references (e.g. [image: <url>]) in the default prefix format instead of dropping them.

The default (non-XML) get_buffer_string only kept text content and silently dropped image, audio, and video blocks. That is lossy: a user who asks "what does this screenshot show?" with an attached image URL ends up with a summary that no longer references the image at all. Only the format="xml" path preserved that information, so callers such as SummarizationMiddleware had to opt into XML to avoid the loss.

This updates the shared utility so every caller of the default format benefits: non-base64 image, image_url (OpenAI-style), audio, and video blocks are appended as a concise human-readable reference. Plain string content and the XML format are unchanged, and base64/data: media is still omitted to avoid dumping payloads.

Before / After

Given a multimodal HumanMessage:

from langchain_core.messages import HumanMessage, get_buffer_string

msg = HumanMessage(content=[
    {"type": "text", "text": "What does this screenshot show?"},
    {"type": "image_url", "image_url": {"url": "https://example.com/screenshot.png"}},
])
get_buffer_string([msg])

Before (image URL silently dropped):

Human: What does this screenshot show?

After (image reference preserved):

Human: What does this screenshot show? [image: https://example.com/screenshot.png]

Made by Open SWE

…rmat Non-XML `get_buffer_string` dropped image/audio/video content blocks, losing references like image URLs the user explicitly mentioned. The prefix path now appends non-base64 image, audio, and video blocks as a human-readable reference (e.g. `[image: <url>]`) so all callers of the default format benefit. Plain string content and XML output are unchanged. Co-authored-by: open-swe[bot] <open-swe@users.noreply.github.com>

open-swe

✅ Open SWE Review: No issues found

Open SWE reviewed this PR and found no potential bugs to report.

Open in Web • View Open SWE trace

github-actions Bot added core `langchain-core` package issues & PRs fix For PRs that implement a fix internal size: S 50-199 LOC labels Jun 15, 2026

Mason Daugherty (mdrxy) marked this pull request as ready for review June 15, 2026 19:39

Mason Daugherty (mdrxy) requested a review from Eugene Yurtsev (eyurtsev) as a code owner June 15, 2026 19:39

open-swe Bot reviewed Jun 15, 2026

View reviewed changes

Mason Daugherty (mdrxy) changed the title ~~fix(core): include multimodal blocks in get_buffer_string prefix format~~ fix(core)!: include multimodal blocks in get_buffer_string prefix format Jun 15, 2026

github-actions Bot added the breaking Breaking changes label Jun 15, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(core)!: include multimodal blocks in `get_buffer_string` prefix format#38174

fix(core)!: include multimodal blocks in `get_buffer_string` prefix format#38174
Mason Daugherty (mdrxy) wants to merge 1 commit into
masterfrom
mdrxy/core/get-buffer-string-multimodal

Mason Daugherty (mdrxy) commented Jun 15, 2026 •

edited

Loading

Uh oh!

open-swe Bot left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

Mason Daugherty (mdrxy) commented Jun 15, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Before / After

Uh oh!

open-swe Bot left a comment

Choose a reason for hiding this comment

✅ Open SWE Review: No issues found

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Mason Daugherty (mdrxy) commented Jun 15, 2026 •

edited

Loading