docs(ollama): add streaming-with-tools example to OllamaChatGenerator reference #11268
Conversation
Closes deepset-ai/haystack-core-integrations#3263 (follow-up). The component reference page already covers Tool Support and Streaming in separate sections, but no example shows them combined. This PR adds a Streaming with Tools section between the two, with an executable example verified empirically against llama3.1:8b on Ollama. Notable behavior captured in the doc: when the model invokes a tool, the streamed chunks carry `tool_calls` deltas and `chunk.content` is empty; the final `ChatMessage` has `text=None` and `tool_calls` populated.
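For context, here is a minimal sketch of the kind of tool the example revolves around. The `get_weather` name and the `city` argument come from the test notes further down; the toy implementation and the JSON schema are illustrative assumptions, not the exact code in the docs.

```python
# Illustrative sketch only: the get_weather name and city argument come from
# the spike notes below; the body and schema here are toy placeholders.
from haystack.tools import Tool

def get_weather(city: str) -> str:
    # Toy implementation standing in for a real weather lookup.
    return f"The weather in {city} is sunny and 22 degrees."

weather_tool = Tool(
    name="get_weather",
    description="Get the current weather for a city.",
    parameters={
        "type": "object",
        "properties": {"city": {"type": "string", "description": "City name"}},
        "required": ["city"],
    },
    function=get_weather,
)
```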
Per CONTRIBUTING.md, every PR requires a release note under releasenotes/notes/. Categorized as `enhancements` to match the shape of prior docs-only release notes (e.g., docs-cleaner-markdown-ocr-examples-...yaml).
anakin87
left a comment
Thank you!
I left some comments.
In addition, please also copy this change to the 2.28 versioned docs (latest stable) in `docs-website/versioned_docs/version-2.28/pipeline-components/generators/ollamachatgenerator.mdx`.
@@ -0,0 +1,4 @@
---
Since this change only affects docs, we don't need a release note. Please remove this file.
:::tip[What to expect when tools fire]
When the model emits a tool call rather than free-form text, streamed chunks carry `tool_calls` deltas and `chunk.content` is empty. The final `replies[0].text` will be `None`, and `replies[0].tool_calls` holds the reconstructed call list. Plain text streaming and tool calling are mutually exclusive within a single generation step.
:::
this is already kinda clear, so I'd remove this tip section
- Remove `releasenotes/notes/streaming-with-tools-ollamachatgenerator-docs-8e339d62f38ebd06.yaml`: docs-only change does not need a release note.
- Remove the `:::tip[What to expect when tools fire]` admonition from `docs-website/docs/pipeline-components/generators/ollamachatgenerator.mdx`: the inline comments in the streaming-with-tools example already convey the same information.
- Add the Streaming with Tools section to `docs-website/versioned_docs/version-2.28/pipeline-components/generators/ollamachatgenerator.mdx` (latest stable), byte-identical to the v3 docs section.
Thanks for the review @anakin87! Addressed all three comments.
Ready for another look when you have a moment.
Related Issues
- Closes deepset-ai/haystack-core-integrations#3263 (follow-up)
Proposed Changes:
Adds a `### Streaming with Tools` section to the `OllamaChatGenerator` reference page (`docs-website/docs/pipeline-components/generators/ollamachatgenerator.mdx`), between the existing `### Streaming` section and the `## Usage` heading. The section includes:
- An executable example passing both `streaming_callback` and `tools` on `OllamaChatGenerator` (a minimal sketch follows this list).
- A note that `chunk.content` is empty in the streamed chunks, and that the final `replies[0].text` is `None` while `replies[0].tool_calls` carries the reconstructed call list.
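As a reference for reviewers, a minimal sketch of what such a combined example can look like, assuming a local Ollama server with `llama3.1:8b` pulled and the `weather_tool` defined in the sketch near the top of this PR; the exact wording in the published .mdx may differ.

```python
from haystack.dataclasses import ChatMessage, StreamingChunk
from haystack_integrations.components.generators.ollama import OllamaChatGenerator

def on_chunk(chunk: StreamingChunk) -> None:
    # For plain-text generations, chunk.content carries the text delta.
    # When the model fires a tool call, chunk.content stays empty and the
    # tool-call deltas are surfaced on the chunk instead.
    print(chunk.content, end="", flush=True)

generator = OllamaChatGenerator(
    model="llama3.1:8b",
    tools=[weather_tool],          # Tool sketched earlier in this PR description
    streaming_callback=on_chunk,
)

result = generator.run(messages=[ChatMessage.from_user("What is the weather in Berlin?")])
reply = result["replies"][0]

print(reply.text)        # None when the model answered with a tool call
print(reply.tool_calls)  # e.g. [ToolCall(tool_name='get_weather', arguments={'city': 'Berlin'})]
```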
How did you test it?

Manually verified against `OllamaChatGenerator` + `llama3.1:8b` on Ollama, with two spike scripts:

- Primary spike (directive prompt invoking `get_weather`): 2 chunks fired (1 carrying the tool-call delta, 1 closing). `replies[0].tool_calls = [ToolCall(tool_name='get_weather', arguments={'city': 'Berlin'}, ...)]`. `replies[0].text` is `None`. `meta.finish_reason: stop`.
- Backfill spike (mutual-exclusivity check): six prompts spanning directive / ambiguous / arithmetic / unrelated-topic / literary. Across all six, never observed text content and tool-call deltas in the same chunk, nor text and `tool_calls` together in the final `ChatMessage`. The arithmetic prompt ("What is 2+2?") produced 99 chunks of pure text (98 text chunks, 0 tool chunks), confirming that text streaming works as expected when the model elects not to use a tool.

The change is documentation-only and does not introduce code changes; no unit-test additions apply.
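For reviewers curious what the backfill spike asserted, a rough reconstruction follows; the prompt list and the chunk bookkeeping are illustrative rather than the actual spike script, and `tool_calls` on `StreamingChunk` is read defensively since the attribute's availability can vary across Haystack versions.

```python
# Rough reconstruction of the mutual-exclusivity check, not the actual spike script.
from haystack.dataclasses import ChatMessage, StreamingChunk
from haystack_integrations.components.generators.ollama import OllamaChatGenerator

PROMPTS = [
    "What is the weather in Berlin?",    # directive
    "What is 2+2?",                      # arithmetic
    "Tell me a short story about rain.", # literary
]

chunks: list[StreamingChunk] = []

generator = OllamaChatGenerator(
    model="llama3.1:8b",
    tools=[weather_tool],             # Tool sketched earlier in this PR description
    streaming_callback=chunks.append,
)

for prompt in PROMPTS:
    chunks.clear()
    reply = generator.run(messages=[ChatMessage.from_user(prompt)])["replies"][0]

    # No streamed chunk should carry both text content and tool-call deltas ...
    assert not any(c.content and getattr(c, "tool_calls", None) for c in chunks)
    # ... and the final message should not mix text and tool calls either.
    assert not (reply.text and reply.tool_calls)
```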
Notes for the reviewer
- PR title: `docs(ollama): ...`.
- `releasenotes/notes/streaming-with-tools-ollamachatgenerator-docs-8e339d62f38ebd06.yaml` (single-entry `enhancements`, RST inline code, matches the shape of prior docs-only release notes).

Checklist
- PR title uses a conventional commit type (`docs:`).
- Touched files: `docs-website/docs/.../*.mdx` and `releasenotes/notes/*.yaml`. Happy to run pre-commit if a maintainer flags anything.