MLX: support tool calling during `streamResponse` (currently `respond()`-only)

### Summary
For `MLXLanguageModel`, tool calling works in `respond()` but not in `streamResponse()`. Streaming hardcodes `tools: nil` and ignores tool-call stream items, so callers must choose between **streamed token UX** and **tool calling** for local MLX models — but not both. Foundation models get both.

### Evidence (v0.8.0, `Sources/AnyLanguageModel/Models/MLXLanguageModel.swift`)
- `streamResponse` builds the input with no tools — [L1095-L1100](https://github.com/huggingface/AnyLanguageModel/blob/0.8.0/Sources/AnyLanguageModel/Models/MLXLanguageModel.swift#L1095-L1100):
  ```swift
  let userInput = makeUserInput(chat: chat, tools: nil, processing: userInputProcessing, additionalContext: additionalContext)
  ```
- The stream loop discards tool-call items — [L1127-L1128](https://github.com/huggingface/AnyLanguageModel/blob/0.8.0/Sources/AnyLanguageModel/Models/MLXLanguageModel.swift#L1127-L1128):
  ```swift
  case .info, .toolCall:
      break
  ```
- `respond()` already implements the full tool cycle for reference — passes `toolSpecs` ([L921](https://github.com/huggingface/AnyLanguageModel/blob/0.8.0/Sources/AnyLanguageModel/Models/MLXLanguageModel.swift#L921)) and loops over collect → resolve → re-generate ([L917-L1002](https://github.com/huggingface/AnyLanguageModel/blob/0.8.0/Sources/AnyLanguageModel/Models/MLXLanguageModel.swift#L917-L1002)).

### Use case
On-device assistant that streams tokens *and* calls tools (file search, web fetch, image generation). Today, enabling tools forces the non-streaming path, so the whole reply lands at once — a noticeable UX regression for local models.

### Proposed approach (reuses existing helpers)
In `streamResponse`:
1. Pass `mlxToolSpecs(for: session)` (already defined at [L843](https://github.com/huggingface/AnyLanguageModel/blob/0.8.0/Sources/AnyLanguageModel/Models/MLXLanguageModel.swift#L843)) into `makeUserInput` instead of `nil`.
2. In the stream loop, collect `.toolCall` items instead of `break`-ing; when the model stops with pending calls, resolve them via the existing `resolveToolCalls(_:session:)` ([L1454](https://github.com/huggingface/AnyLanguageModel/blob/0.8.0/Sources/AnyLanguageModel/Models/MLXLanguageModel.swift#L1454)) + `makeTranscriptToolCalls` ([L1435](https://github.com/huggingface/AnyLanguageModel/blob/0.8.0/Sources/AnyLanguageModel/Models/MLXLanguageModel.swift#L1435)), append results to the transcript, and continue generating — mirroring the `respond()` while-loop and reusing its repeated-call guard ([L733](https://github.com/huggingface/AnyLanguageModel/blob/0.8.0/Sources/AnyLanguageModel/Models/MLXLanguageModel.swift#L733)). Continue yielding text snapshots between tool rounds.

### Acceptance
`streamResponse` with a non-empty `session.tools` executes tools and streams text, at behavioral parity with `respond()`.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

MLX: support tool calling during `streamResponse` (currently `respond()`-only) #164

Summary

Evidence (v0.8.0, `Sources/AnyLanguageModel/Models/MLXLanguageModel.swift`)

Use case

Proposed approach (reuses existing helpers)

Acceptance

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Uh oh!

MLX: support tool calling during streamResponse (currently respond()-only) #164

Description

Summary

Evidence (v0.8.0, Sources/AnyLanguageModel/Models/MLXLanguageModel.swift)

Use case

Proposed approach (reuses existing helpers)

Acceptance

Metadata

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Issue actions

MLX: support tool calling during `streamResponse` (currently `respond()`-only) #164

Evidence (v0.8.0, `Sources/AnyLanguageModel/Models/MLXLanguageModel.swift`)