[chore] Add diffusers-format example to LongCatAudioDiTPipeline by RuixiangMa · Pull Request #13483 · huggingface/diffusers

RuixiangMa · 2026-04-15T16:20:33Z

What does this PR do?

add diffusers-format example(repo_id: ruixiangma/LongCat-AudioDiT-1B-Diffusers)
support seed parameter

import soundfile as sf
import torch
from diffusers import LongCatAudioDiTPipeline

pipeline = LongCatAudioDiTPipeline.from_pretrained(
    "ruixiangma/LongCat-AudioDiT-1B-Diffusers",
    torch_dtype=torch.bfloat16,
)
pipeline = pipeline.to("cuda")

prompt = "A calm ocean wave ambience with soft wind in the background."
audio = pipeline(
    prompt,
    audio_duration_s=5.0,
    num_inference_steps=20,
    guidance_scale=4.0,
    generator=torch.Generator("cuda").manual_seed(42),
).audios[0, 0]

sf.write("output.wav", audio, pipeline.sample_rate)

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you read the contributor guideline?
Did you read our philosophy doc (important for complex PRs)?
Was this discussed/approved via a GitHub issue or the forum? Please add a link to it if that's the case.
Did you make sure to update the documentation with your changes? Here are the
documentation guidelines, and
here are tips on formatting docstrings.
Did you write any new necessary tests?

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.

…ioDiTPipeline Signed-off-by: Lancer <maruixiang6688@gmail.com>

RuixiangMa · 2026-04-15T16:43:02Z

@dg845 I uploaded a Diffusers-format repository, updated usage docs.

HuggingFaceDocBuilderDev · 2026-04-16T00:38:12Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

dg845 · 2026-04-16T00:39:59Z

- `output_type="pt"` returns a PyTorch tensor shaped `(batch, channels, samples)`.
+- `audio_duration_s` is the most direct way to control output duration.
+- `seed` makes generation reproducible (optional, defaults to None).
+- Output shape is `(batch, channels, samples)` - use `.audios[0, 0]` to get a single audio sample.


nit: I think it might be more clear here if we clarify how the pipeline handles mono and stereo outputs.

Added mono/stereo clarification in Tips

dg845 · 2026-04-16T00:40:28Z

@bot /style

github-actions · 2026-04-16T00:40:58Z

Style bot fixed some files and pushed the changes.

dg845

Thanks for the follow-up PR! Left a few small comments/suggestions :).

Co-authored-by: dg845 <58458699+dg845@users.noreply.github.com>

Signed-off-by: Lancer <maruixiang6688@gmail.com>

RuixiangMa · 2026-04-16T02:31:34Z

Thanks for the follow-up PR! Left a few small comments/suggestions :).

Fixed, PTAL

dg845 · 2026-04-16T04:22:52Z

@bot /style

github-actions · 2026-04-16T04:23:23Z

Style bot fixed some files and pushed the changes.

dg845

Thanks! (BTW, you can fix the code style with make style and make quality.)

dg845 · 2026-04-16T04:52:07Z

Merging as the CI failures are unrelated.

github-actions Bot added documentation Improvements or additions to documentation pipelines size/S PR with diff < 50 LOC labels Apr 15, 2026

[chore] Add diffusers-format example and seed parameter to LongCatAud…

974c829

…ioDiTPipeline Signed-off-by: Lancer <maruixiang6688@gmail.com>

RuixiangMa force-pushed the longcatdiffusersmodel branch from f25c3a7 to 974c829 Compare April 15, 2026 16:36

github-actions Bot added size/M PR with diff < 200 LOC and removed size/S PR with diff < 50 LOC labels Apr 15, 2026

dg845 reviewed Apr 16, 2026

View reviewed changes

Comment thread docs/source/en/api/pipelines/longcat_audio_dit.md Outdated

dg845 reviewed Apr 16, 2026

View reviewed changes

Comment thread src/diffusers/pipelines/longcat_audio_dit/pipeline_longcat_audio_dit.py Outdated

dg845 reviewed Apr 16, 2026

View reviewed changes

Apply style fixes

ac4ec51

github-actions Bot added size/M PR with diff < 200 LOC and removed size/M PR with diff < 200 LOC labels Apr 16, 2026

dg845 approved these changes Apr 16, 2026

View reviewed changes

Apply suggestions from code review

533d6b6

Co-authored-by: dg845 <58458699+dg845@users.noreply.github.com>

github-actions Bot added size/M PR with diff < 200 LOC and removed size/M PR with diff < 200 LOC labels Apr 16, 2026

upd

e2ac8bd

Signed-off-by: Lancer <maruixiang6688@gmail.com>

github-actions Bot added size/S PR with diff < 50 LOC and removed size/M PR with diff < 200 LOC labels Apr 16, 2026

RuixiangMa changed the title ~~[chore] Add diffusers-format example and seed parameter to LongCatAudioDiTPipeline~~ [chore] Add diffusers-format example to LongCatAudioDiTPipeline Apr 16, 2026

Apply style fixes

e73a6a6

github-actions Bot removed the size/S PR with diff < 50 LOC label Apr 16, 2026

github-actions Bot added the size/M PR with diff < 200 LOC label Apr 16, 2026

dg845 approved these changes Apr 16, 2026

View reviewed changes

dg845 merged commit 947bc23 into huggingface:main Apr 16, 2026
13 of 14 checks passed

Conversation

RuixiangMa commented Apr 15, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What does this PR do?

Before submitting

Who can review?

Uh oh!

RuixiangMa commented Apr 15, 2026

Uh oh!

Uh oh!

Uh oh!

HuggingFaceDocBuilderDev commented Apr 16, 2026

Uh oh!

dg845 Apr 16, 2026

Choose a reason for hiding this comment

Uh oh!

RuixiangMa Apr 16, 2026

Choose a reason for hiding this comment

Uh oh!

dg845 commented Apr 16, 2026

Uh oh!

github-actions Bot commented Apr 16, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

dg845 left a comment

Choose a reason for hiding this comment

Uh oh!

RuixiangMa commented Apr 16, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

dg845 commented Apr 16, 2026

Uh oh!

github-actions Bot commented Apr 16, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

dg845 left a comment

Choose a reason for hiding this comment

Uh oh!

dg845 commented Apr 16, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

RuixiangMa commented Apr 15, 2026 •

edited

Loading

github-actions Bot commented Apr 16, 2026 •

edited

Loading

RuixiangMa commented Apr 16, 2026 •

edited

Loading

github-actions Bot commented Apr 16, 2026 •

edited

Loading