fix(pptx)!: assign pptx notes to ContentLayer.NOTES #3341

Merged
ceberam merged 1 commit into docling-project:main from Vdaleke:fix/pptx-notes-content-layer
Apr 21, 2026


@Vdaleke Vdaleke commented Apr 21, 2026

Assign pptx notes to ContentLayer.NOTES instead of ContentLayer.FURNITURE.

Resolves #3340
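
For downstream consumers, the practical effect of the change can be sketched as follows. This is a minimal, self-contained illustration and not docling code: the `ContentLayer` class below is a stand-in that mirrors only the member names mentioned in this PR (its string values are assumptions), and `Item` and `select_text` are hypothetical helpers introduced for the example.

```python
from dataclasses import dataclass
from enum import Enum


class ContentLayer(str, Enum):
    """Stand-in enum; member names follow this PR, string values are assumed."""

    BODY = "body"
    FURNITURE = "furniture"
    NOTES = "notes"


@dataclass
class Item:
    """Hypothetical document item carrying its text and content layer."""

    text: str
    content_layer: ContentLayer


def select_text(items, layers):
    """Return the text of every item whose layer is in `layers`."""
    return [item.text for item in items if item.content_layer in layers]


# After this change, a slide's speaker notes carry NOTES rather than FURNITURE.
items = [
    Item("Quarterly results", ContentLayer.BODY),
    Item("Remember to mention the new hires", ContentLayer.NOTES),
]

speaker_notes = select_text(items, {ContentLayer.NOTES})
furniture = select_text(items, {ContentLayer.FURNITURE})
```

Code that previously looked for slide notes in the furniture layer would find nothing there after this change and would need to select the notes layer instead, which is why the title carries the breaking-change marker `!`.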

Checklist:

  • Documentation has been updated, if necessary.
  • Examples have been added, if necessary.
  • Tests have been added, if necessary.

Signed-off-by: Matvei Smirnov <vdalekesmirnov@gmail.com>
@github-actions

DCO Check Passed

Thanks @Vdaleke, all your commits are properly signed off. 🎉


mergify Bot commented Apr 21, 2026

Merge Protections

Your pull request matches the following merge protections and will not be merged until they are valid.

🟢 Enforce conventional commit

Wonderful, this rule succeeded.

Make sure that we follow https://www.conventionalcommits.org/en/v1.0.0/

  • title ~= ^(fix|feat|docs|style|refactor|perf|test|build|ci|chore|revert)(?:\(.+\))?(!)?:
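
The title rule above can be checked locally with a short sketch. It assumes that the `~=` operator behaves like a regular-expression search on the title; `check_title` is a hypothetical helper and not part of Mergify.

```python
import re

# Pattern copied from the merge protection rule above.
TITLE_RULE = re.compile(
    r"^(fix|feat|docs|style|refactor|perf|test|build|ci|chore|revert)(?:\(.+\))?(!)?:"
)


def check_title(title):
    """Return the commit type and breaking flag, or None if the title fails the rule."""
    match = TITLE_RULE.search(title)
    if match is None:
        return None
    return {"type": match.group(1), "breaking": match.group(2) == "!"}


result = check_title("fix(pptx)!: assign pptx notes to ContentLayer.NOTES")
```

For this PR's title, the helper reports type `fix` with the breaking flag set, since the optional `!` group matched.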

🟢 Require two reviewers for test updates

Wonderful, this rule succeeded.

When test data is updated, we require two reviewers

  • #approved-reviews-by >= 2


dosubot Bot commented Apr 21, 2026

Documentation Updates

1 document(s) were updated by changes in this PR:

What are the detailed pipeline options and processing behaviors for PDF, DOCX, PPTX, and XLSX files in the Python SDK?
@@ -121,7 +121,7 @@
     - `PaginatedPipelineOptions` (image scaling, page image generation)
 - **Processing**:
     - Each slide is treated as a page
-    - Extracts text (paragraphs, lists, indentation, master styles), images (using PIL), tables (cell/span/header), slide notes (furniture)
+    - Extracts text (paragraphs, lists, indentation, master styles), images (using PIL), tables (cell/span/header), slide notes (notes)
     - Tables and images include provenance (location info)
 - **Notes**: Image resolution adjustment is not supported (depends on backend quality). [Pipeline code reference](https://github.com/docling-project/docling/blob/ae4fdbbb09fd377bb271e9b2efe541873eeb2990/docling/document_converter.py#L100-L506).
 



codecov Bot commented Apr 21, 2026

Codecov Report

✅ All modified and coverable lines are covered by tests.


@ceberam added the bug (Something isn't working) and pptx (issue related to pptx backend) labels Apr 21, 2026

@ceberam ceberam left a comment


🚀

@ceberam ceberam merged commit 3a3c8f6 into docling-project:main Apr 21, 2026
43 of 45 checks passed

Labels

bug (Something isn't working), pptx (issue related to pptx backend)

Development

Successfully merging this pull request may close these issues.

PPTX notes are in the "FURNITURE" content layer instead of "NOTES".

3 participants