fix(pptx)!: assign pptx notes to ContentLayer.NOTES #3341
Merged
ceberam merged 1 commit into docling-project:main on Apr 21, 2026
Signed-off-by: Matvei Smirnov <vdalekesmirnov@gmail.com>
✅ DCO Check Passed. Thanks @Vdaleke, all your commits are properly signed off. 🎉
Merge Protections
Your pull request matches the following merge protections and will not be merged until they are valid.
🟢 Enforce conventional commit: Wonderful, this rule succeeded. Make sure that we follow https://www.conventionalcommits.org/en/v1.0.0/
🟢 Require two reviewer for test updates: Wonderful, this rule succeeded. When test data is updated, we require two reviewers.
Documentation Updates
1 document(s) were updated by changes in this PR:
What are the detailed pipeline options and processing behaviors for PDF, DOCX, PPTX, and XLSX files in the Python SDK?
@@ -121,7 +121,7 @@
- `PaginatedPipelineOptions` (image scaling, page image generation)
- **Processing**:
- Each slide is treated as a page
- - Extracts text (paragraphs, lists, indentation, master styles), images (using PIL), tables (cell/span/header), slide notes (furniture)
+ - Extracts text (paragraphs, lists, indentation, master styles), images (using PIL), tables (cell/span/header), slide notes (notes)
- Tables and images include provenance (location info)
- **Notes**: Image resolution adjustment is not supported (depends on backend quality). [Pipeline code reference](https://github.com/docling-project/docling/blob/ae4fdbbb09fd377bb271e9b2efe541873eeb2990/docling/document_converter.py#L100-L506).
PeterStaar-IBM
approved these changes
Apr 21, 2026
Codecov Report: ✅ All modified and coverable lines are covered by tests.
Assign pptx notes to ContentLayer.NOTES instead of ContentLayer.FURNITURE.
Resolves #3340
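
To illustrate why this is flagged as a breaking change, here is a minimal, self-contained sketch of how a layer-based filter might behave before and after the reassignment. It does not use docling itself: `Layer`, `Item`, `slide_items`, and `texts_in` are hypothetical stand-ins invented for this example, and only three layer values are modeled.

```python
from dataclasses import dataclass
from enum import Enum


class Layer(str, Enum):
    """Hypothetical stand-in for a content-layer enum (not docling's class)."""
    BODY = "body"
    FURNITURE = "furniture"
    NOTES = "notes"


@dataclass
class Item:
    """Hypothetical document item: a piece of text tagged with a layer."""
    text: str
    layer: Layer


def slide_items(notes_layer):
    """Build one slide: a body paragraph plus a speaker note on `notes_layer`."""
    return [
        Item("Quarterly results", Layer.BODY),
        Item("Remember to mention the delay", notes_layer),
    ]


def texts_in(items, layers):
    """Return the text of every item whose layer is in `layers`."""
    return [item.text for item in items if item.layer in layers]


# Old behavior: notes were tagged as furniture.
before = slide_items(Layer.FURNITURE)
# New behavior described in this PR: notes get their own layer.
after = slide_items(Layer.NOTES)

print(texts_in(before, {Layer.FURNITURE}))  # the note is found
print(texts_in(after, {Layer.FURNITURE}))   # the note is no longer found
print(texts_in(after, {Layer.NOTES}))       # the note is found again
```

Under these assumptions, a caller that selected the furniture layer to pick up slide notes would need to select the notes layer instead after this change.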
Checklist: