Split in Pages Task for Document Operator
15 days ago by Xiaofei Du

We’ve added a new task to the Document Operator — you can now split a document into individual pages, with support for custom batch sizes, enabling more controlled workflows.
Use cases:
- Pre-process long PDFs before parsing
- Apply AI operations in page batches
- Improve downstream precision
📄 Example pipeline recipe:
version: v1beta
variable:
document:
type: document
description: Input document
component:
document-op:
type: document
task: TASK_SPLIT_IN_PAGES
input:
document: ${variable.document}
batch-size: 8
output:
page-batches:
title: Document page batches
value: ${document-op.output.batches}
This splits the input document into batches of 8 pages.