Split in Pages Task for Document Operator

We’ve added a new task to the Document Operator — you can now split a document into individual pages, with support for custom batch sizes, enabling more controlled workflows.

Use cases:

  • Pre-process long PDFs before parsing
  • Apply AI operations in page batches
  • Improve downstream precision

📄 Example pipeline recipe:

version: v1beta
variable:
  document:
    type: document
    description: Input document
component:
  document-op:
    type: document
    task: TASK_SPLIT_IN_PAGES
    input:
      document: ${variable.document}
      batch-size: 8
output:
  page-batches:
    title: Document page batches
    value: ${document-op.output.batches}

This splits the input document into batches of 8 pages.