PDF Markup

Particularly for print media, markup based on print-ready PDF pages offers advantages:

  • Extraction of images, graphical objects and embedded documents
  • Extraction of associated metadata on the object level
  • Automated duplicate detection
  • Geometrical determination of size

Input files must be PDF version 1.4 or higher.

For further processing, the PDF Markup result for a print page can be provided as a CSV list or in XML format.
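
As an illustration of such further processing, the following minimal Python sketch reads a CSV result, converts object sizes to millimetres, and groups duplicates. The column names (page, object_id, type, width_pt, height_pt, sha256) and the sample rows are hypothetical and do not describe the actual export layout. The sketch also assumes that sizes are given in default PDF user-space units (1/72 inch) and that duplicates can be recognised by an identical content hash.

```python
import csv
import io

# Hypothetical CSV export: column names and values are illustrative
# assumptions, not the actual PDF Markup output format.
SAMPLE_CSV = """page,object_id,type,width_pt,height_pt,sha256
1,img-001,image,283.46,141.73,aaa111
1,img-002,image,141.73,141.73,bbb222
2,img-003,image,283.46,141.73,aaa111
"""

# Default PDF user-space units (points) per millimetre: 72 pt per 25.4 mm.
PT_PER_MM = 72 / 25.4


def load_objects(text):
    """Parse the CSV text into a list of dicts, adding sizes in mm."""
    rows = []
    for row in csv.DictReader(io.StringIO(text)):
        row["width_mm"] = round(float(row["width_pt"]) / PT_PER_MM, 1)
        row["height_mm"] = round(float(row["height_pt"]) / PT_PER_MM, 1)
        rows.append(row)
    return rows


def find_duplicates(rows):
    """Group object ids by content hash; keep groups with more than one id."""
    groups = {}
    for row in rows:
        groups.setdefault(row["sha256"], []).append(row["object_id"])
    return {h: ids for h, ids in groups.items() if len(ids) > 1}


if __name__ == "__main__":
    objects = load_objects(SAMPLE_CSV)
    for obj in objects:
        print(obj["object_id"], obj["width_mm"], "x", obj["height_mm"], "mm")
    print(find_duplicates(objects))
```

Grouping by hash is only one possible way to detect duplicates; an XML result could be handled the same way after parsing it with xml.etree.ElementTree.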