PDF Markup
Particularly for print media, markup based on print-ready PDF pages offers advantages:
- Extraction of images, graphical objects and embedded documents
- Extraction of associated metadata on the object level
- Automated duplicate detection
- Geometrical determination of size
Requirement: PDF version 1.4 or higher.
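The version requirement can be checked before a page is processed. The following is a minimal sketch in Python, assuming the version is read from the file header (`%PDF-x.y`). The function names `pdf_header_version` and `meets_minimum` are hypothetical and not part of any PDF Markup interface. A PDF may also declare a later version in its document catalog, which this sketch ignores.

```python
import re

# Hypothetical helper names, chosen for illustration only.
_HEADER = re.compile(rb"%PDF-(\d+)\.(\d+)")


def pdf_header_version(data: bytes):
    """Return (major, minor) from the PDF header, or None if no header is found."""
    # The header normally sits at the very start of the file;
    # searching the first 1024 bytes tolerates leading junk.
    match = _HEADER.search(data[:1024])
    if match is None:
        return None
    return int(match.group(1)), int(match.group(2))


def meets_minimum(data: bytes, minimum=(1, 4)) -> bool:
    """True if the header version is at least `minimum` (default: PDF 1.4)."""
    version = pdf_header_version(data)
    return version is not None and version >= minimum
```

In practice the bytes would come from the start of the file, for example `open(path, "rb").read(1024)`.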
For further processing, the PDF Markup result of a print page can be provided as a CSV list or in XML format.
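As an illustration of the two output formats, the sketch below serializes a list of extracted objects to CSV and to XML using only the Python standard library. The field names (`page`, `object_type`, `width_mm`, `height_mm`) and the element names (`page_markup`, `object`) are assumptions made for this example; the actual PDF Markup output schema is not specified here.

```python
import csv
import io
import xml.etree.ElementTree as ET

# Assumed field names; the real result schema may differ.
FIELDS = ["page", "object_type", "width_mm", "height_mm"]


def to_csv(rows):
    """Serialize a list of dicts to CSV text with a header row."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=FIELDS, lineterminator="\n")
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()


def to_xml(rows):
    """Serialize the same rows to XML, one <object> element per row."""
    root = ET.Element("page_markup")
    for row in rows:
        item = ET.SubElement(root, "object")
        for name in FIELDS:
            ET.SubElement(item, name).text = str(row[name])
    return ET.tostring(root, encoding="unicode")


# Made-up sample record, for demonstration only.
sample = [{"page": 1, "object_type": "image", "width_mm": 90.0, "height_mm": 60.0}]
```

Calling `to_csv(sample)` yields a header line followed by one data line, and `to_xml(sample)` yields a single `<page_markup>` element containing one `<object>` child.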