Skip to content

Derivative generation on several PDFs resulted in blank pages #7035

@hackartisan

Description

@hackartisan

Summary or User Story

The 2014, 2015, and 2016-17 child resources of https://figgy.princeton.edu/catalog/59863f4b-2179-4c37-b3dd-23e5b0dba765 all have file sets generated from the PDF that contain blank images.

I haven't found any way of identifying resources with this issue; I just happened to notice this when looking at these resources for another reason (when resolving #7013). There may be other PDF resources with this issue.

If you download the pdf directly, you will see that every page has at least a page number, and for the most part there is page content when the generated file appears blank.

Tech Implementation

The first goal is to find all the PDFs we have ingested that have blank pages. Is that possible?

Try reingesting that file to staging and see if it also generates blank pages.

Acceptance Criteria

  • Spend an hour and see if you can figure out how big a problem this is.
  • Viewer for PDFs doesn't have blank pages when there should be content.

First step

Do the time boxed research

Metadata

Metadata

Assignees

No one assigned

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions