Skip to content

Analyse PDF

PDF metadata

Basic info:

pdfinfo my.pdf

Checks PDF for validation errors:

qpdf --check my.pdf

PFD diff tools

Diffoscope

Install:

pamac install python-pypdf
sudo pacman -S diffoscope tinyxxd python-pdfminer

Usage:

diffoscope resume.pdf /tmp/resume.pdf

Issues:

  • On Arch linux, the python-pdfminer package installs /usr/bin/dumppdf.py, which cannot be found by diffoscope, because it expects dumppdf to be in the PATH. It results in FileNotFoundError: [Errno 2] No such file or directory: 'dumppdf'
    • Solution: ln -s /usr/bin/dumppdf.py ~/bin/download/dumppdf

diff-pfd

diff-pdf

Can't get a useful diff at all :/

pdf-diff

  • GitHub
  • Python
  • AUR package: pdf-diff-git
    • Fails to install (ModuleNotFoundError: No module named 'setuptools')
  • pipx install pdf-diff fails to install with: interface.cpp:96:20: error: ‘PyUnicode_FromUnicode’ was not declared in this scope; did you mean ‘PyUnicode_FromString’?

Other diff viewers

Outdated: