Analyse PDF
PDF metadata
Basic info:
pdfinfo my.pdf
Checks PDF for validation errors:
qpdf --check my.pdf
PFD diff tools
Diffoscope
- Best PDF diff tool so far !
- Website
- Debian Salsa Gitlab
Install:
pamac install python-pypdf
sudo pacman -S diffoscope tinyxxd python-pdfminer
Usage:
diffoscope resume.pdf /tmp/resume.pdf
Issues:
- On Arch linux, the
python-pdfminerpackage installs/usr/bin/dumppdf.py, which cannot be found bydiffoscope, because it expectsdumppdfto be in the PATH. It results inFileNotFoundError: [Errno 2] No such file or directory: 'dumppdf'- Solution:
ln -s /usr/bin/dumppdf.py ~/bin/download/dumppdf
- Solution:
diff-pfd
Can't get a useful diff at all :/
pdf-diff
- GitHub
- Python
- AUR package: pdf-diff-git
- Fails to install (
ModuleNotFoundError: No module named 'setuptools')
- Fails to install (
pipx install pdf-difffails to install with:interface.cpp:96:20: error: ‘PyUnicode_FromUnicode’ was not declared in this scope; did you mean ‘PyUnicode_FromString’?
Other diff viewers
- Online: diffchecker
Outdated:
- pdfdiff Last commit 2022-03
- visual-pdf-diff
- Last commit 2014