186

I am looking for an open source command line tool to crop PDF file just like we can do in Adobe Acrobat Pro. I have tried PdfTk, ImageMagick, PyPDF, and GhostScript—all with no success so far.

Rakesh
  • 2,154

11 Answers11

221

I would suggest you take a look at PDFcrop.

If you wish to crop a pdf with left, top, right and bottom margins of 5, 10, 20, and 30 pt (points), then run

pdfcrop --margins '5 10 20 30' input.pdf output.pdf

in terminal. To actually crop something away, use negative values in the argument for crop. For example,

pdfcrop --margins '-50 -50 -50 -50' input.pdf output.pdf

crops 50 pts from the left, top, right, bottom (in this order).

If you run only the command pdfcrop input, it will output a file titled input-crop.pdf with zero margins. I find this very handy when including pdf illustrations in documents.

Cropping multiple files

Unfortunately, pdfcrop cannot crop multiple files at the time. It is however easy to write a script that will crop all pdfs in the folder the script is located in.

Create a new empty file, and call it something.sh. Open it with a text editor and insert the following:

#!/bin/bash
for FILE in ./*.pdf; do
  pdfcrop "${FILE}"
done

Save it, and close. Then right click the file, go to Properties > Permissions and check the field Allow executing file as program (or run chmod +x /path/to/something.sh in terminal). Now close the dialog. Run the script by double clicking it and choosing Run in Terminal. And new, zero-margin cropped version of all pdfs with suffix -crop will now be printed in the folder. If you want margins or other things, you can of course just open the script and add arguments after pdfcrop.

Rasmus
  • 8,655
68

Thanks for Rasmus, you can install pdfcrop from texlive-extra-utils package:

sudo apt-get install texlive-extra-utils

Then crop pdf files using pdf crop command as:

pdfcrop input.pdf output.pdf

use --help to see more amazing parameters like --margins

pdfcrop --margins 5 input.pdf output.pdf

which crop pdf with 5 bp from each side of page

27

When I can't do something with pdftk, the next place I turn is PDFjam, which is a command-line wrapper for the pdfpages LaTeX package (hence you also need that and a TeX distro installed). For help on how to use it, I recommend the regular help screen:

pdfjam --help

as the man page is sparse and the Web page concentrates on examples.

To crop a PDF, the command you need is something like this:

pdfjam --keepinfo --trim "10mm 15mm 10mm 15mm" --clip true --suffix "cropped" input.pdf

This will output a file called input-cropped.pdf. The order of the trims should be left, bottom, right, top, as per \includegraphics from graphicx.

To give an idea of how it compares with PDFcrop, I had cause to crop a quite fancy PDF recently. My original was 675 kB, my cropped version via PDFjam was 1.2 MB, while a version cropped via PDFcrop was 4.5 MB. While both PDFjam and PDFcrop stripped out the embedded hyperlinks and bookmarks, PDFjam with the --keepinfo option preserved the document properties (e.g. title, author, subject).

Alex Ball
  • 444
26

You can also crop PDF files simply using Ghostscript. I have written a small script to simplify the process (inspired by this answer):

#!/bin/bash

if [ $# -lt 5 ]
then
  echo "Usage: `basename $0` <pdf-file> <x_min> <x_max> <y_min> <y_max>"
  echo "Notes:"
  echo " - all coordinates are absolute; no calculation of width/height necessary"
  echo " - use 'gv' to determine the coordinates"
  exit 65
fi

file="$1"
xmin="$2"
xmax="$3"
ymin="$4"
ymax="$5"

base="${file%.*}"
outfile="${base}_cropped.pdf"

echo "writing to: $outfile"

gs \
  -o $outfile \
  -sDEVICE=pdfwrite \
  -c "[/CropBox [$xmin $ymin $xmax $ymax] /PAGES pdfmark" \
  -f $file

In order to determine the coordinates for cropping, I use gv, which prints the coordinates of the mouse cursor using the same units as Ghostscript. For example, here I determine the minimum coordinates for x/y (the values in the upper left corner):

crop1

Now the maximum coordinates:

crop2

And finally, I run the script pdf_crop_by_coordinates.sh test.pdf 45 429 38 419 producing a test_cropped.pdf which looks like that:

result

I have no idea though, how the Ghostscript solution compares to pdfcrop in terms of quality and correctness.

bluenote10
  • 1,933
12

Briss is not command line, but worth a look at.

weberjn
  • 271
10

If a graphical tool is also fine I would recommend krop: http://arminstraub.com/software/krop

Sunday
  • 803
9

The pdfCropMargins program is a command-line application to automatically crop the margins of PDF files.

This program depends on either the Ghostscript program or the pdftoppm program being installed (and locatable) on the system. And analyze the page images with PIL to find bounding boxes, using the threshold 191.

install using

pip install pdfCropMargins

Run using

pdf-crop-margins -v -s -u your-file.pdf

For help

pdf-crop-margins -h | more

4

This may help you.
This is in accordance with the newer version of Ubuntu and life. This is Master PDF Editor. You can use it crop, add some stuff, etc.

Example:
This is before This is before This is after ctrl + k enter image description here

Shamina
  • 258
  • 2
  • 8
2

You could use a pypdf script from this page. But in the answer to this stackexchange question, there seem to be many options as well.

xubuntix
  • 5,580
1

PDF CPU is a good option:

pdfcpu crop '0 0 17' in.pdf out.pdf

https://pdfcpu.io/core/crop

Zombo
  • 1
0

Using pdfjam you can set any option which comes from the package pdfpages originally, and so you can set --fitpaper true, it should adapt the the paper size to fit to the (first) input file.

I had a case to combine multiple JPG into a PDF and adding metadata as well and trim it to incoming jpg size. This will work for equal sizes of input JPGs to be cropped well and have no white margins:

# equal sizes of input images
pdfjam  Image_00001.jpg Image_00002.jpg Image_00003.jpg Image_00004.jpg Image_00005.jpg \
  --pdftitle   'My custom title' \
  --pdfsubject 'My custom subject' \
  --pdfkeywords 'keyword1; keyword2; keyword3; aso;' \
  --pdfauthor 'author1; author2;' \
  --fitpaper true --outfile 'Images_Combined.pdf'

pdfinfo 'Images_Combined.pdf' # testing results for meta data

Title: My custom title

Subject: My custom subject

Keywords: keyword1; keyword2; keyword3; aso;

Author: author1; author2;

Creator: LaTeX with hyperref

Producer: pdfTeX-1.40.22

CreationDate: Sun May 1 15:03:01 2022 CEST

ModDate: Sun May 1 15:03:01 2022 CEST

…

Note: If you have different sizes of input JPG, it will fit the paper size to the first one and scale all remaining pages to be merged to the size of that first one, so some pages will have no white margins and some will have.

Edit: this was done from TeX Live 2021 with pdfjam --version 3.03 and package pdfpages from 2021/03/06