Splitting a PDF page in two

Question

I have a PDF file that was the result of the scan of a book.

In this file 2 pages of the book correspond to 1 in the PDF. So when I see a page in the PDF file I'm actually seeing 2 pages of the book.

enter image description here

(original)

I would like to know if there's any way to convert this file to another PDF where 1 page of the book corresponds to 1 page of the PDF i.e. the normal situation.

Peque · Answer 1 · 2019-07-01T08:02:12.180

73

You can use mutool, a MuPDF command-line tool (sudo apt-get install mupdf-tools):

mutool poster -x 2 input.pdf output.pdf

You can also use -y if you want to perform a vertical split.

edited Jul 01 '19 at 08:02

answered Nov 15 '15 at 16:18

Peque

1,258

neydroydrec · Accepted Answer · 2011-08-12T17:46:18.453

Try Gscan2pdf, which you can download from the Software Centre or which you can install from command line sudo apt-get install gscan2pdf.

Open Gscan2Pdf:

file > import your PDF file;

Now you have a single page (see the left column):
then tools > Clean up;
select double as layout and #output pages as 2, then click OK;
Gscan2pdf splits your document (among other things, it will also clean it up and deskew it etc.) Now you have two pages:
Save your PDF file if you're satisfied with the result.

score 17 · Answer 3 · answered Aug 12 '11 at 17:53

17

I would use Briss. It lets you select various regions of each page, each of which to turn into a new page.

enter image description here

answered Aug 12 '11 at 17:53

frabjous

6,601

Curtis · Answer 4 · 2014-02-19T16:07:04.613

Another option is ScanTailor. This program is particularly well suited to processing several scans at a time.

apt-get install scantailor

It unfortunately only works on image file inputs, but it's simple enough to convert a scanned PDF to a jpg. Here's a one-liner that I used for converting a whole directory of PDFs into jpgs. If a PDF has n pages, it makes n jpg files.

for f in ./*.pdf; do gs -q -dSAFER -dBATCH -dNOPAUSE -r300 -dGraphicsAlphaBits=4 -dTextAlphaBits=4 -sDEVICE=png16m "-sOutputFile=$f%02d.png" "$f" -c quit; done;

I had screenshots ready to share, but I don't have enough rep to post them.

ScanTailor outputs to tif, so if you want the files back in PDF you can use this to make a PDF for each page.

for f in ./*.tif; do tiff2pdf "$f" -o "$f".pdf -p letter -F; done;

Then you can use this one-liner, or an application like PDFShuffler to merge any or all files into one PDF.

gs -q -sPAPERSIZE=letter -dNOPAUSE -dBATCH -sDEVICE=pdfwrite -sOutputFile=output.pdf *.pdf

tanius · Answer 5 · 2022-12-14T15:32:51.493

A command line solution using ImageMagick:

Split the PDF into individual images, here at 300 dpi resolution:
```
 convert -density 300 orig.pdf page.png
```

Split each page image into a left and right image:

 for file in page-*.png;
   do convert "$file" -crop 50%x100% "$file-split.png";
 done

Rename the page-###-split-#.png files to just 001.png, 002.png etc.:

 ls page-*-split-*.png | cat -n | 
   while read n f; do mv "$f" $(printf "%03d.png" $n); done

Combine the resulting page images into a PDF again:
```
 convert [0-9][0-9][0-9].png result.pdf
```

Sources, variations and further tips:

Crop and split book scan in 3 commands, here modified to use a for loop command to prevent memory issues.
Answer: Renaming files in a folder to sequential numbers, together with this comment
Answer: ImageMagick: convert quits after some pages, in case you are running into ImageMagick memory limits (which I did).

score 1 · Answer 6 · edited Aug 15 '24 at 01:45

Here is a python script for this:

# Source http://stackoverflow.com/a/15741856/1301753
import copy
import sys
import math
import pyPdf
def split_pages(src, dst):
    src_f = file(src, 'r+b')
    dst_f = file(dst, 'w+b')
input = pyPdf.PdfFileReader(src_f)
output = pyPdf.PdfFileWriter()

for i in range(input.getNumPages()):
    p = input.getPage(i)
    q = copy.copy(p)
    q.mediaBox = copy.copy(p.mediaBox)

    x1, x2 = p.mediaBox.lowerLeft
    x3, x4 = p.mediaBox.upperRight

    x1, x2 = math.floor(x1), math.floor(x2)
    x3, x4 = math.floor(x3), math.floor(x4)
    x5, x6 = math.floor(x3/2), math.floor(x4/2)

    if x3 &gt; x4:
        # horizontal
        p.mediaBox.upperRight = (x5, x4)
        p.mediaBox.lowerLeft = (x1, x2)

        q.mediaBox.upperRight = (x3, x4)
        q.mediaBox.lowerLeft = (x5, x2)
    else:
        # vertical
        p.mediaBox.upperRight = (x3, x4)
        p.mediaBox.lowerLeft = (x1, x6)

        q.mediaBox.upperRight = (x3, x6)
        q.mediaBox.lowerLeft = (x1, x2)

    output.addPage(p)
    output.addPage(q)

output.write(dst_f)
src_f.close()
dst_f.close()


input_file=raw_input("Enter the original PDF file name :")
output_file=raw_input("Enter the splitted PDF file name :")
split_pages(input_file,output_file)

I hold a copy of this on my personal github site...

score 0 · Answer 7 · answered Aug 06 '16 at 08:45

0

Sejda can do that either using its web interface or command line interface (open source). The task is called splitdownthemiddle

answered Aug 06 '16 at 08:45

Andrea Vacondio

133

score -1 · Answer 8 · answered Jun 11 '17 at 08:26

-1

You could use okular or any pdf reader and then use print to file and select options and copies-> pages . Select your interested pages and then give print. It will cut the selected pages . Simple and easy !!

answered Jun 11 '17 at 08:26

Knight71

99

score -2 · Answer 9 · edited Nov 03 '12 at 12:36

-2

There is a wonderful program scankromsator. It is free and works quite well through wine. More information here.

edited Nov 03 '12 at 12:36

Evandro Silva

9,922

answered Mar 16 '12 at 17:40

oromay

1

Splitting a PDF page in two

9 Answers9

Linked