
pypdf_text_extraction_page_iteration_quickstart.py

python

Extracts text from a PDF, merges multiple files, and iterates through pages to re

15d ago · 9 lines · pypdf.readthedocs.io
from pypdf import PdfReader

# Open the PDF and count its pages
reader = PdfReader("example.pdf")
number_of_pages = len(reader.pages)

# Extract the text of the first page (pages are zero-indexed)
page = reader.pages[0]
text = page.extract_text()

print(f"Number of pages: {number_of_pages}")
print(f"Content of first page:\n{text}")
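
The snippet above reads only the first page. To go through every page, as the description mentions, something like the following minimal sketch might work. The helper names `iter_page_text` and `print_all_pages` are hypothetical and not part of pypdf; the only pypdf features used are `PdfReader`, `reader.pages`, and `page.extract_text()`, all of which appear in the original snippet.

```python
def iter_page_text(reader):
    """Yield (page_number, text) for each page, numbering from 1.

    `reader` is assumed to be a pypdf PdfReader, or any object whose
    `.pages` items provide an `extract_text()` method.
    """
    for number, page in enumerate(reader.pages, start=1):
        # Fall back to an empty string when no text comes back,
        # e.g. for a scanned page that has no text layer.
        yield number, page.extract_text() or ""


def print_all_pages(path):
    """Hypothetical convenience wrapper: print the text of every page."""
    from pypdf import PdfReader  # imported here so the helper above stays dependency-free

    reader = PdfReader(path)
    for number, text in iter_page_text(reader):
        print(f"--- Page {number} ---")
        print(text)
```

Calling `print_all_pages("example.pdf")` would print each page's text under a numbered separator. Keeping the iteration in a generator means pages are processed one at a time, which may help with large documents.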