In previous article, we can extract text on a PDF file using PyPDF2.
I will introduce PyPDF3 in this article.
PyPDF2 and PyPDF3 exist
Which should I use PyPDF2 or PyPDF3 ??
Check the PyPI
Does PyPDF3 exist on PyPI? Check with
This is PyPDF2.
1pip search PyPDF2 2> PyPDF2 (1.26.0) - PDF toolkit
This is PyPDF3.
1pip search PyPDF3 2> PyPDF3 (1.0.1) - Pure Python PDF toolkit
Both are really present!!
What is PyPDF3 ?
Initial goals are to fully implement existing features and fix some of the most critical bugs/performance issues from PyPDF2 before moving on to new functionality.
However, development is not active as far as seeing the commit log.
All of the story is discussed in a certain github issue
As a further investigation, I got to one github issue.
- PyPDF2 core maintainer had not updated it because of busy
- However he has decided to restart to update PyPDF2
- Developers also discuss PyPDF3 in that issues
We can use PyPDF2 without problems.