Python 3.x PyPDF2不打印文本的任何输出

Python 3.x PyPDF2不打印文本的任何输出,python-3.x,pypdf2,Python 3.x,Pypdf2,我正在尝试使用PyPDF2从pdf打印文本。这是我的密码: import PyPDF2 pdf_file = open('report.pdf', 'rb') read_pdf = PyPDF2.PdfFileReader(pdf_file) number_of_pages = read_pdf.getNumPages() page = read_pdf.getPage(1) page_content = page.extractText() print (page_content.encode

我正在尝试使用PyPDF2从pdf打印文本。这是我的密码:

import PyPDF2
pdf_file = open('report.pdf', 'rb')
read_pdf = PyPDF2.PdfFileReader(pdf_file)
number_of_pages = read_pdf.getNumPages()
page = read_pdf.getPage(1)
page_content = page.extractText()
print (page_content.encode('utf-8'))
结果,我得到了一些警告空行

PdfReadWarning: Xref table not zero-indexed. ID numbers for objects will be corrected. [pdf.py:1736]
b''

我已经检查过,这个警告本身不会影响结果,但在我的情况下,我什么也没有得到。任何建议。谢谢

尝试如下更改代码:

import PyPDF2
pdf_file = open('report.pdf', 'rb')
read_pdf = PyPDF2.PdfFileReader(pdf_file)
number_of_pages = read_pdf.getNumPages()
page_content = read_pdf.getPage(1).extractText()
print (page_content.encode('utf-8','strict'))

不。我还是得到了这个
b'\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n'