reading PDF file with pypdf no contents are captured. Please help

Discussion in 'Python' started by sujan.dasmahapatra, Jul 31, 2013.

sujan.dasmahapatra Member

Joined:: Jun 11, 2009

Messages:: 39

Likes Received:: 0

Trophy Points:: 6

Gender:: Male

I am trying to read a PDF file using pypdf and write onto a text file. But its not working. content value in the below code is just "u/n/n/n/n/n'...PDF file has 5 pages so 5 times new line character and in the begining 'u'..whats going wrong please help. why the contents are not coming. Any help is highly appreciated. Thanks Sujan
Code:
#!/usr/bin/python
import pyPdf
import sys

def getPDFContent(path):
    content = ""
    p = file(path, "rb")
    pdf = pyPdf.PdfFileReader(p)
    for i in range(0, pdf.getNumPages()):
        content += pdf.getPage(i).extractText() + "\n"
    content = " ".join(content.replace(u"\xa0", " ").strip().split())
    return content

def main():
    f= open('test.txt','w')
    pdfl = getPDFContent("test.pdf").encode("ascii", "ignore")
    f.write(pdfl)
    f.close()

if __name__ == "__main__":
    main()

Last edited: Jul 31, 2013

sujan.dasmahapatra, Jul 31, 2013

SHARE #1

(You must log in or sign up to reply here.)

Share This Page

New Profile Posts

Deleted member 155909 ► shabbir
Hi, can you please delete my profile please?

Nov 7, 2024

•••
Hanginium65 ► shabbir
I was trying to post a question in the Forum but I got an alert instead with the message "Your content can not be submitted. This is likely because your content is spam-like or contains inappropriate elements. Please change your content or try again later. If you still have problems, please contact an administrator." I see that also others have had similar problems in the past. May I please have your assistance?

Jul 25, 2024

•••
emmawilson
hello!

Apr 25, 2024

•••
unni krishnan.r ► shabbir
Hello Boss,

How are you, long time no talk :)
Hope everything is good at your end

Oct 19, 2022

•••
Austin Lucas
Austin, a writer and Trend Observer. I enjoy writing content about Trends In On-Demand Mobile App Developments.

Aug 26, 2022

•••

Log in or Sign up

reading PDF file with pypdf no contents are captured. Please help

sujan.dasmahapatra Member

Share This Page

Useful Searches