This page serves as a guide for using the PDFCrowd API to extract text from PDF in Python applications.
Below are Python examples to help you quickly get started with the API. Explore our additional examples for more insights.
To access the API, you will need to use your PDFCrowd username and API key. For initial testing, you may use the following demo credentials without registering:
demo
ce544b6ea52a5621fb9d55f8b542d14d
To obtain your personal API credentials, start a free trial or purchase the API license.
It is recommended that you implement error handling to catch errors the API may return. Effective error handling is vital as it ensures application stability and provides clearer diagnostics. See the example code below for guidance on implementing error handling, and refer to this list of status codes for more information.
try: # call the API except pdfcrowd.Error as why: # print the error sys.stderr.write('Pdfcrowd Error: {}\n'.format(why)) # print the error code sys.stderr.write('Pdfcrowd Error Code: {}\n'.format(why.getCode())) # print the error message sys.stderr.write('Pdfcrowd Error Message: {}\n'.format(why.getMessage()))
Refer to the PDF to Text Python Reference for a description of all API methods.