This page describes how to use the Pdfcrowd online API to convert PDF to HTML
in Python.
The API is user-friendly and can be integrated into your application with just a few lines of code.
Here are Python examples for quickly getting started with the API.
# Example: convert a local PDF file
import pdfcrowd
import sys

try:
    # create the API client instance
    client = pdfcrowd.PdfToHtmlClient('demo', 'ce544b6ea52a5621fb9d55f8b542d14d')

    # run the conversion and write the result to a file
    client.convertFileToFile('/path/to/logo.pdf', 'logo.html')
except pdfcrowd.Error as why:
    sys.stderr.write('Pdfcrowd Error: {}\n'.format(why))
    raise

# Example: convert a PDF available at a URL
import pdfcrowd
import sys

try:
    # create the API client instance
    client = pdfcrowd.PdfToHtmlClient('demo', 'ce544b6ea52a5621fb9d55f8b542d14d')

    # run the conversion and write the result to a file
    client.convertUrlToFile('https://pdfcrowd.com/static/pdf/apisamples/invoice.pdf', 'invoice.html')
except pdfcrowd.Error as why:
    sys.stderr.write('Pdfcrowd Error: {}\n'.format(why))
    raise

# Example: convert PDF data held in memory
import pdfcrowd
import sys

try:
    # create the API client instance
    client = pdfcrowd.PdfToHtmlClient('demo', 'ce544b6ea52a5621fb9d55f8b542d14d')

    # read the PDF bytes, run the conversion and write the result to a file
    with open('/path/to/hello_world.pdf', 'rb') as f:
        client.convertRawDataToFile(f.read(), 'logo.html')
except pdfcrowd.Error as why:
    sys.stderr.write('Pdfcrowd Error: {}\n'.format(why))
    raise
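The three examples above differ only in the kind of input they pass to the client. As a rough sketch, a small helper could pick the matching method based on the source value. The helper name `convert_to_file` and the `RecordingClient` stand-in are hypothetical and exist only so the sketch runs without network access; the three client method names are the ones used in the examples above.

```python
def convert_to_file(client, source, out_path):
    """Call the client method that matches the kind of source given.

    Returns a short label naming the path taken (an illustrative convention).
    """
    if isinstance(source, bytes):
        # raw PDF bytes already held in memory
        client.convertRawDataToFile(source, out_path)
        return 'raw'
    if source.startswith(('http://', 'https://')):
        # a PDF reachable at a URL
        client.convertUrlToFile(source, out_path)
        return 'url'
    # anything else is treated as a local file path
    client.convertFileToFile(source, out_path)
    return 'file'


class RecordingClient:
    """Hypothetical stand-in for pdfcrowd.PdfToHtmlClient; it only records calls."""

    def __init__(self):
        self.calls = []

    def convertFileToFile(self, path, out_path):
        self.calls.append(('convertFileToFile', path, out_path))

    def convertUrlToFile(self, url, out_path):
        self.calls.append(('convertUrlToFile', url, out_path))

    def convertRawDataToFile(self, data, out_path):
        self.calls.append(('convertRawDataToFile', data, out_path))


demo_client = RecordingClient()
convert_to_file(demo_client, '/path/to/logo.pdf', 'logo.html')
```

With the real library, the same helper would be called with a `pdfcrowd.PdfToHtmlClient` instance in place of `RecordingClient`.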
Authentication
The credentials to access the API are your Pdfcrowd username and the API key.
You can try out the API without registering using the following demo
credentials:
Username: demo
API key: ce544b6ea52a5621fb9d55f8b542d14d
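One possible way to keep real credentials out of source code is to read them from the environment and fall back to the demo credentials used in the examples above. This is a minimal sketch: the variable names `PDFCROWD_USERNAME` and `PDFCROWD_API_KEY` and the helper `load_credentials` are assumptions made for illustration and are not defined by the API.

```python
import os

# Demo credentials taken from the examples above.
DEMO_USERNAME = 'demo'
DEMO_API_KEY = 'ce544b6ea52a5621fb9d55f8b542d14d'


def load_credentials(env=None):
    """Return (username, api_key), preferring environment variables.

    PDFCROWD_USERNAME and PDFCROWD_API_KEY are hypothetical names chosen
    for this sketch; the API itself does not define them.
    """
    env = os.environ if env is None else env
    username = env.get('PDFCROWD_USERNAME')
    api_key = env.get('PDFCROWD_API_KEY')
    if username and api_key:
        return username, api_key
    # fall back to the demo account when either value is missing
    return DEMO_USERNAME, DEMO_API_KEY


username, api_key = load_credentials()
```

The returned pair can then be passed to the client constructor, for example `pdfcrowd.PdfToHtmlClient(username, api_key)`.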
Error Handling
It is recommended that you implement error handling to catch errors
that the API may return; see the example code below. A list of status
codes and their descriptions can be found in the Pdfcrowd API
documentation.
try:
    # call the API, for example with a client created as in the examples above
    client.convertFileToFile('/path/to/logo.pdf', 'logo.html')
except pdfcrowd.Error as why:
    # print the error
    sys.stderr.write('Pdfcrowd Error: {}\n'.format(why))
    # print the error code
    sys.stderr.write('Pdfcrowd Error Code: {}\n'.format(why.getCode()))
    # print the error message
    sys.stderr.write('Pdfcrowd Error Message: {}\n'.format(why.getMessage()))
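If several conversions need the same reporting, the pattern above could be wrapped in a helper. The sketch below is one way to do that, not part of the library: `run_conversion` and `FakeError` are hypothetical names, and `FakeError` merely imitates the `getCode()` and `getMessage()` methods shown above so the sketch can run without calling the API.

```python
import io
import sys


def run_conversion(convert, error_type, stream=sys.stderr):
    """Call convert(); on error_type, report code and message and return False."""
    try:
        convert()
    except error_type as why:
        stream.write('Pdfcrowd Error Code: {}\n'.format(why.getCode()))
        stream.write('Pdfcrowd Error Message: {}\n'.format(why.getMessage()))
        return False
    return True


class FakeError(Exception):
    """Stand-in for pdfcrowd.Error, used only so the sketch runs offline."""

    def __init__(self, code, message):
        super().__init__(message)
        self._code = code
        self._message = message

    def getCode(self):
        return self._code

    def getMessage(self):
        return self._message


def failing_conversion():
    # simulates a conversion that the API rejects
    raise FakeError(400, 'bad request')


report = io.StringIO()
succeeded = run_conversion(failing_conversion, FakeError, stream=report)
```

With the real library, a call might look like `run_conversion(lambda: client.convertFileToFile('/path/to/logo.pdf', 'logo.html'), pdfcrowd.Error)`.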