PDF to HTML / Ruby Examples

This page contains various examples of using the PDF to HTML API in Ruby. The examples are complete and fully functional. Read more about how to convert PDF to HTML in Ruby.

Basic examples
Rails examples

Basic examples

PDF file to HTML file

require "pdfcrowd"

begin
    # Create an API client instance.
    client = Pdfcrowd::PdfToHtmlClient.new("demo", "ce544b6ea52a5621fb9d55f8b542d14d")

    # Run the conversion and save the result to a file.
    client.convertFileToFile("/path/to/logo.pdf", "logo.html")

rescue Pdfcrowd::Error => why
    STDERR.puts "PDFCrowd Error: #{why}"
    raise
end

PDF file to in-memory HTML

require "pdfcrowd"

begin
    # Create an API client instance.
    client = Pdfcrowd::PdfToHtmlClient.new("demo", "ce544b6ea52a5621fb9d55f8b542d14d")

    # Run the conversion and store the result in the `html` variable.
    html = client.convertFile("/path/to/logo.pdf")

    # at this point the "html" variable contains HTML raw data and
    # can be sent in an HTTP response, saved to a file, etc.

rescue Pdfcrowd::Error => why
    STDERR.puts "PDFCrowd Error: #{why}"
    raise
end

PDF file to HTML stream

require "pdfcrowd"

begin
    # Create an API client instance.
    client = Pdfcrowd::PdfToHtmlClient.new("demo", "ce544b6ea52a5621fb9d55f8b542d14d")

    # Create an output stream for the conversion result
    output_stream = open("logo.html", "wb")

    # run the conversion and write the result to the output stream.
    client.convertFileToStream("/path/to/logo.pdf", output_stream)

    # Close the output stream.
    output_stream.close()

rescue Pdfcrowd::Error => why
    STDERR.puts "PDFCrowd Error: #{why}"
    raise
end

PDF url to HTML file

require "pdfcrowd"

begin
    # Create an API client instance.
    client = Pdfcrowd::PdfToHtmlClient.new("demo", "ce544b6ea52a5621fb9d55f8b542d14d")

    # Run the conversion and save the result to a file.
    client.convertUrlToFile("https://pdfcrowd.com/static/pdf/apisamples/invoice.pdf", "invoice.html")

rescue Pdfcrowd::Error => why
    STDERR.puts "PDFCrowd Error: #{why}"
    raise
end

PDF url to in-memory HTML

require "pdfcrowd"

begin
    # Create an API client instance.
    client = Pdfcrowd::PdfToHtmlClient.new("demo", "ce544b6ea52a5621fb9d55f8b542d14d")

    # Run the conversion and store the result in the `html` variable.
    html = client.convertUrl("https://pdfcrowd.com/static/pdf/apisamples/invoice.pdf")

    # at this point the "html" variable contains HTML raw data and
    # can be sent in an HTTP response, saved to a file, etc.

rescue Pdfcrowd::Error => why
    STDERR.puts "PDFCrowd Error: #{why}"
    raise
end

PDF url to HTML stream

require "pdfcrowd"

begin
    # Create an API client instance.
    client = Pdfcrowd::PdfToHtmlClient.new("demo", "ce544b6ea52a5621fb9d55f8b542d14d")

    # Create an output stream for the conversion result
    output_stream = open("invoice.html", "wb")

    # run the conversion and write the result to the output stream.
    client.convertUrlToStream("https://pdfcrowd.com/static/pdf/apisamples/invoice.pdf", output_stream)

    # Close the output stream.
    output_stream.close()

rescue Pdfcrowd::Error => why
    STDERR.puts "PDFCrowd Error: #{why}"
    raise
end

In-memory PDF to HTML file

require "pdfcrowd"

begin
    # Create an API client instance.
    client = Pdfcrowd::PdfToHtmlClient.new("demo", "ce544b6ea52a5621fb9d55f8b542d14d")

    # Run the conversion and save the result to a file.
    client.convertRawDataToFile(open('/path/to/hello_world.pdf', 'rb').read(), "logo.html")

rescue Pdfcrowd::Error => why
    STDERR.puts "PDFCrowd Error: #{why}"
    raise
end

In-memory PDF to in-memory HTML

require "pdfcrowd"

begin
    # Create an API client instance.
    client = Pdfcrowd::PdfToHtmlClient.new("demo", "ce544b6ea52a5621fb9d55f8b542d14d")

    # Run the conversion and store the result in the `html` variable.
    html = client.convertRawData(open('/path/to/hello_world.pdf', 'rb').read())

    # at this point the "html" variable contains HTML raw data and
    # can be sent in an HTTP response, saved to a file, etc.

rescue Pdfcrowd::Error => why
    STDERR.puts "PDFCrowd Error: #{why}"
    raise
end

In-memory PDF to HTML stream

require "pdfcrowd"

begin
    # Create an API client instance.
    client = Pdfcrowd::PdfToHtmlClient.new("demo", "ce544b6ea52a5621fb9d55f8b542d14d")

    # Create an output stream for the conversion result
    output_stream = open("logo.html", "wb")

    # run the conversion and write the result to the output stream.
    client.convertRawDataToStream(open('/path/to/hello_world.pdf', 'rb').read(), output_stream)

    # Close the output stream.
    output_stream.close()

rescue Pdfcrowd::Error => why
    STDERR.puts "PDFCrowd Error: #{why}"
    raise
end

Get info about the current conversion

require "pdfcrowd"

begin
    # Create an API client instance.
    client = Pdfcrowd::PdfToHtmlClient.new("demo", "ce544b6ea52a5621fb9d55f8b542d14d")

    # Configure the conversion.
    client.setDebugLog(true)

    # Run the conversion and save the result to a file.
    client.convertFileToFile("/path/to/logo.pdf", "logo.html")
    
    # print URL pointing to the debug log for this request.
    puts "Debug log url: #{client.getDebugLogUrl()}"
    
    # print Number of conversion credits remaining in your account.
    puts "Remaining credit count: #{client.getRemainingCreditCount()}"
    
    # print Number of credits consumed for this conversion.
    puts "Consumed credit count: #{client.getConsumedCreditCount()}"
    
    # print Unique identifier assigned to this conversion job.
    puts "Job id: #{client.getJobId()}"
    
    # print Total number of pages in the output document.
    puts "Page count: #{client.getPageCount()}"
    
    # print Size of the output data in bytes.
    puts "Output size: #{client.getOutputSize()}"

rescue Pdfcrowd::Error => why
    STDERR.puts "PDFCrowd Error: #{why}"
    raise
end

Rails examples