Loading...
Convert
DEMO
New
Save
Share
Clone
Settings
?
Help
»
Input
+

You can upload an HTML file, or a zip, tar.gz, or tar.bz2 archive. The archive can contain external files like images, stylesheets and scripts.

Set the filename of the main HTML document stored in the input archive. If not specified, the first HTML file in the archive is used for conversion. Use this method if the input archive contains multiple HTML documents.
Options
+
Conversion Format
The output file format.
Possible values: "png", "jpg", "gif", "tiff", "bmp", "ico", "ppm", "pgm", "pbm", "pnm", "psb", "pct", "ras", "tga", "sgi", "sun", "webp"
Image Output
Set the output image width in pixels. The accepted range is 96-65000.
Set the output image height in pixels. If it is not specified, actual document height is used. Must be a positive integer.
Set the scaling factor (zoom) for the output image. Must be a positive integer.
The output image background color in RGB or RGBA hex format. Use transparent (00000000) for PNG overlays or solid colors for web display. It is available for API client version >= 5.0.0 and applicable for converter version >= 20.10. More PDFCrowd versioning details. The value must be in RRGGBB or RRGGBBAA hexadecimal format.
General Options
Use the print version of the page if available via @media print CSS rules. Enable this when converting websites that have print-optimized styles. Many sites hide navigation, ads, and sidebars in print mode. Produces cleaner PDFs by using the design the website creator intended for printing.
Do not print the background graphics to create printer-friendly PDFs. Use this when documents will be physically printed to save ink costs and improve readability. Removes background colors, images, and patterns while preserving text and foreground content. Particularly useful for documents with dark backgrounds or decorative elements.
Do not execute JavaScript during conversion. Use this to improve conversion speed when JavaScript is not needed, prevent dynamic content changes, or avoid security risks from untrusted scripts. Note that disabling JavaScript means lazy-loaded images and AJAX content will not load.
Do not load images during conversion to create text-only PDFs. Use this to significantly speed up conversion, reduce file size, or create accessible text-focused documents. Ideal for converting documentation where images are not needed, reducing bandwidth usage, or creating lightweight PDFs for email distribution.
Disable loading fonts from remote sources. Use this to speed up conversion by avoiding font download delays, ensure consistent rendering with system fonts, or work around font loading failures. Note that text will fall back to system fonts, which may change the document's appearance.
Use a mobile user agent when making requests to the source URL. It is available for API client version >= 5.3.0 and applicable for converter version >= 20.10. More PDFCrowd versioning details.
Specifies how iframes are handled during conversion. Use "all" to include all embedded content (videos, maps, widgets). Use "same-origin" to include only content from the same domain for security purposes. Use "none" to exclude all iframes for faster conversion and to avoid third-party content issues. Disabling iframes can significantly improve performance and reliability. It is available for API client version >= 5.0.0 and applicable for converter version >= 20.10. More PDFCrowd versioning details.
Possible values:
  • "all" - All iframes are loaded.
  • "same-origin" - Only iframes with the same origin as the main page are loaded.
  • "none" - Iframe loading is disabled.
Automatically block common advertising networks and tracking scripts during conversion, producing cleaner PDFs with faster conversion times. Filters out third-party ad content, analytics beacons, and ad network resources. Ideal for converting news sites, blogs, or any ad-heavy content where ads distract from the main message. May occasionally block legitimate third-party content - disable if critical third-party resources are missing.
Specify the character encoding when the HTML lacks proper charset declaration or has incorrect encoding. Prevents garbled text for non-English content, especially legacy pages without UTF-8 encoding. Set to "utf-8" for modern content, "iso-8859-1" for Western European legacy pages, or other encodings for specific regional content. Only needed when auto-detection fails and you see corrupted characters in the output.
Set the locale for the conversion to control regional formatting of dates, times, and numbers. Use this when converting content for specific regions - for example, set to "en-US" for MM/DD/YYYY dates and comma thousand separators, or "de-DE" for DD.MM.YYYY dates and period thousand separators. Essential for financial reports, invoices, or localized content. It is available for API client version >= 5.0.0 and applicable for converter version >= 20.10. More PDFCrowd versioning details.
Set the HTTP authentication user name. Required to access protected web pages or staging environments.
Set the HTTP authentication password. Required to access protected web pages or staging environments.
Set HTTP cookies to be included in all requests made by the converter to access authenticated or session-based content. Use this when converting pages that require login, maintain user sessions, or personalize content based on cookies. Essential for converting member-only areas, dashboards, or any content behind cookie-based authentication. Format as semicolon-separated name=value pairs.
Enforce SSL certificate validation for secure connections, preventing conversions from sites with invalid certificates. Enable when converting from production sites with valid certificates to ensure security. When disabled, allows conversion from any HTTPS site regardless of certificate validity - including development servers with self-signed certificates, internal corporate sites with expired certificates, or local testing environments.
Abort the conversion if the HTTP status code of the main URL is greater than or equal to 400 (client/server errors). Use this in automated workflows to catch broken URLs or authentication failures early rather than producing invalid PDFs. Ensures your system does not silently generate error page PDFs when source content is unavailable.
Abort the conversion if any sub-request (images, stylesheets, scripts) fails with HTTP 400+ errors. Use this for strict quality control when all assets must load successfully.
Do not send the X-Pdfcrowd HTTP header in HTTP requests made by the converter. Use this if your target server blocks or logs requests with this header, or for privacy when you do not want sites to know you are using PDFCrowd. Some security systems may block requests with non-standard headers.
Apply custom CSS to the input HTML document to modify the visual appearance and layout of your content dynamically. Use this to override default styles, adjust spacing, change fonts, or fix layout issues without modifying the source HTML. Use !important in your CSS rules to prioritize and override conflicting styles. It is available for API client version >= 5.14.0 and applicable for converter version >= 20.10. More PDFCrowd versioning details.
Run a custom JavaScript after the document is loaded and ready to print. Use this to modify page content before conversion, remove unwanted elements, or trigger specific page states. The script is intended for post-load DOM manipulation (add/remove elements, update CSS, ...). In addition to the standard browser APIs, the custom JavaScript code can use helper functions from our JavaScript library.
Run a custom JavaScript right after the document is loaded. The script is intended for early DOM manipulation (add/remove elements, update CSS, ...). In addition to the standard browser APIs, the custom JavaScript code can use helper functions from our JavaScript library.
Set a custom HTTP header to be included in all requests made by the converter. Use this to pass authentication tokens to protected sites, add tracking headers for analytics, or provide API keys for accessing private content. Essential when converting content from APIs or internal systems that require special headers for access control. A string containing the header name and value separated by a colon.
Wait the specified number of milliseconds to finish all JavaScript after the document is loaded. Use this to ensure lazy-loaded images, AJAX content, or animations complete before conversion. Your license defines the maximum wait time by "Max Delay" parameter. Must be a positive integer or 0.
Convert only the specified element from the main document and its children. Use this to extract specific portions of a page (like article content) while excluding navigation, headers, footers, or sidebars. The element is specified by one or more CSS selectors. If the element is not found, the conversion fails. If multiple elements are found, the first one is used.
Control how CSS styles are applied when converting only part of a page. The "cut-out" option extracts the element into a new document root, which may break CSS selectors like "body > div". The "remove-siblings" option keeps the element in its original DOM position but deletes other elements, preserving descendant selectors. The "hide-siblings" option keeps all elements but hides non-selected ones with display:none, preserving all CSS context.
Possible values:
  • "cut-out" - The element and its children are cut out of the document.
  • "remove-siblings" - All element's siblings are removed from the DOM. Keeps target element in position but may break descendant CSS selectors.
  • "hide-siblings" - All element's siblings are hidden using display:none. Preserves CSS context while hiding non-target content.
Wait for the specified element in a source document. Use this when specific dynamic content must be ready before conversion, avoiding unnecessary delays from a fixed JavaScript delay. The element is specified by one or more CSS selectors. The element is searched for in the main document and all iframes. If the element is not found, the conversion fails. Your license defines the maximum wait time by the "Max Delay" parameter.
The main HTML element for conversion is detected automatically. Use this when you want to extract article or main content without knowing the exact CSS selector, automatically excluding navigation and sidebars. It is available for API client version >= 5.5.0 and applicable for converter version >= 20.10. More PDFCrowd versioning details.
Automatically enhance the input HTML to improve readability by removing clutter and reformatting content. Use this when converting web pages with excessive navigation, ads, or sidebars that distract from the main content. Different versions (v1-v4) use progressively aggressive algorithms - start with "v1" and increase if more cleanup is needed. Ideal for converting blog posts, articles, or documentation into clean PDFs. It is available for API client version >= 5.5.0 and applicable for converter version >= 20.10. More PDFCrowd versioning details.
Possible values:
  • "none" - No enhancements are used.
  • "readability-v1" - Version 1 of the enhancements is used. Basic cleanup for simple pages with moderate clutter.
  • "readability-v2" - Version 2 of the enhancements is used. More aggressive cleanup for pages with more ads and navigation.
  • "readability-v3" - Version 3 of the enhancements is used. Strong cleanup for heavily cluttered pages with multiple sidebars.
  • "readability-v4" - Version 4 of the enhancements is used. Maximum cleanup for extremely cluttered pages. May remove some content.
Data
Set the input data for template rendering. The data format can be JSON, XML, YAML or CSV.
Load the input data for template rendering from the specified file. The data format can be JSON, XML, YAML or CSV.
Specify the input data format. Use "auto" for automatic detection or explicitly set to JSON, XML, YAML, or CSV when format is known.
Possible values:
  • "auto" - The data format is auto-detected.
  • "json"
  • "xml"
  • "yaml"
  • "csv"
Set the encoding of the data file set by setDataFile.
Ignore undefined variables in the HTML template. The default mode is strict so any undefined variable causes the conversion to fail. You can use {% if variable is defined %} to check if the variable is defined.
Auto escape HTML symbols in the input data before placing them into the output.
Auto trim whitespace around each template command block.
Set the advanced data options:
  • csv_delimiter - The CSV data delimiter, the default is ,.
  • xml_remove_root - Remove the root XML element from the input data.
  • data_root - The name of the root element inserted into the input data without a root node (e.g. CSV), the default is data.
Miscellaneous
Turn on debug logging to troubleshoot conversion issues. Details about the conversion process, including resource loading, rendering steps, and error messages are stored in the debug log. Use this when conversions fail or produce unexpected results. The debug log is available in conversion statistics.
Tag the conversion with a custom value for tracking and analytics. Use this to categorize conversions by customer ID, document type, or business unit. The tag appears in conversion statistics. A value longer than 32 characters is cut off.
A proxy server used by the conversion process for accessing the source URLs with HTTP scheme. This can help circumvent regional restrictions or provide limited access to your intranet. The value must have format DOMAIN_OR_IP_ADDRESS:PORT.
A proxy server used by the conversion process for accessing the source URLs with HTTPS scheme. This can help circumvent regional restrictions or provide limited access to your intranet. The value must have format DOMAIN_OR_IP_ADDRESS:PORT.
A client certificate to authenticate the converter on your web server. The certificate is used for two-way SSL/TLS authentication (mutual TLS) and adds extra security. Use this when converting content from servers that require client certificate authentication for access. The file must exist and not be empty.
A password for the PKCS12 file with a client certificate if the certificate file is password-protected.
Tweaks
Set the maximum time for loading the page and its resources. After this time, all requests will be considered successful. This can be useful to ensure that the conversion does not timeout. Use this method if there is no other way to fix page loading. It is available for API client version >= 5.15.0 and applicable for converter version >= 20.10. More PDFCrowd versioning details. The accepted range is 10-30.
Specify the User-Agent HTTP header that will be used by the converter when a request is made to the converted web page. It is available for API client version >= 6.4.0. More PDFCrowd versioning details.
Possible values:
  • "chrome-desktop" - The user-agent for desktop chrome corresponding to the converter used.
  • "chrome-mobile" - The user-agent for mobile chrome corresponding to the converter used.
  • "latest-chrome-desktop" - The user-agent of the recently released Chrome browser on desktops.
  • "latest-chrome-mobile" - The user-agent of the recently released Chrome browser on mobile devices.
  • A custom string for the user agent.
API Client Options
Set the converter version. Different versions may produce different output. Choose which one provides the best output for your case. It is available for API client version >= 5.0.0 and applicable for converter version >= 20.10. More PDFCrowd versioning details.
Possible values:
  • "24.04" - Version 24.04.
  • "20.10" - Version 20.10.
  • "18.10" - Version 18.10.
Specify whether to use HTTP or HTTPS when connecting to the PDFCrowd API.

Using HTTP is insecure as data sent over HTTP is not encrypted. Enable this option only if you know what you are doing.

Specify the number of automatic retries when a 502 or 503 HTTP status code is received. The status code indicates a temporary network issue. This feature can be disabled by setting to 0.
No option name matches.
No option is changed.
«
Code
+
Preview
+
The "" image format is not supported by your browser.

Output Preview

- press
to run the conversion -