Constructor
function PdfToTextClient(userName, apiKey)
Constructor for the PDFCrowd API client.
- Parameters:
-
-
userName
- Your username at PDFCrowd. -
apiKey
- Your API key.
-
Conversion Input
function convertUrl(url, callbacks)
Convert a PDF.
- Parameters:
-
-
url
- The address of the PDF to convert.- Constraint:
-
- Supported protocols are http:// and https://.
-
callbacks
- The object that defines the following functions:-
data(readStream)
- called when the output data can be read from readStream -
error(message, statusCode)
- called when an error occurs -
end()
- called when the conversion finishes
-
saveToFile(filePath[, callback])
- saves the output data to a file- filePath - the output file path
- callback(err, filePath) - called when the conversion finishes
-
sendGenericHttpResponse(response, contentType, fileName[, disposition])
- sends the generated output in an HTTP response- response - the response object
- contentType - the response content type
- fileName - the desired file name
- disposition - the response content disposition, can be "attachment" or "inline", the default is "attachment".
-
-
function convertUrlToFile(url, filePath, callback)
Convert a PDF and write the result to a local file.
- Parameters:
-
-
url
- The address of the PDF to convert.- Constraint:
-
- Supported protocols are http:// and https://.
-
filePath
- The output file path. -
callback
- Thecallback(error, filePath)
function is called when the conversion finishes. The error object is present if an error occurred, filePath is the output file path.
-
function convertFile(file, callbacks)
Convert a local file.
- Parameters:
-
-
file
- The path to a local file to convert.
- Constraint:
-
- The file must exist and not be empty.
-
callbacks
- The object that defines the following functions:-
data(readStream)
- called when the output data can be read from readStream -
error(message, statusCode)
- called when an error occurs -
end()
- called when the conversion finishes
-
saveToFile(filePath[, callback])
- saves the output data to a file- filePath - the output file path
- callback(err, filePath) - called when the conversion finishes
-
sendGenericHttpResponse(response, contentType, fileName[, disposition])
- sends the generated output in an HTTP response- response - the response object
- contentType - the response content type
- fileName - the desired file name
- disposition - the response content disposition, can be "attachment" or "inline", the default is "attachment".
-
-
function convertFileToFile(file, filePath, callback)
Convert a local file and write the result to a local file.
- Parameters:
-
-
file
- The path to a local file to convert.
- Constraint:
-
- The file must exist and not be empty.
-
filePath
- The output file path. -
callback
- Thecallback(error, filePath)
function is called when the conversion finishes. The error object is present if an error occurred, filePath is the output file path.
-
function convertRawData(data, callbacks)
Convert raw data.
- Parameters:
-
-
data
(byte[]) - The raw content to be converted. -
callbacks
- The object that defines the following functions:-
data(readStream)
- called when the output data can be read from readStream -
error(message, statusCode)
- called when an error occurs -
end()
- called when the conversion finishes
-
saveToFile(filePath[, callback])
- saves the output data to a file- filePath - the output file path
- callback(err, filePath) - called when the conversion finishes
-
sendGenericHttpResponse(response, contentType, fileName[, disposition])
- sends the generated output in an HTTP response- response - the response object
- contentType - the response content type
- fileName - the desired file name
- disposition - the response content disposition, can be "attachment" or "inline", the default is "attachment".
-
-
function convertRawDataToFile(data, filePath, callback)
Convert raw data to a file.
- Parameters:
-
-
data
(byte[]) - The raw content to be converted. -
filePath
- The output file path. -
callback
- Thecallback(error, filePath)
function is called when the conversion finishes. The error object is present if an error occurred, filePath is the output file path.
-
function convertStream(inStream, callbacks)
Convert the contents of an input stream.
- Parameters:
-
-
inStream
(InputStream) - The input stream with source data.
-
callbacks
- The object that defines the following functions:-
data(readStream)
- called when the output data can be read from readStream -
error(message, statusCode)
- called when an error occurs -
end()
- called when the conversion finishes
-
saveToFile(filePath[, callback])
- saves the output data to a file- filePath - the output file path
- callback(err, filePath) - called when the conversion finishes
-
sendGenericHttpResponse(response, contentType, fileName[, disposition])
- sends the generated output in an HTTP response- response - the response object
- contentType - the response content type
- fileName - the desired file name
- disposition - the response content disposition, can be "attachment" or "inline", the default is "attachment".
-
-
function convertStreamToFile(inStream, filePath, callback)
Convert the contents of an input stream and write the result to a local file.
- Parameters:
-
-
inStream
(InputStream) - The input stream with source data.
-
filePath
- The output file path. -
callback
- Thecallback(error, filePath)
function is called when the conversion finishes. The error object is present if an error occurred, filePath is the output file path.
-
General Options
function setPdfPassword(password)
The password to open the encrypted PDF file.
- Parameter:
-
-
password
- The input PDF password.
-
function setPrintPageRange(pages)
Set the page range to print.
- Parameter:
-
-
pages
- Constraint:
-
- A comma separated list of page numbers or ranges.
-
- Examples:
-
-
Just the second page is printed:
setPrintPageRange("2")
-
The first and the third page are printed:
setPrintPageRange("1,3")
-
Everything except the first page is printed:
setPrintPageRange("2-")
-
Just first 3 pages are printed:
setPrintPageRange("-3")
-
Pages 3, 6, 7, 8 and 9 are printed:
setPrintPageRange("3,6-9")
-
Just the second page is printed:
function setNoLayout(value)
Ignore the original PDF layout.
- Parameter:
-
-
value
(bool) - Set totrue
to ignore the layout.- Default:
-
false
-
function setEol(eol)
The end-of-line convention for the text output.
- Parameter:
-
-
eol
- Allowed Values:
-
-
unix
— Unix convension "LF" is used. -
dos
— Dos convension "CR LF" is used. -
mac
— Mac convension "CR" is used.
-
- Default:
-
unix
-
function setPageBreakMode(mode)
Specify the page break mode for the text output.
- Parameter:
-
-
mode
- Allowed Values:
-
-
none
— No page breaks are inserted. -
default
— The standard page break code "FF" is used. -
custom
— A custom page break is used.
-
- Default:
-
none
-
function setCustomPageBreak(pageBreak)
Specify the custom page break.
- Parameter:
-
-
pageBreak
- String to insert between the pages.
-
- Examples:
-
-
Clear text between pages:
setCustomPageBreak("END OF PAGE")
-
Visual separator with line break
setCustomPageBreak("----my page break---- ")
-
Clear text between pages:
function setParagraphMode(mode)
Specify the paragraph detection mode.
- Parameter:
-
-
mode
- Allowed Values:
-
-
none
— No paragraph detection. -
bounding-box
— Paragraph detection based on line bounding boxes. -
characters
— Paragraph detection based on the number of characters in the line.
-
- Default:
-
none
-
function setLineSpacingThreshold(threshold)
Set the maximum line spacing when the paragraph detection mode is enabled.
- Parameter:
-
-
threshold
- Constraint:
-
- The value must be a positive integer percentage.
- Default:
-
10%
-
function setRemoveHyphenation(value)
Remove the hyphen character from the end of lines.
- Parameter:
-
-
value
(bool) - Set totrue
to remove hyphens.- Default:
-
false
-
function setRemoveEmptyLines(value)
Remove empty lines from the text output.
- Parameter:
-
-
value
(bool) - Set totrue
to remove empty lines.- Default:
-
false
-
function setCropAreaX(x)
Set the top left X coordinate of the crop area in points.
- Parameter:
-
-
x
(int)- Constraint:
-
- Must be a positive integer or 0.
-
- Example:
-
-
Start extraction at 1.4 inches from left:
setCropAreaX(100)
-
Start extraction at 1.4 inches from left:
function setCropAreaY(y)
Set the top left Y coordinate of the crop area in points.
- Parameter:
-
-
y
(int)- Constraint:
-
- Must be a positive integer or 0.
-
- Example:
-
-
Start extraction at 1.4 inches from top:
setCropAreaY(100)
-
Start extraction at 1.4 inches from top:
function setCropAreaWidth(width)
Set the width of the crop area in points.
- Parameter:
-
-
width
(int)- Constraint:
-
- Must be a positive integer or 0.
- Default:
-
PDF page width.
-
- Example:
-
-
Extract narrow 1.4 inch width:
setCropAreaWidth(100)
-
Extract narrow 1.4 inch width:
function setCropAreaHeight(height)
Set the height of the crop area in points.
- Parameter:
-
-
height
(int)- Constraint:
-
- Must be a positive integer or 0.
- Default:
-
PDF page height.
-
- Example:
-
-
Extract small 1.4 inch height:
setCropAreaHeight(100)
-
Extract small 1.4 inch height:
function setCropArea(x, y, width, height)
Set the crop area. It allows to extract just a part of a PDF page.
- Parameters:
-
-
x
(int) - Set the top left X coordinate of the crop area in points.- Constraint:
-
- Must be a positive integer or 0.
-
y
(int) - Set the top left Y coordinate of the crop area in points.- Constraint:
-
- Must be a positive integer or 0.
-
width
(int) - Set the width of the crop area in points.- Constraint:
-
- Must be a positive integer or 0.
- Default:
-
PDF page width.
-
height
(int) - Set the height of the crop area in points.- Constraint:
-
- Must be a positive integer or 0.
- Default:
-
PDF page height.
-
Miscellaneous
function setDebugLog(value)
Turn on the debug logging. Details about the conversion are stored in the debug log. The URL of the log can be obtained from the getDebugLogUrl method or available in conversion statistics.
- Parameter:
-
-
value
(bool) - Set totrue
to enable the debug logging.- Default:
-
false
-
function getDebugLogUrl() { return string; }
Get the URL of the debug log for the last conversion.
- Returns:
- string - The link to the debug log.
function getRemainingCreditCount() { return int; }
Get the number of conversion credits available in your account.
This method can only be called after a call to one of the convertXtoY methods.
The returned value can differ from the actual count if you run parallel conversions.
The special value 999999
is returned if the information is not available.
- Returns:
- int - The number of credits.
function getConsumedCreditCount() { return int; }
Get the number of credits consumed by the last conversion.
- Returns:
- int - The number of credits.
function getJobId() { return string; }
Get the job id.
- Returns:
- string - The unique job identifier.
function getPageCount() { return int; }
Get the number of pages in the output document.
- Returns:
- int - The page count.
function getOutputSize() { return int; }
Get the size of the output in bytes.
- Returns:
- int - The count of bytes.
function getVersion() { return string; }
Get the version details.
- Returns:
- string - API version, converter version, and client version.
function setTag(tag)
Tag the conversion with a custom value. The tag is used in conversion statistics. A value longer than 32 characters is cut off.
- Parameter:
-
-
tag
- A string with the custom tag.
-
- Example:
-
-
Track job in analytics:
setTag("client-1234")
-
Track job in analytics:
function setHttpProxy(proxy)
A proxy server used by the conversion process for accessing the source URLs with HTTP scheme. It can help to circumvent regional restrictions or provide limited access to your intranet.
- Parameter:
-
-
proxy
- Constraint:
-
- The value must have format DOMAIN_OR_IP_ADDRESS:PORT.
-
- Examples:
-
-
Corporate proxy server:
setHttpProxy("myproxy.com:8080")
-
Direct IP proxy connection:
setHttpProxy("113.25.84.10:33333")
-
Corporate proxy server:
function setHttpsProxy(proxy)
A proxy server used by the conversion process for accessing the source URLs with HTTPS scheme. It can help to circumvent regional restrictions or provide limited access to your intranet.
- Parameter:
-
-
proxy
- Constraint:
-
- The value must have format DOMAIN_OR_IP_ADDRESS:PORT.
-
- Examples:
-
-
Secure proxy for HTTPS:
setHttpsProxy("myproxy.com:443")
-
Direct secure proxy IP:
setHttpsProxy("113.25.84.10:44333")
-
Secure proxy for HTTPS:
API Client Options
function setUseHttp(value)
Specify whether to use HTTP or HTTPS when connecting to the PDFCrowd API.
- Parameter:
-
-
value
(bool) - Set totrue
to use HTTP.- Default:
-
false
-
function setClientUserAgent(agent)
Specifies the User-Agent HTTP header that the client library will use when interacting with the API.
- Availability:
- API client >= 6.4.0 See versioning.
- Parameter:
-
-
agent
- The user agent string.
-
function setUserAgent(agent)
Set a custom user agent HTTP header. It can be useful if you are behind a proxy or a firewall.
- Parameter:
-
-
agent
- The user agent string.- Default:
-
pdfcrowd_nodejs_client/6.5.2 (https://pdfcrowd.com)
-
function setProxy(host, port, userName, password)
Specifies an HTTP proxy that the API client library will use to connect to the internet.
- Parameters:
-
-
host
- The proxy hostname. -
port
(int) - The proxy port. -
userName
- The username. -
password
- The password.
-
function setRetryCount(count)
Specifies the number of automatic retries when the 502 or 503 HTTP status code is received. The status code indicates a temporary network issue. This feature can be disabled by setting to 0.
- Parameter:
-
-
count
(int) - Number of retries.- Default:
-
1
-
- Example:
-
-
Retry failed requests three times:
setRetryCount(3)
-
Retry failed requests three times: