Last updated
Was this helpful?
Last updated
Was this helpful?
Realtime is a synchronous integration method. It requires keeping the connection open until the job is finished successfully or returns an error.
The Realtime API endpoint for job submission is:
Provide the job parameters in a JSON payload as shown in the examples below. Python and PHP examples include comments for clarity.
Realtime API supports these result types in the output:
HTML: The raw HTML content scraped from the target webpage;
JSON: Structured data parsed from the HTML content, formatted in JSON format;
PNG: Base64-encoded screenshot of the rendered page in PNG format.
This table explains the default and other available result types based on the parameters included in the payload of the API request.
Realtime API always returns the default output. To get other available outputs from the same realtime job, use with that job ID. Note that the job ID can be found in the headers of each realtime response as well as within the response body.
x
x
html
html
html
x
html
html
png
x
png
html, png
x
true
json
html, json
html
true
json
html, json
png
true
png
html, json, png