
Reviews

The amazon_reviews data source retrieves Amazon product review pages for an ASIN of your choice. For a response example with retrieved data, see the sample output file in HTML format or the structured data output.

The output data dictionary for Reviews offers a brief description, a screenshot, a parsed JSON code snippet, and a table defining each parsed field.

Request samples

In the code examples below, we make a request to retrieve product reviews for ASIN B08238V32L on the amazon.nl marketplace.

curl 'https://realtime.oxylabs.io/v1/queries' \
--user 'USERNAME:PASSWORD' \
-H 'Content-Type: application/json' \
-d '{
        "source": "amazon_reviews", 
        "domain": "nl", 
        "query": "B08238V32L",
        "parse": true
    }'

We use the synchronous Realtime integration method in our examples. If you would like to use the Proxy Endpoint or the asynchronous Push-Pull integration, refer to the integration methods section.
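For readers who prefer Python over curl, the same Realtime request can be sketched with the standard library alone. This is a minimal sketch, not an official client: the helper name build_request is hypothetical, and it assumes the endpoint accepts HTTP Basic authentication, which is what curl's --user option sends. The request is only built here; sending it is left commented out because it needs valid credentials and network access.

```python
import base64
import json
import urllib.request

API_URL = "https://realtime.oxylabs.io/v1/queries"  # endpoint taken from the curl sample


def build_request(username: str, password: str, payload: dict) -> urllib.request.Request:
    """Build (but do not send) a POST request equivalent to the curl sample."""
    # curl --user sends HTTP Basic credentials: base64("username:password")
    token = base64.b64encode(f"{username}:{password}".encode("utf-8")).decode("ascii")
    return urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Basic {token}",
        },
        method="POST",
    )


# Same payload as the curl sample above
payload = {
    "source": "amazon_reviews",
    "domain": "nl",
    "query": "B08238V32L",
    "parse": True,
}
request = build_request("USERNAME", "PASSWORD", payload)

# To send it (requires valid credentials and network access):
# with urllib.request.urlopen(request) as response:
#     result = json.load(response)
```

Keeping request construction separate from sending makes the payload easy to inspect or log before any credits are spent.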

Request parameter values

Generic

Basic setup and customization options for scraping Amazon product review pages.

| Parameter | Description | Default Value |
|---|---|---|
| source | Sets the scraper. | amazon_reviews |
| query | 10-symbol ASIN code. | - |
| render | Enables JavaScript rendering when set to html. More info. | - |
| parse | Returns parsed data when set to true. Explore output data dictionary. | false |
| callback_url | URL to your callback endpoint. More info. | - |
| user_agent_type | Device type and browser. The full list can be found here. | desktop |

- mandatory parameter
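The generic parameters can be assembled into a payload with a small helper. The sketch below is illustrative only: the function name build_generic_payload is hypothetical, and the ASIN check assumes that "10-symbol" means ten uppercase letters or digits, which the table does not state explicitly.

```python
import re

# Assumption: an ASIN is exactly 10 uppercase letters or digits
ASIN_PATTERN = re.compile(r"[A-Z0-9]{10}")


def build_generic_payload(asin: str, parse: bool = False, render_html: bool = False,
                          user_agent_type: str = "desktop") -> dict:
    """Build a payload from the generic parameters, using the documented defaults."""
    if not ASIN_PATTERN.fullmatch(asin):
        raise ValueError(f"expected a 10-symbol ASIN, got {asin!r}")
    payload = {
        "source": "amazon_reviews",          # sets the scraper
        "query": asin,                       # the ASIN to look up
        "parse": parse,                      # True returns parsed data
        "user_agent_type": user_agent_type,  # documented default: desktop
    }
    if render_html:
        payload["render"] = "html"           # enables JavaScript rendering
    return payload


example = build_generic_payload("B08238V32L", parse=True)
```

Validating the ASIN locally avoids spending a request on input that is obviously malformed.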

Localization

Adapt results to specific geographical locations, domains, and languages.

| Parameter | Description | Default Value |
|---|---|---|
| geo_location | The Deliver to location. See our guide to using this parameter here. | - |
| domain | Domain localization for Amazon. The full list of available domains can be found here. | com |
| locale | Accept-Language header value, which sets the interface language of the Amazon page. More info. | - |
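As a rough illustration of how the localization parameters combine, the sketch below adds them to an existing payload and leaves out any that are not set. The helper name with_localization is hypothetical, and the locale value "nl-NL" is only an example of an Accept-Language-style string; check the list of supported values before relying on it.

```python
from typing import Optional


def with_localization(payload: dict, domain: str = "com",
                      geo_location: Optional[str] = None,
                      locale: Optional[str] = None) -> dict:
    """Return a copy of payload with localization parameters applied."""
    localized = dict(payload)      # copy so the caller's payload is untouched
    localized["domain"] = domain   # documented default: com
    if geo_location is not None:
        localized["geo_location"] = geo_location  # the Deliver to location
    if locale is not None:
        localized["locale"] = locale              # Accept-Language header value
    return localized


base = {"source": "amazon_reviews", "query": "B08238V32L"}
# "nl-NL" is an illustrative value, not taken from the documentation
localized = with_localization(base, domain="nl", locale="nl-NL")
```

Omitting unset optional keys lets the service apply its own defaults instead of receiving explicit nulls.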

Pagination and sorting

Controls for managing the pagination, retrieval, and sorting of reviews.

| Parameter | Description | Default Value |
|---|---|---|
| start_page | Starting page number. | 1 |
| pages | Number of pages to retrieve. | 1 |
| context: sort_by | Sort reviews by top, recent or helpful. | recent |

Sorting behavior

  • Top: high-value reviews, no pagination support;

  • Helpful: similar to top with pagination support;

  • Recent: default, chronological order with pagination.

Content limitations

  • 8 reviews per API page (vs. 10 in Amazon's UI);

  • The response includes only pages with actual reviews. For example, if you request 5 pages but only 10 reviews exist, the response will contain only pages 1 and 2.
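The arithmetic behind the example above can be sketched as follows. This assumes that reviews fill API pages sequentially at 8 per page; the function name pages_returned is hypothetical.

```python
import math

REVIEWS_PER_API_PAGE = 8  # per the content limitations above


def pages_returned(requested_pages: int, total_reviews: int, start_page: int = 1) -> list:
    """List the page numbers a response would contain, skipping empty pages."""
    # Number of pages that actually hold reviews
    available = math.ceil(total_reviews / REVIEWS_PER_API_PAGE)
    # The last page is limited by both the request and the available pages
    last_page = min(available, start_page + requested_pages - 1)
    return list(range(start_page, last_page + 1))


# 5 pages requested, 10 reviews exist: only pages 1 and 2 hold reviews
result = pages_returned(requested_pages=5, total_reviews=10)
```

With 10 reviews, page 1 holds 8 and page 2 holds the remaining 2, so pages 3 to 5 are left out of the response.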

Code example
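The following is a minimal sketch of a payload that combines pagination with sorting. It rests on one assumption that this page does not confirm: that context parameters are sent as a list of key/value objects under a context field. The helper name build_paginated_payload is hypothetical, and rejecting pagination for top sorting is a local guard based on the sorting behavior described above, not a documented API error.

```python
SORT_OPTIONS = {"top", "recent", "helpful"}


def build_paginated_payload(asin: str, start_page: int = 1, pages: int = 1,
                            sort_by: str = "recent") -> dict:
    """Build a payload with pagination and sorting controls."""
    if sort_by not in SORT_OPTIONS:
        raise ValueError(f"sort_by must be one of {sorted(SORT_OPTIONS)}")
    # Per the sorting behavior notes, top has no pagination support
    if sort_by == "top" and (start_page != 1 or pages != 1):
        raise ValueError("top sorting does not support pagination")
    return {
        "source": "amazon_reviews",
        "query": asin,
        "start_page": start_page,
        "pages": pages,
        # Assumption: context parameters are passed as key/value objects
        "context": [{"key": "sort_by", "value": sort_by}],
    }


paginated = build_paginated_payload("B08238V32L", start_page=2, pages=3, sort_by="helpful")
```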

Structured data

Below you can find a structured output example for amazon_reviews.


Data dictionary

HTML Sample

JSON structure

The amazon_reviews structured output provides a collection of user-generated reviews for a specific product listed on Amazon. The table below lists each field we parse, along with its description and data type. The table also includes some metadata fields.

| Key | Description | Type |
|---|---|---|
| url | The URL of the Amazon reviews page for a selected product. | string |
| asin | Amazon Standard Identification Number. | string |
| page | The current page number. | integer |
| pages | Number of pages to retrieve. | integer |
| reviews | A list of reviews with their respective details. | array |
| page_type | The type of the Amazon page. | string |
| asin_in_url | Amazon Standard Identification Number retrieved from the URL. | string |
| product_name | The name of the Amazon product. | string |
| parse_status_code | The status code of the parsing job. You can see the parser status codes described here. | integer |
| created_at | The timestamp when the scraping job was created. | timestamp |
| updated_at | The timestamp when the scraping job was finished. | timestamp |
| job_id | The ID of the scraping job. | string |
| status_code | The status code of the scraping job. You can see the scraper status codes described here. | integer |
| parser_type | The type of parser used for parsing the data. | string |
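To show how the top-level fields might be consumed, here is a small sketch that summarizes one parsed page. It assumes the parsed fields arrive as a flat dictionary keyed exactly as in the table; how that dictionary is nested inside the full API response is not covered here. The function name summarize_page is hypothetical, and the sample values are placeholders rather than real output.

```python
def summarize_page(parsed: dict) -> dict:
    """Collect a few top-level fields from one parsed reviews page."""
    reviews = parsed.get("reviews", [])
    return {
        "asin": parsed.get("asin"),
        "page": parsed.get("page"),
        "pages": parsed.get("pages"),
        "review_count": len(reviews),
        # True when the requested ASIN matches the one found in the page URL
        "asin_matches_url": parsed.get("asin") == parsed.get("asin_in_url"),
    }


# Placeholder data shaped like the table above, not real output
sample_page = {
    "asin": "B08238V32L",
    "asin_in_url": "B08238V32L",
    "page": 1,
    "pages": 1,
    "reviews": [{"id": "EXAMPLE_ID_1"}, {"id": "EXAMPLE_ID_2"}],
}
summary = summarize_page(sample_page)
```

Using dict.get keeps the summary from failing when a field is missing from a particular page.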

In the following sections, parsed JSON code snippets are shortened where more than one item for the result type is available.

Reviews

reviews contains multiple reviews submitted by users for a particular product, each with its own details such as title, author, rating, content, timestamp, and verification status. Additionally, each review may include product attributes.

| Key (reviews) | Description | Type |
|---|---|---|
| id | An alphanumeric identification for the Amazon Review. | string |
| title | The rating and the title of the review. | string |
| author | The user by whom the review has been submitted. | string |
| rating | The star rating assigned when submitting the review. | integer |
| content | The content of the review. | string |
| timestamp | Identifies when the review has been submitted. | timestamp |
| profile_id | An identification number assigned to the user who left the review. | string |
| is_verified | Identifies whether the user has bought the product via Amazon marketplace. | boolean |
| helpful_count | Number of helpful votes received for the review. | integer |
| product_attributes | Identifies the characteristics of the product. | string |
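Once the reviews array is available, typical post-processing is straightforward. The sketch below filters verified reviews, averages ratings, and picks the review with the most helpful votes, using only the keys listed in the table. The function names are hypothetical and the sample reviews are invented placeholders, not real output.

```python
from statistics import mean


def verified_reviews(reviews: list) -> list:
    """Keep only reviews whose is_verified flag is true."""
    return [r for r in reviews if r.get("is_verified")]


def average_rating(reviews: list):
    """Mean star rating, or None when no integer ratings are present."""
    ratings = [r["rating"] for r in reviews if isinstance(r.get("rating"), int)]
    return mean(ratings) if ratings else None


def most_helpful(reviews: list):
    """Review with the highest helpful_count, or None for an empty list."""
    return max(reviews, key=lambda r: r.get("helpful_count", 0), default=None)


# Invented placeholder reviews shaped like the table above
sample_reviews = [
    {"id": "EXAMPLE_ID_1", "rating": 5, "is_verified": True, "helpful_count": 2},
    {"id": "EXAMPLE_ID_2", "rating": 3, "is_verified": False, "helpful_count": 7},
]
verified = verified_reviews(sample_reviews)
overall = average_rating(sample_reviews)
top_review = most_helpful(sample_reviews)
```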
