The google_search source is designed to retrieve Google Search results (SERPs). This sub-page specifically presents data related to Google News Search. To explore other result types, read here: Web Search, Image Search.
To scrape Google News search, include the context:udm parameter with value set to 12 or context:tbm parameter with the value set to nws.
Explore output data dictionary for each News SERP feature, offering a brief description, screenshot, parsed JSON code snippet, and a table defining each parsed field. Navigate through the details using the right-side navigation or scrolling down the page.
Request samples
In the examples below, we make a request to obtain News search result pages for the search term adidas on the google.nl domain.
Basic setup and customization options for scraping Google News search results.
- mandatory parameter
- udm and tbm context parameters cannot be used together in a single scraping request; please select one of them. Using both simultaneously may lead to conflicts or unexpected behavior.
Google Advanced Search Operators
When scraping, you might find it useful to combine Google advanced search operators with your query. It enables you to customize the scope of the search, ensuring that the results are more relevant and focused. Explore these special commands here and here. See an example below.
Options to filter and refine search results based on various criteria.
Other
Additional advanced settings and controls for specialized requirements.
Context parameters
All context parameters should be added to the context array as objects with key and value pairs, e.g.:
..."context": [ {"key":"filter","value":"0" }]...
Structured data
SERP Scraper API is capable of extracting either an HTML or JSON object that contains Google search results, offering structured data on various elements of the results page.
google_search news structured output
{"results": [ {"content": {"url":"https://www.google.com/search?q=adidas&tbm=nws&uule=w+CAIQICINdW5pdGVkIHN0YXRlcw&gl=us&hl=en","page":1,"results": {"main": [ {"url":"https://www.cnn.com/2022/05/06/business/under-armour-stock-adidas-nike/index.html", "desc": "Tripped-up supply chains and a coronavirus surge in China are causing \nheadaches for top athletic brands.",
"title":"Wall Street is fed up with Under Armour, Nike and Adidas","source":"CNN","pos_overall":1,"relative_publish_date":"2 days ago" }, ... { "url": "https://www.cnbc.com/2022/05/06/dsw-tests-layout-to-spotlight-brands-like-adidas-crocs-birkenstock.html",
"desc": "DSW is trying out a new store look and layout at a location opening this \nweekend in Houston, in an attempt to focus customers' attention on...",
"title": "DSW is testing a store layout that puts the spotlight on brands like \nAdidas, Crocs and Birkenstock",
"source":"CNBC","pos_overall":10,"relative_publish_date":"2 days ago" } ],"total_results_count":57300000 },"parse_status_code":12000 },"created_at":"2022-05-09 07:25:03","updated_at":"2022-05-09 07:25:07","page":1,"url":"https://www.google.com/search?q=adidas&tbm=nws&uule=w+CAIQICINdW5pdGVkIHN0YXRlcw&gl=us&hl=en","job_id":"6929330379711060993","status_code":200,"parser_type":"v2" } ]}
We only parse news search results for desktop searches.
Output data dictionary
HTML example
JSON structure
The Google News Search structured output includes fields like URL, page, results, and others. The table below presents a detailed list of each SERP feature we parse, along with its description and data type. The table also includes some metadata.
The number of items and fields for a specific result type may vary depending on the search query.
In the following sections, parsed JSON code snippets are shortened where more than one item for the result type is available.
Main
Displays a list of unpaid news results, providing relevant details for each article.
..."main": [ {"url":"https://www.yahoo.com/lifestyle/tiger-woods-nikes-epic-partnership-015311819.html","desc":"That there could ever be a world in which Tiger Woods wasn't sponsored by \nNike seemed...","title":"How Tiger Woods and Nike's Epic Partnership Fell Apart","source":"Yahoo","pos_overall":1,"relative_publish_date":"1 day ago" }, ...},...
Additional
Presents a list of trending articles, accompanied by relevant details.
..."additional": [ {"items": [ {"pos":1,"url":"https://www.complex.com/sneakers/a/brendan-dunne/nike-book-1-colorways-haven-hike-rattlesnake","title":"Nike Book 1 Colorways Haven Hike Rattlesnake","source":"Complex","relative_publish_date":"1 day ago" }, ... ],"pos_overall":2,"section_title":"Devin Booker confirms issues with Nike Book 1 launch" }...
Device type and browser. The full list can be found here.
desktop
geo_location
The geographical location that the result should be adapted for. Using this parameter correctly is extremely important to get the right data. For more information, read about our suggested geo_location parameter structures here.
-
domain
Domain localization for Google. The full list of available domains can be found here.
com
locale
Accept-Language header value which changes your Google search page web interface language. More info.
-
start_page
Starting page number.
1
pages
Number of pages to retrieve.
1
limit
Number of results to retrieve in each page.
10
context:
limit_per_page
If you want to scrape multiple pages with the same IP address, include a JSON array and specify the page numbers using the page key. You must also indicate the number of organic results on each page by adding a limit key. See example.
-
page
The number of the page you would like to scrape. Any integer value greater than 0 will work
1
limit
The number of results on the page in question. Any integer value between 1 and 100 (inclusive) will work.
90
context:safe_search
Safe search. Set to true to enable it.
false
context:
tbs
tbs parameter. This parameter is like a container for more obscure google parameters, like limiting/sorting results by date as well as other filters some of which depend on the tbm parameter (e.g. tbs=app_os:1 is only available with tbm value app). More info here.
-
context:
nfpr
true will turn off spelling auto-correction
false
url
The URL of the Google search page.
string
results
A dictionary containing the results of the search.
array
results.main
A list of unpaid news results with their respective details.
array
results.additional
A list of trending articles with their respective details.
object
results.total_results_count
The total number of results found for the search query.
array
parse_status_code
The status code of the parsing job. You can see the parser status codes described here.
integer
created_at
The timestamp when the scraping job was created.
timestamp
updated_at
The timestamp when the scraping job was finished.
timestamp
page
Page number relative to the Google SERP pagination.
integer
job_id
The ID of the job associated with the scraping job.
string
status_code
The status code of the scraping job. You can see the scraper status codes described here.
integer
url
The URL to the full article.
string
desc
A short excerpt from the full article.
string
title
The title of the article.
string
source
The name of the website where the article is published.
string
pos_overall
Indicates the overall position of the result within the main results of News SERP.
integer
relative_publish_date
Describes how long ago the article was published.
string
items
A list of articles with their respective details.
array
items.pos
A unique indicator denoting the article position in the list.
integer
items.url
The URL to the full article.
string
items.title
The title of the article.
string
items.source
The name of the website where the article is published.
string
items.relative_publish_date
Describes how long ago the article was published.
string
pos_overall
Indicates the overall position of the result within the additional results of News SERP.