Other Search Engines

Scrape other search engines (Yahoo, Yandex, DuckDuckGo, Baidu, etc.) with our universal source. It accepts URLs along with additional parameters.

Request samples

In the example below, we make a request to retrieve a result for the provided Baidu URL.

curl 'https://realtime.oxylabs.io/v1/queries' \
--user 'USERNAME:PASSWORD' \
-H 'Content-Type: application/json' \
-d '{
        "source": "universal",
        "url": "https://www.baidu.com/s?ie=utf-8&f=8&rsv_bp=1&rsv_idx=1&ch=&tn=baidu&bar=&wd=adidas"
    }'
Output example
{
    "results": [
        {
            "content": "<!DOCTYPE html>
            CONTENT
            </html>",
            "created_at": "2024-07-01 08:48:59",
            "updated_at": "2024-07-01 08:49:02",
            "page": 1,
            "url": "https://www.baidu.com/s?ie=utf-8&f=8&rsv_bp=1&rsv_idx=1&ch=&tn=baidu&bar=&wd=adidas",
            "job_id": "7213463587904304129",
            "status_code": 200,
            "session_info": {
                "id": null,
                "expires_at": null,
                "remaining": null
            }
        }
    ]
}

We use synchronous Realtime integration method in our examples. If you would like to use Proxy Endpoint or asynchronous Push-Pull integration, refer to the integration methods section.

Request parameter values

Generic

Basic setup and customization options for scraping other search engines.

- mandatory parameter

Additional

These are the parameters of our features.

Payload sample

In this sample, we include some parameters from the table above. Note that while these parameters are not always necessary or compatible within the same request, they illustrate how to format your requests.

{
    "source": "universal", 
    "url": "https://example.com", 
    "user_agent_type": "desktop",
    "geo_location": "United States",
    "context": [
        {
            "key": "headers", 
            "value": {
                "Content-Type": "application/octet-stream", 
                "Custom-Header-Name": "custom header content"
            }
        }, 
        {
            "key": "cookies", 
            "value": [
                {
                    "key": "NID", 
                    "value": "1234567890"
                },
                {
                    "key": "1P JAR",
                    "value": "0987654321"
                }
            ]
        },
        {
            "key": "follow_redirects",
            "value": true
        },
        {
            "key": "http_method", 
            "value": "post"
        },
        {
            "key": "content",
            "value": "base64EncodedPOSTBody"
        },
        {
            "key": "successful_status_codes",
            "value": [303, 808, 909]
        }
    ],
    "parsing_instructions": {
        "title": {
            "_fns": [
                {
                    "_fn": "xpath_one",
                    "_args": ["//h1/text()"]
                }
            ]
        }
    }
}

Last updated