# Web Scraper API

[**Web Scraper API**](https://oxylabs.io/products/scraper-api/web) is an **all-in-one web data collection solution** designed to extract real-time data from any public website at scale. It covers every stage of web scraping, from crawling URLs and improving access success rates to parsing data and delivering it to your preferred storage location, so you do not need to manage proxies, request handling, or infrastructure. The tool is designed to meet enterprise-grade security standards, including SOC 2 Type II compliance, and provides adaptive infrastructure that adjusts dynamically to target websites, ensuring high success rates and reliable data extraction across search engines, e-commerce sites, travel platforms, and more.

## Getting started

**Create your API user credentials**: Sign up for a free trial or purchase the product in the [**Oxylabs dashboard**](https://dashboard.oxylabs.io/en/registration) to create your API user credentials (`USERNAME` and `PASSWORD`).

### Request sample

The API automatically manages proxy rotation, request retries, and bot-detection systems as part of its integrated infrastructure, so a single request is enough to retrieve all structured data. Below you will find sample cURL requests. For examples in other programming languages, refer to the relevant sections: [**Amazon**](/api-targets/cn/dian-zi-shang-wu/amazon.md), [**Google**](/api-targets/cn/sou-suo-yin-qing/google.md), [**Other websites**](/api-targets/cn/ren-yi-yu-ming.md).

{% tabs %}
{% tab title="Amazon" %}

```bash
curl 'https://realtime.oxylabs.io/v1/queries' \
--user 'USERNAME:PASSWORD' \
-H 'Content-Type: application/json' \
-d '{
        "source": "amazon_product",
        "query": "B07FZ8S74R",
        "geo_location": "90210",
        "parse": true
    }'
```

{% endtab %}

{% tab title="Google" %}

```bash
curl 'https://realtime.oxylabs.io/v1/queries' \
--user 'USERNAME:PASSWORD' \
-H 'Content-Type: application/json' \
-d '{
        "source": "google_search",
        "query": "adidas",
        "geo_location": "California,United States",
        "parse": true
    }'
```

{% endtab %}

{% tab title="Other" %}

```bash
curl 'https://realtime.oxylabs.io/v1/queries' \
--user 'USERNAME:PASSWORD' \
-H 'Content-Type: application/json' \
-d '{
        "source": "universal",
        "url": "https://sandbox.oxylabs.io/"
    }'
```

{% endtab %}
{% endtabs %}

We use the synchronous [**Realtime**](/products/cn/web-scraper-api/integration-methods/realtime.md) integration method in our examples. If you would like to use [**Proxy Endpoint**](/products/cn/web-scraper-api/integration-methods/proxy-endpoint.md) or the asynchronous [**Push-Pull**](/products/cn/web-scraper-api/integration-methods/push-pull.md) integration, refer to the [**Integration methods**](/products/cn/web-scraper-api/integration-methods.md) section.

{% file src="/files/8365b1362a7a5614a3b24348eef8a863a5a2f564" %}

{% file src="/files/b0ad6ed9e7d782f0549a12e6f6ad11e8af390529" %}

### Request parameter values

1. **`source`** - This parameter sets the scraper that will be used to process your request.
2. **`url`** or **`query`** - Provide the `url` or `query` that matches the type of page you want to scrape. Refer to the table below and the corresponding target sub-pages to see when to use each parameter.
3. Optionally, you can include additional parameters such as `geo_location`, `user_agent_type`, `parse` (see our list of parsers [**here**](/products/cn/web-scraper-api/features/result-processing-and-storage/dedicated-parsers.md)), `render`, and more to customize your scraping request. Learn more: [**Features**](/products/cn/web-scraper-api/features.md).

Items 1 and 2 are mandatory parameters.

### Scraping with URLs or parametrized inputs

Oxylabs supports two broad types of input: URLs and parametrized inputs such as queries, product IDs, or video IDs. Targets without a dedicated source can use the `universal` source (see [Generic target](/api-targets/cn/ren-yi-yu-ming.md)).

| Target | Source (scraping a URL) | Source (using a query, product ID, or video ID) |
| --- | --- | --- |
| Amazon | `amazon` | `amazon_product`, `amazon_search`, `amazon_pricing`, `amazon_sellers`, `amazon_bestsellers` |
| Google | `google` | `google_search`, `google_ads`, `google_ai_mode`, `google_lens`, `google_maps`, `google_travel_hotels`, `google_trends_explore`, `google_shopping_product`, `google_shopping_search` |
| Bing | `bing` | `bing_search` |
| YouTube | `universal` | `youtube_search`, `youtube_search_max`, `youtube_video_trainability`, `youtube_download`, `youtube_subtitles`, `youtube_metadata`, `youtube_channel`, `youtube_autocomplete` |
| ChatGPT | `universal` | `chatgpt` |
| Perplexity | `universal` | `perplexity` |
| Walmart | `walmart` | `walmart_search`, `walmart_product` |
| TikTok | `universal` | `tiktok_shop_search`, `tiktok_shop_product` |
| eBay | `ebay` | `ebay_search`, `ebay_product` |
| Etsy | `etsy` | `etsy_search`, `etsy_product` |
| Best Buy | `universal` | `bestbuy_search`, `bestbuy_product` |
| Bed Bath & Beyond | `bedbathandbeyond` | `bedbathandbeyond_search`, `bedbathandbeyond_product` |
| Bodega Aurrerá | `bodegaaurrera` | `bodegaaurrera_search`, `bodegaaurrera_product` |
| Instacart | `instacart` | `instacart_search`, `instacart_product` |
| Kroger | `kroger` | `kroger_search`, `kroger_product` |
| Lowe's | `lowes` | `lowes_search`, `lowes_product` |
| Publix | `publix` | `publix_search`, `publix_product` |
| Target | `target` | `target_search`, `target_product`, `target_category` |
| Grainger | `grainger` | `grainger_search`, `grainger_product` |
| Costco | `costco` | `costco_search`, `costco_product` |
| Menards | `menards` | `menards_search`, `menards_product` |
| Petco | `universal` | `petco_search` |
| Staples | `universal` | `staples_search` |
| Allegro | `universal` | `allegro_search`, `allegro_product` |
| Idealo | `universal` | `idealo_search` |
| MediaMarkt | `mediamarkt` | `mediamarkt_search`, `mediamarkt_product` |
| Cdiscount | `cdiscount` | `cdiscount_search`, `cdiscount_product` |
| Alibaba | `alibaba` | `alibaba_search`, `alibaba_product` |
| AliExpress | `aliexpress` | `aliexpress_search`, `aliexpress_product` |
| IndiaMART | `indiamart` | `indiamart_search`, `indiamart_product` |
| Avnet | `universal` | `avnet_search` |
| Lazada | `lazada` | `lazada_search`, `lazada_product` |
| Rakuten | `universal` | `rakuten_search` |
| Tokopedia | `universal` | `tokopedia_search` |
| Flipkart | `flipkart` | `flipkart_search`, `flipkart_product` |
| MercadoLibre | `universal` | `mercadolibre_search` |
| Mercado Livre | `universal` | `mercadolivre_search` |
| Magazine Luiza | `magazineluiza` | `magazineluiza_search`, `magazineluiza_product` |
| Falabella | `falabella` | `falabella_search`, `falabella_product` |
| Dcard | `universal` | `dcard_search` |
| Airbnb | `airbnb` | `airbnb_product` |
| Zillow | `zillow` | The `query` parameter is not supported |
| Other websites | `universal` | The `query` parameter is not supported |
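The choice between `url` and `query` described above can be sketched as a small shell helper. This is only an illustration under stated assumptions: `build_payload` is a hypothetical function, not part of any Oxylabs tooling, and it simply treats any input starting with `http://` or `https://` as a URL. It does not check that the chosen source actually accepts that input type, and it does not escape quotes or backslashes in the input.

```bash
#!/usr/bin/env bash

# Hypothetical helper (not an Oxylabs tool): prints a JSON request body
# for the given source and a single input value.
build_payload() {
  local source="$1" input="$2" key

  # Assumption: inputs that look like web addresses go into "url";
  # anything else (search term, product ID, video ID) goes into "query".
  case "$input" in
    http://*|https://*) key="url" ;;
    *)                  key="query" ;;
  esac

  # Note: no JSON escaping is done, so inputs must not contain
  # double quotes or backslashes.
  printf '{"source": "%s", "%s": "%s"}' "$source" "$key" "$input"
}

# URL input with the universal source
build_payload "universal" "https://sandbox.oxylabs.io/"
echo

# Parametrized input (a product ID) with a dedicated source
build_payload "amazon_product" "B07FZ8S74R"
echo
```

The printed body could then be passed to the cURL command from the request samples, for example with `-d "$(build_payload universal 'https://sandbox.oxylabs.io/')"`.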

{% hint style="info" %}
If you need help with your first request or with optimizing your setup, our 24/7 expert support team is available via live chat.
{% endhint %}

## Testing via Web Scraper API Playground

Try out [**Web Scraper API**](https://oxylabs.io/products/scraper-api/web) and [**OxyCopilot**](https://oxylabs.io/products/scraper-api/ai-web-scraper-copilot) in the [**Web Scraper API Playground**](https://dashboard.oxylabs.io/?route=/api-playground).

{% embed url="" %}

{% embed url="" %}

## Testing via Postman

Get started with our API using Postman, a handy tool for making HTTP requests. Download our [**Web Scraper API Postman collection**](https://files.gitbook.com/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FzrXw45naRpCZ0Ku9AjY1%2Fuploads%2FMeGA0TZQMcAFHoVhRSQi%2FWeb%20Scraper%20API.new_postman_collection.json?alt=media\&token=9f51d41b-6604-4eef-b6c1-5024cf52c5bf) and import it. The collection contains examples that demonstrate the scraper's functionality. Customize the examples to your needs or start scraping right away.

For step-by-step instructions, watch the video tutorial below. If you are new to Postman, check out this short [**guide**](/integrations/cn/wang-ye-pa-chong-api-ji-cheng/postman.md).

{% embed url="" %}

{% hint style="info" %}
*All information provided here is on an "as is" basis and for informational purposes only. We make no representation and disclaim all liability with respect to your use of any information contained on this page. Before engaging in scraping activities of any kind, you should consult your legal advisors and carefully read the relevant website's terms of service or obtain a scraping license.*
{% endhint %}