示例 采集百度搜索结果列表的标题和链接。

采集代码

$data = QueryList::get('https://www.baidu.com/s?wd=QueryList', null, [
        'headers' => [
            'User-Agent' => 'Mozilla/5.0 (Macintosh; Intel Mac OS X 10_15_3) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/80.0.3987.149 Safari/537.36',
            'Accept-Encoding' => 'gzip, deflate, br',
        ]
    ])->rules([
        'title' => ['h3', 'text'],
        'link' => ['h3>a', 'href']
    ])
    ->range('.result')
    ->queryData();
print_r($data);

采集结果
Array
(

[0] => Array
    (
        [title] => QueryList|基于phpQuery的无比强大的PHP采集工具
        [link] => http://www.baidu.com/link?url=GU_YbDT2IHk4ns1tjG2I8_vjmH0SCJEAPuuZN
    )
[1] => Array
    (
        [title] => PHP 用QueryList抓取网页内容 - wb145230 - 博客园
        [link] => http://www.baidu.com/link?url=zn0DXBnrvIF2ibRVW34KcRVFG1_bCdZvqvwIhUqiXaS
    )
[2] => Array
    (
        [title] => 介绍- QueryList指导文档
        [link] => http://www.baidu.com/link?url=pSypvMovqS4v2sWeQo5fDBJ4EoYhXYi0Lxx
    )
    //...

)
QueryList安装:https://querylist.cc/docs/guide/v4/installation

好用的工具:

  • 八爪鱼
  • 后羿
  • 火车头

文章摘要:https://querylist.cc/docs/guide/v4/example
QueryList手册:https://querylist.cc/docs/guide/v4

最后修改:2020 年 12 月 10 日 08 : 14 PM
对您有帮助的话,请赏包辣条吧 ^~^