直接上代码,免的自己忘记了
<?php
echo "Hello World!";
$url = "http://www.youdao.com/w/ok/#keyfrom=dict2.top";//要爬取的网址
//$url = "https://www.runoob.com/html/html-links.html";//要爬取的网址
$ch = curl_init();
curl_setopt($ch, CURLOPT_URL, $url);
curl_setopt ($ch, CURLOPT_HEADER,false);
curl_setopt($ch, CURLOPT_SSL_VERIFYPEER, false);
curl_setopt($ch, CURLOPT_SSL_VERIFYHOST, false);
//设置用户代理(后缀为.top的网址不加这个获取不到内容。)
curl_setopt($ch, CURLOPT_USERAGENT, "Mozilla/5.0 (Windows NT 6.1; WOW64) AppleWebKit/537.22 (KHTML, like Gecko) Chrome/25.0.1364.172 Safari/537.22");
//关闭直接输出
curl_setopt($ch, CURLOPT_RETURNTRANSFER, 1);
//加入重定向处理
curl_setopt($ch, CURLOPT_FOLLOWLOCATION ,1);
$response = curl_exec($ch);
echo curl_errno($ch);
echo($response);
curl_close($ch);
?>
先获取到了全部内容,在弄指定的内容。好像要用正则表达式,这个也是十窍通了九窍,一窍不通…