爬虫--Lxml简单案例 - 代码天地

爬虫--Lxml简单案例

编程语言 2018-10-18 11:49:09 阅读次数: 0

1.以爬取简书首页标题为例

import requests
from lxml import etree

# 简书首页title爬取
class LxmlSpider:
    def __init__(self):
        self.session = requests.Session()

    def jian_shu_spider(self, url, headers):
        response = requests.get(url, headers=headers).text
        result = etree.HTML(response)
        # title的xpath
        title_list = result.xpath("//div/a[@class='title']")
        for title in title_list:
            print("文章标题：%s"%title.text)

if __name__ == '__main__':
    lxml_soup = LxmlSpider()
    lxml_soup.jian_shu_spider(
    "http://www.jianshu.com",
        {
        "Referer": "https://www.jianshu.com/",
        "User-Agent": "Mozilla/5.0 (Windows NT 6.1; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/68.0.3440.106 Safari/537.36"
        }
    )

2.爬取结果

猜你喜欢

转载自blog.csdn.net/qq_39620483/article/details/83141726

爬虫--Lxml简单案例

爬虫---lxml简单操作

使用lxml编写简单爬虫实例

python简单爬虫用lxml库解析数据

python简单爬虫用lxml解析页面中的表格

爬虫--BeautifulSoup简单案例

爬虫简单案例

node - 简单的爬虫案例

scrapy爬虫简单案例

XPATH(lxml)爬虫测试

爬虫——lxml 模块

爬虫之lxml库

通过lxml数据抽取实现一个简单爬虫(爬虫基础学习)

爬虫基础：lxml与requests库, 使用爬虫获取一个确定的简单信息

Python爬虫解析网页的三种方法，lxml、BeautifulSoup、re案例！

python爬虫精选06集（xpath解析、lxml解析库、案例实战）

LXML库简单使用

使用requests+lxml实现简单的斗鱼信息爬虫（适用于新手）

爬虫基础——正则、xpath、lxml

python爬虫（三）xpath与lxml

【python爬虫】安装lxml模块

python爬虫入门（2）----- lxml

使用lxml进行爬虫简介

Python爬虫之路-lxml模块

python爬虫6：lxml库

爬虫基础知识简单案例

爬虫中的selenium简单学习及案例

爬虫之scrapy简单案例之猫眼

【node】12、Koa实现简单爬虫案例

Python高级教程：简单爬虫实践案例

今日推荐

周排行

深度学习------Lingvo框架下的加速通道GPipe

webjars管理静态资源

C专家编程_2.2

mysql 源码安装

json文件操作

123231432

注解的实现

Spring MVC 控制器

《人月神话》读后感二

C#使用HttpWebRequest和HttpWebResponse上传文件示例

每日归档

更多

2024-09-08(0)

2024-09-07(0)

2024-09-06(0)

2024-09-05(0)

2024-09-04(0)

2024-09-03(0)

2024-09-02(0)

2024-09-01(0)

2024-08-31(0)

2024-08-30(0)