Scraping Taobao with Python + Selenium

Programming languages  2018-05-28 17:06:26

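from selenium import webdriver
from selenium.webdriver.chrome.service import Service
from selenium.webdriver.common.by import By
from lxml import etree
import time

# Selenium 4 takes the chromedriver path through a Service object.
driver = webdriver.Chrome(service=Service(r"C:\Program Files (x86)\Google\Chrome\Application\chromedriver.exe"))
driver.maximize_window()
driver.implicitly_wait(10)  # applies to every later find_element(s) call


def first_text(node, xpath):
    '''Return the first text result of a relative XPath, or '' if there is none.'''
    result = node.xpath(xpath)
    return result[0].strip() if result else ''


def get_url(url):
    '''Load the search-results URL, then scrape it page by page.'''
    driver.get(url)
    while True:
        get_info()
        if not next_url():
            break
        time.sleep(3)  # give the next page time to render


def get_info():
    '''Parse the current page and print price, sales count and shop name.

    The first (ad) item uses a different class from the rest, so the two
    groups are selected separately and then merged.
    '''
    selector = etree.HTML(driver.page_source)
    infos1 = selector.xpath('//*[@class="item J_MouserOnverReq item-ad  "]')
    infos2 = selector.xpath('//*[@class="item J_MouserOnverReq  "]')

    for info in infos1 + infos2:
        # The leading "." keeps each XPath relative to the current item;
        # a bare "//" would search the whole document on every iteration.
        money = first_text(info, './/*[@class="price g_price g_price-highlight"]/strong/text()')
        # name = first_text(info, './/*[@class="row row-2 title"]/a/span/text()')
        number = first_text(info, './/*[@class="deal-cnt"]/text()')
        dian_name = first_text(info, './/*[@class="shopname J_MouseEneterLeave J_ShopInfo"]/span[2]/text()')
        print(money, number, dian_name)


def next_url():
    '''Click the "下一页" (next page) link; return False when it is absent.'''
    links = driver.find_elements(By.LINK_TEXT, '下一页')
    if not links:
        return False
    links[0].click()
    return True


if __name__ == '__main__':
    try:
        driver.get('https://www.taobao.com/')
        driver.find_element(By.NAME, 'q').send_keys('python')
        driver.find_element(By.CLASS_NAME, 'search-button').click()  # submit the search
        get_url(driver.current_url)  # pass on the URL of the results page
    finally:
        driver.quit()  # close the browser even if scraping fails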
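
The parsing step can be tried without opening a browser. The sketch below is only an illustration: `SAMPLE_HTML` is hypothetical, heavily simplified markup that reuses class names from the script above, and `ITEM_XPATH` and `parse_items` are names made up for this example. It shows two things: a single XPath that matches both the ad item and the regular items by class token, and the difference between `//` (searches the whole document, even when called on one item) and `.//` (searches only inside that item).

```python
from lxml import etree

# Hypothetical, simplified markup that mimics the class names used above.
SAMPLE_HTML = '''
<div>
  <div class="item J_MouserOnverReq item-ad  ">
    <div class="price g_price g_price-highlight"><strong>59.00</strong></div>
    <div class="deal-cnt">120人付款</div>
  </div>
  <div class="item J_MouserOnverReq  ">
    <div class="price g_price g_price-highlight"><strong>35.50</strong></div>
    <div class="deal-cnt">48人付款</div>
  </div>
</div>
'''

# Matches any element whose class list contains the token J_MouserOnverReq,
# regardless of extra classes or trailing spaces.
ITEM_XPATH = ('//*[contains(concat(" ", normalize-space(@class), " "), '
              '" J_MouserOnverReq ")]')


def parse_items(html):
    """Return one (price, deal_count) tuple per item found in html."""
    rows = []
    for item in etree.HTML(html).xpath(ITEM_XPATH):
        # The leading "." restricts each query to the current item.
        price = item.xpath('.//*[@class="price g_price g_price-highlight"]/strong/text()')
        deals = item.xpath('.//*[@class="deal-cnt"]/text()')
        rows.append((price[0] if price else '', deals[0] if deals else ''))
    return rows


if __name__ == '__main__':
    for row in parse_items(SAMPLE_HTML):
        print(row)
```

Because every field is looked up inside its own item, an item that lacks a price or a sales count yields an empty string instead of shifting the remaining columns out of alignment.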

Reposted from blog.csdn.net/qq_18525247/article/details/80384824
