利用Python进行简单爬虫----学习之一 - 代码天地

利用Python进行简单爬虫----学习之一

其他 2018-07-24 03:24:17 阅读次数: 0

（先占坑，之后补充）

1.爬取豆瓣复仇者联盟3标题

from bs4 import BeautifulSoup
import requests
myurl = requests.get('https://movie.douban.com/subject/24773958/')
v_text = BeautifulSoup(myurl.text , 'lxml')
v_title = v_text.find('span' , attrs = {'property' : 'v:itemreviewed'}).text
print(v_title)

2.爬取豆瓣复联3简介

from bs4 import BeautifulSoup
import requests
myurl = requests.get('https://movie.douban.com/subject/24773958/')
v_text = BeautifulSoup(myurl.text , 'lxml')
v_shortIntroduce = v_text.find('span' , attrs = {'property' : 'v:summary'}).text
print(v_shortIntroduce)

3.爬取豆瓣复联3主要信息

from bs4 import BeautifulSoup
import requests
myurl = requests.get('https://movie.douban.com/subject/24773958/')
v_text = BeautifulSoup(myurl.text , 'lxml')
v_mainMeassage = v_text.find('div' , attrs = {'id' : 'info'}).text
print(v_mainMeassage)

4.爬取豆瓣电影标题

from bs4 import BeautifulSoup
import requests
myurl = requests.get('https://movie.douban.com/')
v_text = BeautifulSoup(myurl.text , 'lxml')
v_mainMeassage = v_text.findAll('li' , attrs = {'class' : 'title'})
for i in v_mainMeassage:
    print(i.text)

猜你喜欢

转载自blog.csdn.net/colpac/article/details/80574357

利用Python进行简单爬虫----学习之一

利用Python进行简单爬虫----学习之二

Python爬虫之一

【转载】Python3网络爬虫(一)：利用urllib进行简单的网页抓取

Python3网络爬虫(一)：利用urllib进行简单的网页抓取

python网络爬虫学习笔记之一爬虫基础入门

Python中利用BeautifulSoup库进行简单的网页爬虫

python最简单爬虫入手例子之一：

python爬虫学习requests中的模块请求参数之一

python学习之一

Python爬虫学习：简单的爬虫

【爬虫学习一】 Python实现简单爬虫（requests，BeautifulSoup）

利用python进行多线程爬虫

python爬虫基础之一（爬淘宝）

Python网页爬虫selenium，chromedriver之一

爬虫 Scrapy 学习系列之一：Tutorial

怎样利用 python 学习爬虫？

大数据python之简单的网络爬虫代码实现（单一与循环代码进行网络爬虫）

利用Python进行数据分析学习记录（一）

利用python爬虫实现简单翻译软件

python爬虫之一个完整的小爬虫

python学习笔记之一

python 学习笔记之一

python基础学习之一

Python爬虫：对selenium的webdriver进行简单封装

Python爬虫自学进行简单的文本抓取

python爬虫简单的添加代理进行访问

Python爬虫简单的添加代理进行访问！

利用 OpenCV-Python 进行人脸 Delaunay 三角剖分(人脸检测核心技术之一)

大众点评反爬虫简单研究之一

今日推荐

周排行

Leetcode简单题61~80

解决zookeeper磁盘IO高的问题

多线程相关方法详解

Maven-setting.xml文件详解

Maven 项目的 classpath 理解

渊亭科技大数据笔试题

配置JVM内存分配

计算机网络个人学习笔记（三）网络层：第三部分连载

js中两个等号(==)和三个等号(===)的区别

用C程序自动打开电脑上的程序

每日归档

更多

2024-09-18(0)

2024-09-17(0)

2024-09-16(0)

2024-09-15(0)

2024-09-14(0)

2024-09-13(0)

2024-09-12(0)

2024-09-11(0)

2024-09-10(0)

2024-09-09(0)