Python 3 regular expressions (re): getting all URLs from a web page

Other 2019-04-28 19:11:50

Sometimes we need to collect every URL on a web page, for example as input to other tests.

Taking Sogou WeChat search as an example:

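import re
import urllib.request

# Fetch the page and decode the raw bytes to text (UTF-8 is assumed here)
with urllib.request.urlopen("https://weixin.sogou.com/") as response:
    html = response.read().decode("utf-8", errors="replace")

# Capture absolute URLs (scheme://...) from <a href="..."> attributes
urls = re.findall(r'<a href="([a-zA-Z]+://[^"\s]*)"', html)
print(urls)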
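The same extraction can be tried offline on a small HTML string, which makes the pattern easier to check before pointing it at a live site. The following is a minimal sketch, not the original author's code: the helper name `extract_urls`, the constant `HREF_PATTERN`, and the sample markup are all made up for illustration, and the pattern is loosened slightly (an assumption) so that other attributes may appear before `href`.

```python
import re

# Hypothetical pattern: like the one above, but allows attributes before href
# and stops the URL at the closing double quote.
HREF_PATTERN = re.compile(r'<a\s+[^>]*?href="([a-zA-Z]+://[^"\s]*)"')


def extract_urls(html_text):
    """Return absolute URLs found in <a href="..."> attributes.

    Duplicates are dropped while the order of first appearance is kept.
    """
    seen = set()
    urls = []
    for url in HREF_PATTERN.findall(html_text):
        if url not in seen:
            seen.add(url)
            urls.append(url)
    return urls


# Made-up sample markup: two absolute links, one relative link, one duplicate.
sample = (
    '<a href="https://weixin.sogou.com/">home</a> '
    '<a class="x" href="http://example.com/a?b=1">a</a> '
    '<a href="/relative/path">rel</a> '
    '<a href="https://weixin.sogou.com/">home again</a>'
)
print(extract_urls(sample))
```

Relative links such as `/relative/path` are skipped because the pattern requires a scheme followed by `://`. For irregular real-world markup (single-quoted attributes, unusual spacing) an HTML parser is usually more dependable than a regular expression.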

Running the script prints the list of matched URLs.

A recommended site for testing regular expressions online: http://tool.oschina.net/regex/#
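As an offline alternative to an online tester, a pattern can also be checked with a few assertions. The sketch below compares two variants of the URL sub-pattern; the names `loose`, `strict`, and `is_absolute_url` are hypothetical. It also illustrates a common slip: in ASCII the range `A-z` covers the punctuation characters between `Z` and `a` (including `_`), so `A-Z` is usually what is intended.

```python
import re

# Character class written as A-z: also accepts [ \ ] ^ _ and the backtick.
loose = re.compile(r'[a-zA-z]+://[^\s]*')
# Character class written as A-Z: letters only, and the URL stops at a quote.
strict = re.compile(r'[a-zA-Z]+://[^\s"]*')


def is_absolute_url(text, pattern=strict):
    """Return True if the whole string matches the given URL pattern."""
    return pattern.fullmatch(text) is not None


print(is_absolute_url("https://weixin.sogou.com/"))           # well-formed URL
print(is_absolute_url("ht_tp://example.com", pattern=loose))  # underscore slips through
print(is_absolute_url("ht_tp://example.com"))                 # rejected by strict
```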

Perfect.

Reposted from blog.csdn.net/weixin_44065501/article/details/89346178
