scrapy--(failed 3 times): [<twisted.python.failure.Failure twisted.web._newclient.ParseError - 代码天地

Other 2021-11-29 16:46:24

Problem:
[scrapy.downloadermiddlewares.redirect] DEBUG: Redirecting (302)
(failed 3 times): [<twisted.python.failure.Failure twisted.web._newclient.ParseError: ('wrong number of parts', b'')>]

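Background:
1. The URL is being redirected while crawling with the Scrapy framework.
2. According to the HTTP standard, responses with status codes in the 200-300 range are considered successful.
3. To handle responses outside this range, specify the status codes the spider can handle through the spider's handle_httpstatus_list attribute or the HTTPERROR_ALLOWED_CODES setting.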
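
The filtering rule described in the points above can be sketched in plain Python. This is a simplified illustration, not Scrapy's actual implementation; the function name `passes_status_filter` and its parameters are hypothetical.

```python
def passes_status_filter(status, allowed_codes=()):
    """Return True if a response with this status would reach the spider callback.

    Simplified sketch: 2xx always passes; anything else passes only when
    listed in allowed_codes (standing in for handle_httpstatus_list or
    HTTPERROR_ALLOWED_CODES).
    """
    return 200 <= status < 300 or status in allowed_codes


print(passes_status_filter(200))         # True
print(passes_status_filter(302))         # False
print(passes_status_filter(302, [302]))  # True
```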

Solution 1:
Add the following to the Scrapy project's settings.py:

HTTPERROR_ALLOWED_CODES = [301]
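
The log in the problem statement reports a 302, while the setting above lists only 301. Below is a hedged variant of the same settings.py fragment, assuming you want the spider to receive the redirect response itself instead of following it. `REDIRECT_ENABLED` is a standard Scrapy setting, but whether disabling it suits your crawl is an assumption to check against the target site.

```python
# settings.py (sketch; adjust the codes to what your target site returns)
HTTPERROR_ALLOWED_CODES = [301, 302]  # let these statuses through to the spider
REDIRECT_ENABLED = False              # assumption: do not follow redirects automatically
```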

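Solution 2:
Add headers in the spider file.
Fill the headers with your own values, and log in to the target site before copying the cookie.
Taking bibi as an example, once logged in: press F12 -> Network -> refresh the page -> Headers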

# Replace both values with the ones copied from your own browser session.
headers = {
    'user-agent': 'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/93.0.4577.63 Safari/537.36',
    'cookie': 'MUID=0D619CB8AF6860562CB38CC9AE466165; MUIDB=0D619CB8AF6860562CB38CC9AE466165; _EDGE_V=1; SRCHD=AF=NOFORM; SRCHUID=V=2&GUID=FE70CC4A14BD454AAF0D558AA801761A&dmnchg=1; _tarLang=default=en; _TTSS_IN=hist=WyJ6aC1IYW5zIiwiYXV0by1kZXRlY3QiXQ==; _TTSS_OUT=hist=WyJlbiJd;'
}
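
Scrapy's `Request` also accepts a `cookies` argument that takes a dict, so it can be convenient to split the copied cookie string into name/value pairs. Here is a minimal sketch using only the standard library; the helper name `cookie_header_to_dict` is hypothetical, and the sample string is a shortened version of the cookie above.

```python
def cookie_header_to_dict(raw_cookie):
    """Split a raw 'name=value; name2=value2' Cookie header into a dict."""
    cookies = {}
    for part in raw_cookie.split(';'):
        part = part.strip()
        if not part:
            continue  # skip the empty piece after a trailing ';'
        # Split on the first '=' only, since values may contain '=' themselves.
        name, _, value = part.partition('=')
        cookies[name] = value
    return cookies


raw = 'MUID=0D619CB8AF6860562CB38CC9AE466165; SRCHD=AF=NOFORM; _EDGE_V=1;'
print(cookie_header_to_dict(raw))
```

The resulting dict can then be passed as `cookies=` when building a `scrapy.Request`, with `headers=` carrying the user-agent.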

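Solution 3:
Mimic how a person browses, with varying intervals between clicks, by using a random number. Set this in settings.py:

import random
DOWNLOAD_DELAY = random.uniform(1, 3)  # base delay in seconds, drawn once between 1 and 3 when the settings load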
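
Note that the `random.uniform` call above is evaluated once when the settings load, so every request in a run shares the same base delay. Scrapy's `RANDOMIZE_DOWNLOAD_DELAY` setting (enabled by default) already varies the wait per request between 0.5 and 1.5 times `DOWNLOAD_DELAY`. The sketch below shows a fixed base delay together with a hypothetical helper, `sample_delay`, that mimics that range purely for illustration; Scrapy does not expose such a function.

```python
import random

# settings.py (sketch): a fixed base delay, randomized per request by Scrapy
DOWNLOAD_DELAY = 2                # base delay in seconds (assumed value)
RANDOMIZE_DOWNLOAD_DELAY = True   # Scrapy's default; listed here for clarity


def sample_delay(base=DOWNLOAD_DELAY):
    """Illustrative only: draw a wait in [0.5 * base, 1.5 * base] seconds."""
    return random.uniform(0.5 * base, 1.5 * base)


print(sample_delay())  # some value between 1.0 and 3.0
```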

Reposted from blog.csdn.net/weixin_45044349/article/details/121093318
