scrapy框架异常之no more duplicates will be shown (see DUPEFILTER_DEBUG to show all duplicates) - 代码天地

scrapy框架异常之no more duplicates will be shown (see DUPEFILTER_DEBUG to show all duplicates)

其他 2019-01-21 09:01:06 阅读次数: 0

今天在用scrapy爬虫时，报了下面的错误：

2019-01-17 16:47:18 [scrapy.dupefilters] DEBUG: Filtered duplicate request: <GET https://newhouse.fang.com/house/s/b95/> - no more duplicates will be shown (see DUPEFILTER_DEBUG to show all duplicates)
2019-01-17 16:47:18 [scrapy.core.engine] INFO: Closing spider (finished)
2019-01-17 16:47:18 [scrapy.statscollectors] INFO: Dumping Scrapy stats:
{'downloader/request_bytes': 2653,
 'downloader/request_count': 7,
 'downloader/request_method_count/GET': 7,
 'downloader/response_bytes': 220568,
 'downloader/response_count': 7,
 'downloader/response_status_count/200': 7,
 'dupefilter/filtered': 1,
 'finish_reason': 'finished',
 'finish_time': datetime.datetime(2019, 1, 17, 8, 47, 18, 37428),
 'log_count/DEBUG': 9,
 'log_count/INFO': 7,
 'request_depth_max': 7,
 'response_received_count': 7,
 'scheduler/dequeued': 7,
 'scheduler/dequeued/memory': 7,
 'scheduler/enqueued': 7,
 'scheduler/enqueued/memory': 7,
 'start_time': datetime.datetime(2019, 1, 17, 8, 47, 5, 279308)}
2019-01-17 16:47:18 [scrapy.core.engine] INFO: Spider closed (finished)

原因:在爬虫出现了重复的链接,重复的请求,出现这个DEBUG或者是yield scrapy.Request(xxxurl,callback=self.xxxx)中有重复的请求其实scrapy自身是默认有过滤重复请求的让这个DEBUG不出现,可以有 dont_filter=True,在Request中添加可以解决

yield scrapy.Request(xxxurl,callback=self.xxxx,dont_filter=True)

猜你喜欢

转载自blog.csdn.net/qq_40176258/article/details/86527568

scrapy框架异常之no more duplicates will be shown (see DUPEFILTER_DEBUG to show all duplicates)

Find All Duplicates in an Array

leetcode:Find All Duplicates in an Array

【CODE】Find All Duplicates in an Array

【LeetCode】442. Find All Duplicates in an Array

442. Find All Duplicates in an Array

leetcode——442Find All Duplicates in an Array

[leetcode]442. Find All Duplicates in an Array

Leetcode 442. Find All Duplicates in an Array

LeetCode #442 - Find All Duplicates in an Array

leetcode-442.Find All Duplicates in an Array

LeetCode-Find All Duplicates in an Array

LeetCode系列(七)-Find All Duplicates in an Array

LeetCode442: Find All Duplicates in an Array

Leetcode 442 Find All Duplicates in an Array

LeetCode442. Find All Duplicates in an Array

[leetcode] 442. Find All Duplicates in an Array

LeetCode 442 Find All Duplicates in an Array (思维)

1047--Remove All Adjacent Duplicates In String

[LC] 442. Find All Duplicates in an Array

leetcode array|442. Find All Duplicates in an Array

【LeetCode】442. Find All Duplicates in an Array【M】【60】

python leetcode 442. Find All Duplicates in an Array

【LeetCode】442. Find All Duplicates in an Array（C++）

[LeetCode] 442. Find All Duplicates in an Array (C++)

LeetCode刷题：442. Find All Duplicates in an Array

lc1047. Remove All Adjacent Duplicates In String

LeetCode 1047. Remove All Adjacent Duplicates In String

1047. Remove All Adjacent Duplicates In String - Easy

【Golang】LeetCode442Find All Duplicates in an Array

今日推荐

周排行

成为C++高手之宏与枚举

在CAD二次开发中使用进度条

Js插件ECharts，HighCharts学习网址整理

Celery提交任务出错(on windows.)

cephfs内核客户端性能追踪

thinkphp中PHPExcel用法

EntityFramework动态组合多排序字段

汇编语言（八）实验9 根据材料编程

安装ubuntu后必须做的事情（对我而言）

JS函数式编程

每日归档

更多

2024-10-22(0)

2024-10-21(0)

2024-10-20(0)

2024-10-19(0)

2024-10-18(0)

2024-10-17(0)

2024-10-16(0)

2024-10-15(0)

2024-10-14(0)

2024-10-13(0)