scrapy框架CrawlSpiders类

CrawlSpiders类

创建爬虫:

scrapy genspider -t crawl tencent tencent.com

from scrapy.linkextractors import LinkExtractor

link_list = LinkExtractor(allow=("start=\d+"))

link_list.extract_links(response)

猜你喜欢

转载自www.cnblogs.com/xuezhihao/p/11646142.html