Scrapy爬取全网小说到本地TXT，Python少年最爱的一个爬虫项目！

其他 2018-05-08 23:00:00 阅读次数: 4

scrapy，写了一个简单的python爬虫项目，功能是采集某小说网站的全部小说，保存到本地

送给刚刚学习scrapy的python朋友学习。

Scrapy爬取全网小说到本地TXT，Python少年最爱的一个爬虫项目！

部分Python代码：

# -*- coding: utf-8 -*-

# Define your item pipelines here

# Don't forget to add your pipeline to the ITEM_PIPELINES setting

# See: https://doc.scrapy.org/en/latest/topics/item-pipeline.html

import os

class BiqugePipeline(object):

def process_item(self, item, spider):

#return item

curPath = 'E:/小说/'

tempPath = str(item['name'])

targetPath = curPath+ tempPath

#print('-----')

#print(targetPath)

if not os.path.exists(targetPath):

os.makedirs(targetPath)

filename_path = targetPath+'/'+ str(item['chapter_name']) + '.txt'

print('------')

print(filename_path)

print(item['chapter_content'])

with open(filename_path, 'a', encoding='utf-8') as f:

f.writelines(item['chapter_content'])

return item

猜你喜欢

转载自blog.csdn.net/qq_41841569/article/details/80225544

Scrapy爬取全网小说到本地TXT，Python少年最爱的一个爬虫项目！

Python爬虫层层递进，从爬取一章小说到爬取全站小说

如何用python爬虫从爬取一章小说到爬取全站小说

python 爬取整本小说到本地文件

五分钟写一个小爬虫，爬取小说并写入txt文件

【Python爬虫】轻松几步将一个 scrapy项目变成 scrapy_redis 分布式爬取

一个爬虫从网页中爬取小说

Python笔记（五） --写一个爬虫对新笔趣阁的小说进行爬取

scrapy爬取小说(一）

Python爬虫入门实战系列（一）--爬取网络小说并存放至txt文件

一个简单的使用scrapy爬取小说的例

Scrapy 学习笔记 - 一个练手任务，爬取起点的全部小说名

python爬虫五：爬取小说，下载到本地

我的第一个python爬虫程序——爬取网络小说（含错误及源码）

Python爬虫——爬取小说

Python爬虫之Scrapy框架系列（14）——实战ZH小说爬取【多页爬取】

Python爬虫实战项目之小说信息爬取

python爬虫-利用scrapy框架完成天天书屋内容爬取，并保存本地txt

scrapy 爬取小说

scrapy爬取小说

python爬虫--一次爬取小说的尝试

python爬虫之类的方法爬取一部小说

【python实现网络爬虫（5）】第一个Scrapy爬虫实例项目（Scrapy原理及Scrapy爬取名言名句网站信息）

一个简单的爬取小说的python程序彻底搞懂Python的字符编码

python-scrapy爬取小说下载网小说

Python爬虫练习爬取网络小说保存到txt

Python爬虫实战，requests+openpyxl模块，爬取小说数据并保存txt文档（附源码）

爬虫：Scrapy爬取第一个网页实例解析

小说免费看！python爬虫框架scrapy 爬取纵横网

Python爬虫—爬取小说名著

今日推荐

周排行

深度学习------Lingvo框架下的加速通道GPipe

webjars管理静态资源

C专家编程_2.2

mysql 源码安装

json文件操作

123231432

注解的实现

Spring MVC 控制器

《人月神话》读后感二

C#使用HttpWebRequest和HttpWebResponse上传文件示例

每日归档

2024-09-08(0)

2024-09-07(0)

2024-09-06(0)

2024-09-05(0)

2024-09-04(0)

2024-09-03(0)

2024-09-02(0)

2024-09-01(0)

2024-08-31(0)

2024-08-30(0)