python scraping webs - python取得NIPS oral paper列表 - 代码天地

python scraping webs - python取得NIPS oral paper列表

其他 2019-12-13 14:34:11 阅读次数: 0

 1 from lxml import html
 2 import requests
 3 
 4 # using xpath
 5 
 6 # page = requests.get('http://econpy.pythonanywhere.com/ex/001.html')
 7 page = requests.get('https://nips.cc/Conferences/2019/Schedule')
 8 tree = html.fromstring(page.content)
 9 
10 #This will create a list of buyers:
11 # buyers = tree.xpath('//div[@title="buyer-name"]/text()')
12 # test = tree.xpath('//*[@id="maincard_15788"]/div[3]')
13 # print(test)
14 
15 
16 
17 doc = tree
18 # btags = doc.xpath("//*[@class[starts-with(., 'maincard narrower Oral') and string-length() > 3]]")
19 btags = doc.xpath("//*[@class[starts-with(., 'maincard narrower Spotlight') and string-length() > 3]]")
20 idx = 1
21 with open('nips_paperlist_spotlight.txt', 'w') as f:
22     for b in btags:
23         type = b.xpath("div[1]")[0].text
24         title = b.xpath("div[3]")[0].text
25         author = b.xpath("div[5]")[0].text
26         out_str = "%d, %s, %s, %s\n"%(idx, type,  title, author)
27         print(out_str)
28         f.writelines(out_str)
29         # print(idx)
30         # print(type)
31         # print(title)
32         # print(author)
33         idx += 1

使用XPath

lxml, requests

https://docs.python-guide.org/scenarios/scrape/

https://stackoverflow.com/questions/12393858/xpath-using-contains-with-a-wildcard

猜你喜欢

转载自www.cnblogs.com/imoon22/p/12034855.html

python scraping webs - python取得NIPS oral paper列表

python web scraping

"Web Scraping with Python"笔记（一）

Web Scraping HTML Tables with Python

Website Scraping with Python 阅读笔记

OReilly.Web.Scraping.with.Python.2015.6

《Web Scraping with Python》PDF高清完整版-PDF下载

Web Scraping using Python Scrapy_BS4 - Software

Web Scraping using Python Scrapy_BS4 - Introduction

Web Scraping using Python Scrapy_BS4 - using Scrapy and Python(2)

Web Scraping using Python Scrapy_BS4 - using Scrapy and Python(1)

《OReilly.Web.Scraping.with.Python.Collecting.Data.from.the.Modern.Web》pdf

python - 神器系列之爬虫神器scraper api/proxy api for web scraping

Python Scraping：通过 3 个简单步骤创建浏览器类

网络爬虫基础教程 Web scraping using Beautiful soup in Python: An introduction

NIPS 2018 paper list（论文列表）

CVPR2019 Oral论文《Side Window Filtering》解读及算法 Python 实现

【NIPS2018】Spotlight及Oral论文汇总

EXCEL TIPS From Webs

good_webs

开源|2017 CVPR（Oral Paper）多目标实时体态估测项目开源

axis接收发布webS

python自动从arxiv下载paper

Oral English

scraping-day1

scraping-day0

python列表

Python—列表

python 列表

python——列表

今日推荐

周排行

深度学习------Lingvo框架下的加速通道GPipe

webjars管理静态资源

C专家编程_2.2

mysql 源码安装

json文件操作

123231432

注解的实现

Spring MVC 控制器

《人月神话》读后感二

C#使用HttpWebRequest和HttpWebResponse上传文件示例

每日归档

更多

2024-09-08(0)

2024-09-07(0)

2024-09-06(0)

2024-09-05(0)

2024-09-04(0)

2024-09-03(0)

2024-09-02(0)

2024-09-01(0)

2024-08-31(0)

2024-08-30(0)