Python全国就业分布

Catalog

就业分布

Python代码(直接复制可用)

# 从网络读取数据
import requests, re, pandas as pd
url = 'https://blog.csdn.net/Yellow_python/article/details/81811778'
r = requests.get(url, headers={'User-Agent': 'Opera/8.0 (Windows NT 5.1; U; en)'})
data = re.findall('<pre><code>([\s\S]+?)</code></pre>', r.text)[0].strip()
df = pd.DataFrame([i.split(',') for i in data.split()],
                  columns=['province', 'city', 'longitude', 'latitude', 'demand', 'salary', 'total'])
df['longitude'] = pd.to_numeric(df['longitude'])
df['latitude'] = pd.to_numeric(df['latitude'])
df['demand'] = pd.to_numeric(df['demand'])
df['salary'] = pd.to_numeric(df['salary'])
df['total'] = pd.to_numeric(df['total'])
print(df.head(22))
# 可视化中文输出设置
from pylab import mpl
mpl.rcParams['font.sans-serif'] = ['SimHei']  # 指定默认字体
mpl.rcParams['axes.unicode_minus'] = False  # 解决保存图像是负号'-'显示为方块的问题
# 地图包
from mpl_toolkits.basemap import Basemap  # 需要下载basemap
m = Basemap(llcrnrlon=73.6, llcrnrlat=17.9, urcrnrlon=134.8, urcrnrlat=53.6)  # 经纬度
m.drawcountries(linewidth=0.5)  # 国界线
m.drawcoastlines(linewidth=0.4)  # 沿岸线
# 散点图(面积表示需求量、透明度表示薪资)
import matplotlib.pyplot as mp
for i in df.values:
    mp.scatter(i[2], i[3], s=i[4]**0.44*44, alpha=i[5]/df['salary'].max(), c='orange')
    mp.text(i[2], i[3], i[1], size=i[4]**0.5/5, color='black', ha='center', va='center')
mp.show()

城市top8

说明

数据说明

  • 数据获取方式:爬虫(点击可查看爬虫代码)
  • 来源网站:51job
  • 采集日期:2018年8月15日
  • 其它说明:数据有少量清洗

下载本篇数据的代码

import requests, re, pandas as pd
def download():
    url = 'https://blog.csdn.net/Yellow_python/article/details/81811778'
    header = {'User-Agent': 'Opera/8.0 (Windows NT 5.1; U; en)'}
    r = requests.get(url, headers=header).text
    data = re.findall('<pre><code>([\s\S]+?)</code></pre>', r)[0].strip()
    df = pd.DataFrame([i.split(',') for i in data.split()],
                      columns=['province', 'city', 'longitude', 'latitude', 'demand', 'salary', 'total'])
    df['longitude'] = pd.to_numeric(df['longitude'])
    df['latitude'] = pd.to_numeric(df['latitude'])
    df['demand'] = pd.to_numeric(df['demand'])
    df['salary'] = pd.to_numeric(df['salary'])
    df['total'] = pd.to_numeric(df['total'])
    # df.to_csv('python_job.csv', index=False)  # 保存为csv
    return df
print(download().head())

数据源

字段名称:省,市,经度,纬度,岗位需求量,平均薪资,总量(需求量*薪资)

上海市,上海市,121.4737,31.23037,8834,17746.217643951375,156770086.66666645
北京市,北京市,116.40717,39.90469,6895,17359.462412376124,119693493.33333339
广东省,深圳市,114.05956,22.54286,4131,17830.382272250466,73657309.16666667
广东省,广州市,113.26436,23.12908,3146,14605.727378681919,45949618.33333331
浙江省,杭州市,120.15515,30.27415,2272,16811.56763497653,38195881.66666668
四川省,成都市,104.06476,30.5702,1398,13257.751549833096,18534336.666666668
江苏省,南京市,118.79647,32.05838,1306,13538.495405819296,17681275.0
湖北省,武汉市,114.30525,30.59276,1090,12180.56880733945,13276820.000000002
江苏省,苏州市,120.58319,31.29834,733,14013.785811732605,10272105.0
重庆市,重庆市,106.55073,29.56471,505,10797.056105610558,5452513.333333332
陕西省,西安市,108.93984,34.34127,417,12317.745803357317,5136500.000000001
湖南省,长沙市,112.93886,28.22778,456,11094.422514619884,5059056.666666667
福建省,厦门市,118.08948,24.47951,294,13753.897392290246,4043645.8333333326
安徽省,合肥市,117.22901,31.82057,335,11926.701492537311,3995444.9999999995
福建省,福州市,119.29647,26.07421,317,10957.518401682437,3473533.3333333326
天津市,天津市,117.19937,39.0851,280,11461.142857142855,3209119.9999999995
河南省,郑州市,113.62493,34.74725,301,10658.56589147287,3208228.333333334
广东省,珠海市,113.57668,22.27073,220,14309.015151515154,3147983.333333334
广东省,东莞市,113.75179,23.02067,153,18108.55119825708,2770608.333333333
江苏省,无锡市,120.31237,31.49099,234,11276.994301994302,2638816.6666666665
浙江省,宁波市,121.55027,29.87386,164,13187.113821138213,2162686.666666667
广东省,佛山市,113.12192,23.02185,135,14583.117283950614,1968720.833333333
海南省,东方市,108.65367,19.09614,148,13021.790540540538,1927224.9999999998
辽宁省,大连市,121.61476,38.91369,161,11700.517598343686,1883783.3333333333
辽宁省,沈阳市,123.4631,41.67718,175,9678.42857142857,1693725.0
山东省,济南市,117.12009,36.65184,145,8978.735632183907,1301916.6666666665
山东省,青岛市,120.38299,36.06623,118,10921.398305084746,1288725.0
云南省,昆明市,102.83322,24.87966,98,11542.517006802724,1131166.666666667
河北省,沙河市,114.50332,36.85516,73,14521.32420091324,1060056.6666666665
江苏省,昆山市,120.98181,31.38475,75,12517.222222222223,938791.6666666667
江西省,南昌市,115.85794,28.68202,105,8897.619047619048,934250.0
黑龙江省,五常市,127.16746,44.93190999999999,48,17418.75,836100.0
黑龙江省,哈尔滨市,126.5358,45.80216,87,8833.429118773947,768508.3333333334
吉林省,长春市,125.32357,43.81602,54,13887.808641975307,749941.6666666666
江苏省,常州市,119.97365,31.81072,64,10085.15625,645450.0
贵州省,贵阳市,106.63024,26.64702,71,8284.74178403756,588216.6666666667
海南省,海口市,110.19989,20.04422,50,11446.166666666664,572308.3333333333
河北省,石家庄市,114.5143,38.04275999999999,54,9572.06790123457,516891.66666666674
山东省,龙口市,120.47872,37.64345,34,14056.127450980393,477908.3333333334
河北省,安国市,115.32664,38.41845,41,11597.357723577235,475491.6666666666
安徽省,亳州市,115.77931,33.84461,16,27526.041666666664,440416.6666666666
广东省,中山市,113.3926,22.51595,36,10485.833333333334,377490.0
河南省,三门峡市,111.2003,34.77261,37,9930.405405405405,367425.0
河南省,焦作市,113.24201,35.21563,46,7684.782608695652,353500.0
广东省,惠州市,114.41679,23.11075,35,9897.285714285714,346405.0
福建省,南平市,118.12043,27.33175,21,16288.095238095235,342049.99999999994
四川省,泸州市,105.44257,28.8717,19,16676.315789473683,316850.0
广西壮族自治区,南宁市,108.3669,22.81673,31,8742.741935483871,271025.0
浙江省,嘉兴市,120.7555,30.74501,24,11275.694444444445,270616.6666666667
甘肃省,合作市,102.91012,35.00027,25,10169.5,254237.5
黑龙江省,海伦市,126.97338,47.4621,15,15366.666666666666,230500.0
浙江省,温州市,120.69939,27.99492,27,8125.0,219375.0
台湾省,桃园市,121.310511,24.99661,14,14898.809523809523,208583.3333333333
湖南省,津市市,111.87741,29.60543,21,9810.31746031746,206016.66666666666
山东省,烟台市,121.44801,37.46353,21,9524.285714285714,200010.0
山东省,泰安市,117.0884,36.19994000000001,16,11804.6875,188875.0
江苏省,徐州市,117.28577,34.2044,22,7538.863636363636,165855.0
江苏省,南通市,120.89371,31.97958,19,8722.807017543859,165733.3333333333
湖北省,黄石市,115.0389,30.19953,12,13152.083333333334,157825.0
吉林省,临江市,126.91798,41.81193,14,11102.38095238095,155433.3333333333
吉林省,龙井市,129.42641,42.76587,11,13518.181818181818,148700.0
江苏省,扬州市,119.41269,32.39358,15,9548.333333333334,143225.0
浙江省,台州市,121.42056,28.65611,14,10096.42857142857,141350.0
黑龙江省,双城市,126.31227,45.38355,9,14850.185185185184,133651.66666666666
甘肃省,兰州市,103.83417,36.06138,13,9087.179487179486,118133.33333333333
江西省,鹰潭市,117.06919,28.26019,8,14583.33333333333,116666.66666666664
安徽省,芜湖市,118.43313,31.35246,12,9566.666666666666,114800.0
海南省,三亚市,109.51209,18.25248,7,15695.238095238094,109866.66666666666
福建省,宁德市,119.54819,26.66571,8,13624.999999999998,108999.99999999999
福建省,永安市,117.36518,25.94138,9,11981.48148148148,107833.33333333331
辽宁省,丹东市,124.35601,39.9998,7,15071.428571428569,105499.99999999999
安徽省,马鞍山市,118.50611,31.67067,8,13070.833333333334,104566.66666666667
江苏省,泰兴市,120.052,32.17191,7,14571.42857142857,102000.0
浙江省,义乌市,120.07468,29.30558,9,11279.62962962963,101516.66666666667
浙江省,平湖市,121.01606,30.67585,6,16916.666666666668,101500.0
广东省,江门市,113.08161,22.57865,10,10011.666666666668,100116.66666666667
江苏省,盐城市,120.16164,33.34951,11,8865.151515151516,97516.66666666667
新疆维吾尔自治区,乌鲁木齐市,87.61688000000001,43.82663,8,12100.0,96800.0
山西省,太原市,112.55067,37.87059,13,7048.974358974359,91636.66666666667
浙江省,海宁市,120.68102,30.50938,5,18220.0,91100.0
辽宁省,凤城市,124.06605,40.45279,11,8246.969696969698,90716.66666666667
江苏省,张家港市,120.5555,31.87547,5,18000.0,90000.0
湖南省,湘潭市,112.94411,27.82975,6,13640.27777777778,81841.66666666667
香港,香港,114.16546,22.27534,3,25500.0,76500.0
河南省,洛阳市,112.45361,34.61812,7,10728.57142857143,75100.0
浙江省,金华市,119.64759,29.07812,10,7460.0,74600.0
湖北省,咸宁市,114.32245,29.84126,7,10633.33333333333,74433.33333333331
浙江省,绍兴市,120.5802,30.03033,7,10597.619047619048,74183.33333333333
福建省,泉州市,118.67587,24.87389,8,8318.75,66550.0
山东省,日照市,119.52719,35.41646,6,11041.666666666666,66250.0
陕西省,咸阳市,108.70929,34.32932,5,13010.0,65050.0
四川省,绵阳市,104.6796,31.46751,8,7741.666666666666,61933.33333333333
海南省,文昌市,110.79774,19.54329,6,10075.0,60450.0
湖南省,株洲市,113.13396,27.82767,8,7359.375,58875.0
广东省,汕头市,116.68221,23.3535,9,6491.666666666667,58425.0
湖北省,荆门市,112.19945,31.03546,5,11600.0,58000.0
河北省,唐山市,118.18058,39.63048,7,8235.714285714286,57650.0
广东省,揭阳市,116.37271,23.54972,4,14375.0,57500.0
吉林省,吉林市,126.54944,43.83784,4,14331.25,57325.0
江西省,瑞金市,116.02709,25.88562,3,18780.55555555556,56341.66666666667
湖北省,襄阳市,112.12255,32.009,9,6241.666666666667,56175.0
内蒙古自治区,呼和浩特市,111.75199,40.84149,6,9225.0,55350.0
辽宁省,朝阳市,120.4508,41.57347,6,8883.333333333334,53300.0
福建省,龙海市,117.81813,24.44658,3,17500.0,52500.0
江西省,吉安市,114.99376,27.11382,4,13000.0,52000.0
江西省,九江市,116.00146,29.70548,4,12875.0,51500.0
辽宁省,海城市,122.68463,40.88145,5,10295.0,51475.0
广东省,清远市,113.05615,23.68201,3,15083.333333333334,45250.0
江苏省,镇江市,119.425,32.18959,4,10893.75,43575.0
浙江省,丽水市,119.92293,28.4672,2,20500.0,41000.0
黑龙江省,安达市,125.34379,46.41773,6,6733.333333333333,40400.0
浙江省,湖州市,120.08805,30.89305,5,7300.0,36500.0
宁夏回族自治区,银川市,106.23248,38.48644,4,9012.5,36050.0
浙江省,龙泉市,119.14168,28.07434,4,9006.25,36025.0
广东省,肇庆市,112.46528,23.0469,3,12000.0,36000.0
陕西省,安康市,109.02932,32.68486,2,17966.666666666664,35933.33333333333
甘肃省,白银市,104.13773,36.5447,3,11833.333333333334,35500.0
湖南省,岳阳市,113.12919,29.35728,4,8250.0,33000.0
山东省,威海市,122.12171,37.51348,4,7937.5,31750.0
安徽省,明光市,117.98944,32.77699,2,15000.0,30000.0
安徽省,淮南市,116.9998,32.62549,5,5950.0,29750.0
辽宁省,东港市,124.15209,39.86172,4,7256.25,29025.0
福建省,南安市,118.38627,24.9604,3,9500.0,28500.0
江苏省,常熟市,120.75224,31.65381,4,7125.0,28500.0
山东省,潍坊市,119.16176,36.70686,2,14125.0,28250.0
陕西省,延安市,109.48978,36.58529,3,9083.333333333334,27250.0
广东省,湛江市,110.35894,21.27134,3,8983.333333333334,26950.0
四川省,乐山市,103.76539,29.55221,3,8500.0,25500.0
山东省,淄博市,118.0548,36.8131,3,8166.666666666667,24500.0
河南省,南阳市,112.52851,32.99073,1,22500.0,22500.0
江苏省,靖江市,120.27454,32.01494,3,7500.0,22500.0
江西省,宜春市,114.41612,27.81443,2,10666.666666666664,21333.33333333333
安徽省,滁州市,118.31683,32.30181,2,10625.0,21250.0
广东省,梅州市,116.12264,24.28844,3,7083.333333333333,21250.0
河北省,承德市,117.9634,40.9515,2,10500.0,21000.0
黑龙江省,大庆市,125.10307,46.58758,3,6975.0,20925.0
山东省,德州市,116.35927,37.4355,3,6933.333333333333,20800.0
河北省,保定市,115.46459,38.87396,2,10250.0,20500.0
河北省,邯郸市,114.53918,36.62556,4,5062.5,20250.0
甘肃省,庆阳市,107.64292,35.70978,1,20000.0,20000.0
青海省,西宁市,101.77782,36.61729,2,10000.0,20000.0
安徽省,黄山市,118.33866,29.71517,2,9450.0,18900.0
广西壮族自治区,贵港市,109.59764,23.11306,2,9000.0,18000.0
湖北省,鄂州市,114.89495,30.39085,2,8975.0,17950.0
湖北省,荆州市,112.24069,30.33479,2,8825.0,17650.0
江苏省,泰州市,119.92554,32.45546,2,8800.0,17600.0
安徽省,宁国市,118.98336,30.63364,1,17500.0,17500.0
山西省,吕梁市,111.14165,37.51934,2,8133.333333333335,16266.66666666667
湖南省,常德市,111.69854,29.03158,3,5291.666666666667,15875.0
江西省,上饶市,117.94357,28.45463,2,7575.0,15150.0
江西省,乐平市,117.129,28.96173,2,7500.0,15000.0
安徽省,宣城市,118.75866,30.94078,2,7375.0,14750.0
湖南省,沅江市,112.35468,28.84402,2,7375.0,14750.0
浙江省,舟山市,122.20778,29.98539,2,7375.0,14750.0
浙江省,东阳市,120.24191,29.28946,1,13750.0,13750.0
河北省,衡水市,115.67054,37.73886,3,4500.0,13500.0
新疆维吾尔自治区,克拉玛依市,84.88927,45.57999,2,6750.0,13500.0
江苏省,淮安市,119.01595,33.61016,1,12500.0,12500.0
福建省,龙岩市,117.01722,25.07504,2,5162.5,10325.0
广西壮族自治区,柳州市,109.41552,24.32543,2,5000.0,10000.0
福建省,莆田市,119.00771,25.454,1,9700.0,9700.0
云南省,保山市,99.16181,25.11205,1,9500.0,9500.0
河北省,廊坊市,116.68376,39.53775,1,9000.0,9000.0
吉林省,延吉市,129.5091,42.89107,1,8950.0,8950.0
广东省,云浮市,112.04453,22.91525,2,4470.0,8940.0
江苏省,太仓市,121.12975,31.45911,1,8300.0,8300.0
广东省,河源市,114.70065,23.74365,1,7900.0,7900.0
山东省,济宁市,116.58724,35.41459,1,7000.0,7000.0
云南省,丽江市,100.2271,26.85648,1,7000.0,7000.0
福建省,三明市,117.63922,26.26385,1,6000.0,6000.0
湖南省,怀化市,110.0016,27.56974,1,5750.0,5750.0
吉林省,敦化市,128.23109,43.37278,1,5500.0,5500.0
福建省,福安市,119.64768,27.08797,1,4775.0,4775.0
福建省,漳州市,117.64725,24.51347,1,4500.0,4500.0
广东省,茂名市,110.92523,21.66329,1,4475.0,4475.0
河南省,鹤壁市,114.29745,35.747,1,3750.0,3750.0
河南省,安阳市,114.3931,36.09771,1,3750.0,3750.0

猜你喜欢

转载自blog.csdn.net/Yellow_python/article/details/81811778