Python时间序列LSTM预测系列学习笔记（1）-单变量

本文是对：

https://machinelearningmastery.com/time-series-forecasting-long-short-term-memory-network-python/

https://blog.csdn.net/iyangdi/article/details/77853055

两篇博文的学习笔记，两个博主笔风都很浪，有些细节一笔带过，本人以谦逊的态度进行了学习和整理，笔记内容都在代码的注释中。有不清楚的可以去原博主稳重查看。

数据集下载：https://datamarket.com/data/set/22r0/sales-of-shampoo-over-a-three-year-period

from pandas import read_csv
from pandas import datetime
from sklearn.metrics import mean_squared_error
from math import sqrt
from matplotlib import pyplot
"""https://blog.csdn.net/iyangdi/article/details/77853055
    这个算法并没有什么用，只是求了两个数组之间的标准差
    我们强行粗犷的把test数组中第n-1个元素变成了预测数组predictions中的第n个元素，作为测试结果来使用
    最后计算了了test数组和predictions数组之间的标准差，当做预测效果的评估
    所以第一节毫无意义"""

# 加载数据
def parser(x):
    return datetime.strptime(x, '%Y/%m/%d')


series = read_csv('data_set/shampoo-sales.csv', header=0, parse_dates=[0], index_col=0, squeeze=True,
                  date_parser=parser)

# 分成训练和测试集合，前24列给训练集，后12行给测试集
X = series.values
train, test = X[0:-12], X[-12:]


#把数组train赋值给一个history列表
history = [x for x in train]
#创建一个predictions列表，这个列表记录了观测值，创建一个predictions数组中第n个元素，对应test数组中第n-1个元素
predictions = list()
for i in range(len(test)):
    predictions.append(history[-1])  # history[-1],就是执行预测，这里我们只是假设predictions数组就是我们预测的结果
    history.append(test[i])  # 将新的测试数据加入模型

# 预测效果评估
rmse = sqrt(mean_squared_error(test, predictions))#返回的结果是测试数组test，和观测数组predictions的标准差，https://www.cnblogs.com/nolonely/p/7009001.html
print('RMSE:%.3f' % rmse)

# 画出预测+观测值
pyplot.plot(test)#测试数组
pyplot.plot(predictions)#观测数组
pyplot.show()

Python时间序列LSTM预测系列学习笔记（1）-单变量

猜你喜欢