【图像识别】初探数字识别-识别水表示数

读取图片

用openCV来读取图片灰度值，matplotlib来显示图片。

import matplotlib.pyplot as plt 
import matplotlib.image as mpimg # mpimg 用于读取图片
import numpy as np
import seaborn as sns
import cv2

im = mpimg.imread("./2018shuibiao.jpeg")  
im_gray = cv2.cvtColor(im, cv2.COLOR_BGR2GRAY)   #转换了灰度化
plt.figure('picture')
plt.imshow(im_gray, cmap = 'gray', interpolation = 'bicubic')
plt.xticks([]), plt.yticks([])  # to hide tick values on X and Y axis
plt.show()

水表305*400
图片来自某歌。

灰度直方图

plt.figure('displot')
sns.distplot(im_gray.flatten(),kde = False, color='b')
plt.show()

灰度分布图

Canny边缘检测

gaus = cv2.GaussianBlur(gray,(3,3),0)

#主要调整这个值来决定Canny检测的精密度
low_thre = 110 
#阈值上界定为1.2倍
edges = cv2.Canny(gaus, low_thre, low_thre*1.2, apertureSize=3)

plt.figure('Edge')
plt.imshow(edges, cmap = 'gray', interpolation = 'bicubic')
plt.xticks([]), plt.yticks([])  # to hide tick values on X and Y axis
plt.show()

当阈值较小时，边缘检测的更细致。
threhold较小

当阈值增大后，效果如下。
阈值较大

为了下一步检测到方框，所以选择较大的阈值，避免其他边缘的干扰。

HoughLinesP识别方框

approxPolyDP函数，通常可以识别矩形，但没有研究过。水表表盘的图案较复杂，比一般场景下识别矩形要困难一些。模式识别之线条矩形识别—长方形画布或纸张并提取图像内容


minLineLength = 35
# 同一方向上两条线段判定为一条线段的最大允许间隔（断裂），超过了设定值，则把两条线段当成一条线段，值越大，允许线段上的断裂越大，越有可能检出潜在的直线段
maxLineGap = 10
# 超过设定阈值才被检测出线段，值越大，基本上意味着检出的线段越长，检出的线段个数越少。
threshold = 55

lines = cv2.HoughLinesP(edges, 2.0, np.pi / 180, threshold, maxLineGap=2)
for line in lines:
    for x1, y1, x2, y2 in line:
        cv2.line(img, (x1, y1), (x2, y2), (0, 255, 0), 2)
 
#cv2.imshow("houghline",img)
cv2.imwrite('shuibiao_test.jpg',img)
cv2.waitKey()
cv2.destroyAllWindows()

如果阈值threshold比较小，会识别到更多的短线段，图片如下。并不是我们希望的结果
这里写图片描述

这里写图片描述
方框的横线都检测到了，但是竖线没有被检测到，是因为竖线比较短，如果降低阈值会将别的短线一起引入，而我们希望得到少而长的线。不过，有这个结果之后，也可以通过统计各个线段的端点坐标，求取矩形的四个角坐标。统计各个直线左端点最小值和右端点最大值。

图片截取与分割

获取方框四个点的坐标后。

x1 = 97
x2 = 133
y1 = 114
y2 = 236
rectangle = img[x1:x2, y1:y2]
cv2.imwrite('shuibiao_rectangle.jpg',rectangle)

这里写图片描述