flink 1.6.3在执行批任务的时候,如果operator的并行度不同,有些任务执行完,jobManager直接会kill掉未完成的任务,会抛以下异常
Caused by: org.apache.flink.runtime.io.network.netty.exception.RemoteTransportException: Lost connection to task manager 'zk03.td.com/172.26.125.25:34669'. This indicates that the remote task manager was lost.
at org.apache.flink.runtime.io.network.netty.CreditBasedPartitionRequestClientHandler.exceptionCaught(CreditBasedPartitionRequestClientHandler.java:160)
at org.apache.flink.shaded.netty4.io.netty.channel.AbstractChannelHandlerContext.invokeExceptionCaught(AbstractChannelHandlerContext.java:285)
at org.apache.flink.shaded.netty4.io.netty.channel.AbstractChannelHandlerContext.invokeExceptionCaught(AbstractChannelHandlerContext.java:264)
at org.apache.flink.shaded.netty4.io.netty.channel.AbstractChannelHandlerContext.fireExceptionCaught(AbstractChannelHandlerContext.java:256)
at org.apache.flink.shaded.netty4.io.netty.channel.ChannelInboundHandlerAdapter.exceptionCaught(ChannelInboundHandlerAdapter.java:131)
at org.apache.flink.shaded.netty4.io.netty.channel.AbstractChannelHandlerContext.invokeExceptionCaught
这个问题已经有人在github上解决,但没有合并到官网可下载的版本中
https://issues.apache.org/jira/browse/FLINK-10941
自己可以整理代码,build 发版,build好的包(已经在我们的项目中使用)地址: flink-dist_2.11-1.6.3-bugfix.jar