修改 Hive Metastore 里记录的 InputFormat、OutputFormat - 代码天地

修改 Hive Metastore 里记录的 InputFormat、OutputFormat

其他 2018-12-31 11:31:03 阅读次数: 0

版权声明：本文为博主原创文章，未经博主允许不得转载。 https://blog.csdn.net/Koprvhdix/article/details/79741845

解决方案写在前面：alter table xxxx set fileformat parquet

因为同事升级Spark时出的bug，误以为需要修改 Hive Metastore 的记录。然后历程比较坎坷，所以记录一下

Spark 1.6.2 创建分区表时，在 Hive Metastore 里记录的是

# Storage Information
InputFormat:            org.apache.hadoop.mapred.SequenceFileInputFormat
OutputFormat:           org.apache.hadoop.hive.ql.io.HiveSequenceFileOutputFormat

需要修改为 parquet 格式。找了老多blog，很多只记录了如何修改SerDe Library，没有说怎么修改InputFormat，最后从 Hive 的 jira(HIVE-6756) 里获得启发。应该修改 table 的 fileformat。再结合抛砖引玉的blog 。每个分区都应该修改 fileformat。

最后是两种方案：一种是用 2.2 把分区表全表重建一遍。还有一种是每个分区都去修改 fileformat。

路漫漫其修远兮，加油加油！

猜你喜欢

转载自blog.csdn.net/Koprvhdix/article/details/79741845

修改 Hive Metastore 里记录的 InputFormat、OutputFormat

hive metastore

HIVE --- Metastore

Hive: Metastore Configuration

保存hive的metastore

Hive Metastore原理及配置

Hive(17):metastore服务

Hive的metastore报错日志

1.6 Hive配置metastore

hive开启metastore服务

hive metastore 启动出错解决

Pig: using hive metastore in oozie

impala + hive metastore遇到的问题

hive的metastore与hiveserver2

Spark连接Hive的metastore异常

Hadoop的OutputFormat和InputFormat

Hive metastore三种配置方式

Hive metastore三种存储方式

hadoop hive中metastore报错的解决

hive metastore 基础表简绍

hive metastore 报错 binlog mode 不对问题

hive安装--设置mysql为远端metastore

Hive Metastore 创建数据库失败

hive启动MetaStore报错解决方案

Hive为什么要启用Metastore？

Hive metastore（元数据）配置到 MySql

hive --metastore三种模式

Hive MetaStore 在快手遇到的挑战与优化

hive 架构及 metastore 功能简单介绍

Hive安装配置指北（含Hive Metastore详解）

今日推荐

周排行

深度学习------Lingvo框架下的加速通道GPipe

webjars管理静态资源

C专家编程_2.2

mysql 源码安装

json文件操作

123231432

注解的实现

Spring MVC 控制器

《人月神话》读后感二

C#使用HttpWebRequest和HttpWebResponse上传文件示例

每日归档

更多

2024-09-08(0)

2024-09-07(0)

2024-09-06(0)

2024-09-05(0)

2024-09-04(0)

2024-09-03(0)

2024-09-02(0)

2024-09-01(0)

2024-08-31(0)

2024-08-30(0)