1,下载相关工具并安装
R下载地址:https://cloud.r-project.org/,选择对应的系统版本进行下载
R开发工具推荐使用RStudio,下载地址:https://www.rstudio.com/products/rstudio/download/
2,R访问Hive配置
加载连接信息:drv<-JDBC("org.apache.hive.jdbc.HiveDriver", list.files("HIVE_LIB_DIR",pattern="jar$", full.names=TRUE, recursive=TRUE))
建立连接:conn <- dbConnect(drv,"jdbc:hive2://hostname:10000/db_test;user=xxx;password=xxx")
查看表:dbListTables(conn)
其中:HIVE_LIB_DIR为R访问Hive时所依赖的jar文件路径名,jar文件参考目录如下:
commons-collections-3.2.2.jar commons-configuration-1.9.jar commons-lang-2.6.jar commons-logging-1.1.jar commons-logging-api-1.1.jar guava-11.0.2.jar hadoop-auth-2.6.0.jar hadoop-common-2.6.0.jar hive-cli-0.11.0.jar hive-common-1.1.0.jar hive-jdbc-1.1.0.jar hive-service-1.1.0.jar hive-shims-0.23-1.1.0.jar hive-shims-common-1.1.0.jar hive_metastore.jar hive_service.jar httpclient-4.2.5.jar httpcore-4.2.5.jar libfb303-0.9.0.jar libthrift-0.9.0.jar log4j-1.2.14.jar mysql-connector-java-5.1.38.jar ql.jar slf4j-api-1.5.11.jar slf4j-log4j12-1.5.11.jar TCLIServiceClient.jar zookeeper-3.4.6.jar