Spark报错：The pivot column feature has more than 10000 distinct values - 代码天地

Spark报错：The pivot column feature has more than 10000 distinct values

业界资讯 2023-06-12 05:46:26 阅读次数: 0

（作者：陈玓玏 data-master)

用pyspark做窄表转宽表的时候，出现报错：

pyspark.sql.utils.AnalysisException: 
u'The pivot column feature has more than 10000 distinct values, 
this could indicate an error. 
If this was intended, 
set spark.sql.pivotMaxValues to at least 
the number of distinct values of the pivot column.;'

在这里插入图片描述

好可怕，看字面意思，我的窄表里出现了超过1W个item，要知道这个是造的因子，也就是说有超过1W个因子？

虽然很可怕，但既然碰到问题，就要解决问题，解决问题的方法，报错里已经给出了，就是设置一下spark.sql.pivotMaxValues这个参数。

然后我就直接在hive里查了一下，我总共有多少个item，查出是2W6，于是把参数spark.sql.pivotMaxValues设置成30000，报错消失。

猜你喜欢

转载自blog.csdn.net/weixin_39750084/article/details/107618042

Spark报错：The pivot column feature has more than 10000 distinct values

ValueError: need more than 0 values to unpack

Oracle中查询时候报错：A column name in the order-by list matches more than one select list column解决方法

res\values\attrs.xml: Error: Found item Attr/text more than one time 编译报错解决方案

SemanticException Column xx Found in more than One Tables/Subqueries

Distinct Values

matlab报错，Assignment has more non-singleton rhs dimensions than non-singleton subscripts. 的解决办法

adb报错：more than one device/emulator

mgr 加入第二个节点报错-[ERROR] [MY-011526] [Repl] Plugin group_replication reported: 'This member has more executed transactions than those present in the grou

【Elasticsearch】Elasticsearch 报错 Values less than -1 bytes are not support

HDU Distinct Values

Distinct Values HDU - 6301

HDU 6301 Distinct Values

POJ 6301 Distinct Values

Distinct Values(贪心)

Another Distinct Values

[转]Mysql报错：Result consisted of more than one row

angularJs报错Warning: Tired to load angular more than once

Centos 报错 Repository extras is listed more than once in the configuration

SqlServer批量新增异常：Prepared or callable statement has more than 2000 parameter markers

Need more values to unpack

there is more than onebean ofxxx

More Than Python

DNNLinearConbinedRegressor的feature_column

HDU 6301 Distinct Values （set）

HDU6301 Distinct Values

Distinct Values（优先队列模拟）

HDU-6301 Distinct Values

HDU多校（Distinct Values）

牛客Another Distinct Values

今日推荐

周排行

四大线程池详解

如何高效使用Vim

Mogodb的常用操作总结

Spyder默认页面布局调整

SAR日志分析

OAuth是一个关于授权（authorization）的开放网络标准，在全世界得到广泛应用，目前的版本是2.0版。本文对OAuth 2.0的设计思路和运行流程，做一个简明通俗的解释，主要参考材料为R

WebService中注解开发，CXF，Spring整合，Rest风格

2019考研英语一 Text1分析

windows下安装docker详细步骤

CentOS 7/6系统升级内核版本到5.2.2

每日归档

更多

2024-08-05(0)

2024-08-04(0)

2024-08-03(0)

2024-08-02(0)

2024-08-01(0)

2024-07-31(0)

2024-07-30(0)

2024-07-29(0)

2024-07-28(0)

2024-07-27(0)