Apache spark 统计spark数据帧中所有列（300列）的每个不同值的出现次数_Apache Spark_Apache Spark Sql - Fatal编程技术网

Warning: file_get_contents(/data/phpspider/zhask/data//catemap/3/apache-spark/6.json): failed to open stream: No such file or directory in /data/phpspider/zhask/libs/function.php on line 167

Warning: Invalid argument supplied for foreach() in /data/phpspider/zhask/libs/tag.function.php on line 1116

Notice: Undefined index: in /data/phpspider/zhask/libs/function.php on line 180

Warning: array_chunk() expects parameter 1 to be array, null given in /data/phpspider/zhask/libs/function.php on line 181
Apache spark 统计spark数据帧中所有列（300列）的每个不同值的出现次数_Apache Spark_Apache Spark Sql - Fatal编程技术网

Apache spark 统计spark数据帧中所有列（300列）的每个不同值的出现次数

apache-spark

Apache spark 统计spark数据帧中所有列（300列）的每个不同值的出现次数,apache-spark,apache-spark-sql,Apache Spark,Apache Spark Sql,我有一个spark数据框，有300列，每列有10个不同的值。我需要计算所有300列的不同值的计数 -------------------------------------------------------- col1 | col2 | col3 ............col299 | col 300 ------------------------------------------------------- value11 | value21

我有一个spark数据框，有300列，每列有10个不同的值。我需要计算所有300列的不同值的计数

  --------------------------------------------------------
     col1    |  col2    | col3 ............col299   | col 300
  -------------------------------------------------------
  value11    | value21  | value31       | value300  | value 301
  value12    | value22  | value32       | value300  | value 301
  value11    | value22  | value33       | value301  | value 302
  value12    | value21  | value33       | value301  | value 302

如果是单列，我使用下面的代码计算

import org.apache.spark.sql.functions.count
df.groupBy("col1").agg(count("col1")).show

但是如何有效地计算300列。请帮忙

您可以很容易地按照下面提到的方法进行操作

首先收集所有列名和转换作为键值。如下

val exprs=df.columns.map（（->“近似计数”\u不同”））.toMap

然后simple
df.groupBy（“col1”）.agg（exprs）
将为您提供所有列的不同值

参考：
您可以按照下面提到的方法轻松完成
首先收集所有列名和转换作为键值。如下
val exprs=df.columns.map（（->“近似计数”\u不同”））.toMap
然后simple
df.groupBy（“col1”）.agg（exprs）
将为您提供所有列的不同值

<>强>参考< /强>：
如果你可以用近似的不同计数考虑使用有效的<代码>近似xOntTyx区别的< /代码>如果你可以用近似的不同计数来考虑，考虑使用有效的<代码>大约

[cobol]相关文章推荐

COBOL如何对两个无序文件进行排序和合并？ cobol

Cobol打印屏幕到文件 cobol

Cobol 需要看看我是否可以使用read语句来完成3种类型的输出吗？ cobol

重写cobol时出错 cobol

Cobol 将字母转换为也包含数字的字符串中的数字 cobol

COBOL进程FSPCCURR在PT8.55.07环境中失败 cobol

Cobol将表单提要字符写入文件 cobol

大型机CEE3DD异常终止-CEE3501S-在COBOL动态调用中未找到模块 cobol

PeopleSoft Cobol指令 cobol

COBOL错误：组项不能有PICTURE子句 cobol

随机文章推荐

如何将Resharper（R）中的默认访问修饰符更改为内部 resharper

可以将Specflow与Resharper一起使用吗？ resharper

使用ReSharper模板自动添加导入 resharper

ReSharper Shift+Alt+L（转到打开文件）在2015年无法使用.resx？ resharper visual-studio-2015

Resharper 将单个语句块自动格式化为一行，并在同一行上使用大括号 resharper visual-studio-2015

引用netstandard2.0库的netcoreapp2.0控制台应用程序的ReSharper intellisense resharper visual-studio-2017

如何配置resharper向C#区域块添加注释？ resharper

[apache spark]相关推荐

Tags

Fortran Aem Iframe Loopbackjs Parse Platform Webstorm Autodesk Forge Service Google Maps Api 3 Xcode Mono Css Dynamics Crm 2011 Kernel Command Line Sprite Kit Android Ndk Eclipse Asterisk Jms Https Plot Hyperledger Fabric Machine Learning Google Cloud Dataflow Routes Visual Studio 2010 Ios8 Tcp Kdb Compilation Xamarin Vbscript Karate Jsf Youtube Api Terminal Ibm Cloud Jquery Sdk Download Applescript Laravel 4 Excel Formula Python 2.7 Cmd Sails.js Webview Internet Explorer Winforms Joomla Sqlite Sencha Touch 2 Tableau Api Sencha Touch Memory Leaks C++11 Responsive Design Facebook Graph Api Mariadb Odoo Tinymce Tensorflow Drupal 6 Google Plus Prometheus Ruby On Rails 3 Mod Rewrite Embedded Programming Languages Entity Framework Core Jqgrid Xml Asp.net Mvc 4 Udp Ssis Email Stata Dll Sublimetext3 Clojure Vmware Db2 Ruby Resharper Sap Requirejs Streaming Blockchain Actionscript Linkedin Cocoa Amazon Redshift Youtube Leaflet Jquery Plugins Docusignapi Artificial Intelligence Google Api Android Layout C++ Cli Jsf 2 R Chart.js Stored Procedures Ibm Midrange Linker Memory Silverlight 4.0 Scripting Sql Sql Server 2005 Jestjs Ssas Build Go Qml Listview Npm Codenameone Firebase Io Process Rally Utf 8 Apache Camel Vhdl Stanford Nlp Methods Swing Ignite Numpy Log4net Ssh Google Apps Script Windows Runtime Properties Class React Native Magento Artifactory Amazon Dynamodb Mediawiki Path Mobile Spring Integration Perl Google App Engine Nlp Pine Script Gcc Breeze Computer Vision Cygwin Sequelize.js Windbg Visual Studio Ethereum Hive Javafx 2 Jdbc Python Robotframework Hash Stm32 Gitlab Google Chrome Log4j Login Shiny Mercurial Drupal Keras Ms Office Directory Twilio Configuration Ssl EmptyTag Internationalization Bots Silverstripe Editor Jekyll Clearcase Wolfram Mathematica Moodle Google Cloud Platform Vue.js Cmake Openlayers Indexing Unicode Checkbox Vb6 Computer Science Yii2 Orchardcms Yii Ios4 Drupal 7

Copyright © 2024. All Rights Reserved by - Fatal编程技术网