Hive 配置单元中集合中元素的平均计数？_Hive_Aggregate Functions_Explode_Hiveql_Apache Hive - Fatal编程技术网

Hive 配置单元中集合中元素的平均计数？

hive

Hive 配置单元中集合中元素的平均计数？,hive,aggregate-functions,explode,hiveql,apache-hive,Hive,Aggregate Functions,Explode,Hiveql,Apache Hive,我有两列id和段。段是一组以逗号分隔的字符串。我需要找到所有表中的平均段数。一种方法是使用两个单独的查询- A - select count(*) from table_name; B - select count(*) from table_name LATERAL VIEW explode(split(segment, ',') lTable AS singleSegment where segment != "" avg = B/A 在上述情况下，答案为8/4=2 有没有更好的方法来实现

我有两列id和段。段是一组以逗号分隔的字符串。我需要找到所有表中的平均段数。一种方法是使用两个单独的查询-

A - select count(*) from table_name;
B - select count(*) from table_name LATERAL VIEW explode(split(segment, ',') lTable AS singleSegment where segment != ""
avg = B/A

在上述情况下，答案为8/4=2

有没有更好的方法来实现这一点？

试试：

select sum(CASE segment 
           WHEN '' THEN 0 
           ELSE  size(split(segment,','))
           END
           )*1.0/count(*) from table_name;

如果您的id字段是唯一的，并且您希望向段部分添加一个筛选器，或防止其他格式错误的

段值，如a、b、
和a、b
，您可以执行以下操作：
SELECT SUM(seg_size)*1.0/count(*) FROM (
    SELECT count(*) as seg_size from table_name
    LATERAL VIEW explode(split(segment, ',')) lTable AS singleSegment
    WHERE trim(singleSegment) != ""
    GROUP BY id
) sizes

然后您可以将其他内容添加到where子句中
但是此查询需要运行两个配置单元作业，而较简单的查询需要运行一个配置单元作业，并且要求id字段唯一。
上述查询的较长版本运行良好。谢谢是的，我删除了错误的第一个查询，所以你真的是指较短的查询，现在：）@BlitzKrieg较长的查询速度会较慢，但它确实提供了更多的灵活性。




[ms word]相关文章推荐



                                                        
Ms word Word模板是否有独立于平台的基于Web的替代品？
ms-word 
Ms word 在Microsoft Word 2010文档中创建可编辑区域
ms-wordms-office 
Ms word 如何使用OpenXML从段落中查找页码？
ms-word 
Ms word MS Word：更改现有标题名称
ms-word 
Ms word 防止垂直合并的单元格跨页面中断
ms-word 
Ms word 超链接反斜杠自动转换为斜杠
ms-word 
Ms word 将word转换为pdf:某些超链接不起作用
ms-word 
Ms word 我们可以在所有平台上向Word文档添加跟踪像素吗？
ms-wordlogic 
Ms word 将具有邮件合并逻辑的WordPerfect文件转换为.docx
ms-word 
Ms word “中的Word邮件合并错误”；“按类别分类”；
ms-word 
Ms word 如何使用Office JS启用/禁用Ms Word命令按钮？
ms-wordms-officeoffice-js 
                                       





随机文章推荐



                                                        
如何从Mule 3入站文件端点传递java.io.File
mule 
mule 3，如何将响应从出站端点发送到另一个端点
mule 
Mule独立服务器实例堆问题
mule 
Mule 通过java代码设置全局属性
mule 
使用mule salescloud connector在salesforce中创建多个记录
mule 
Mule 如何启动MMC代理？
mule 
Mule 是否将贴图对象列表转换为固定宽度的文件？
mule 
mule数据库迭代数据和映射
mule 
Mule重用请求应答（相同的JMS队列）
mule 
Mule http侦听器和带有域的log4j2配置
mule 
Mule中DataWeave的资源属性不接受流变量
mule 
Mule动态属性文件引用
mule 
Mule:将连接器引用提取到属性文件
mule 
在Mulesoft中使用默认值转换XML
mule 
如何通过Mule AS400连接器执行AS400命令？如何检查响应？
muleibm-midrange 
Mule 将映射转换为同一映射的值列表
mule 
Mule 3到Mule 4错误处理查询错误类型和http状态
mule 
带有标准War文件部署的Mule 4.x示例
mule 
在mule 4的日志中获取隐式注入错误
mule 
Mule 4:批处理：批处理步骤中的接受表达式出错
mule


                                        

                                        
                                        


                                                
                                                        [hive]相关推荐
                                                        
Hive 使用Apache配置单元对日志数据进行会话的更好方法？
									Hive
							 
Hive 在线程中运行配置单元-0.9.0异常时出错；“主要”；java.lang.NoSuchFieldError:类型
									Hive
							 
Hive 配置单元ddl脚本的标准文件扩展名？
									Hive
							 
Hive 如何使用配置单元向HBase插入实时查询数据
									Hive
							 									Hbase
							 
Hive 如何在将配置单元数据移动到DynamoDB时设置默认列值
									Hive
							 									Amazon Dynamodb
							 
Hive 向分区列添加注释
									Hive
							 
Hive 蜂巢中描述和描述的区别
									Hive
							 
Hive AWS-EMR中ETL的自动配置单元或级联
									Hive
							 
Hive 在配置单元数据输入中加载子字符串
									Hive
							 
Hive 使用配置单元更改数据捕获
									Hive
							 
Hive 计算蜂巢中的成对距离
									Hive
							 
Main类[org.apache.oozie.action.hadoop.HiveMain]，退出代码[40000]
									Hive
							 
Hive 配置单元中的数据集大小是多少
									Hive
							 
Hive 在配置单元中将字符串转换为日期/时间戳
									Hive
							 
Hive 如何更改配置单元表中列名的长度？
									Hive
							 
在HiveQL中读取HDFS扩展属性
									Hive
							 
将Null替换为Pig/Hive中同一列中先前已知的行值
									Hive
							 									Apache Pig
							 
如何在hive中将12小时时间戳转换为24小时时间戳？
									Hive
							 
Hive 使用Lastmodified的SQOOP增量导入
									Hive
							 
Hive 为什么配置单元中的窗口函数不支持按日期排序？
									Hive
							 
Hive 在SQL中重复组中的值
									Hive
							 
Hive SQOOP——SQL Server中的模式查询
									Hive
							 
Hive 如何从配置单元表中删除字符串列中的重复项
									Hive
							 
Hive 配置单元SQL正在提供无关的输入''；应为'；）'；错误
									Hive
							 
Hive 如何为配置单元查询生成一个随机数（仅生成一次）？
									Hive
							 
Hive “与”的区别是什么；地点“；及；路径“；蜂箱中的桌子
									Hive
							 
Hive Apache配置单元：重命名数组类型的列<；结构<&燃气轮机&燃气轮机；
									Hive
							 
Hive 配置单元SQL-条件计数
									Hive
							 
Hive 选择表示分组依据的最大（日期）的字符串列？[蜂箱]
									Hive
							 
Hive 使用Sqoop摄取的表的配置单元元存储中的行计数为零
									Hive
							 
                                                        
                                                

                                                
                                                        Tags
                                                        
Vaadin
Pdf
Enums
Google Compute Engine
Configuration
Knockout.js
Sublimetext2
Corda
Directx
Io
Entity Framework Core
Struts2
Cryptography
Calendar
Java Me
Scikit Learn
Dependencies
Gnuplot
Youtube
Asp Classic
Azure Service Fabric
Qt4
Dialogflow Es
Gruntjs
Zend Framework
Extjs4
Flutter
Random
Weblogic
Colors
Ibm Midrange
Xpages
Docusignapi
Arangodb
Ada
Video Streaming
Sencha Touch 2
Lua
Https
Mips
Polymer
Python 3.x
Scroll
Google App Maker
Xamarin
Visual C++
Firebase
Cluster Computing
Struct
Cmake
Lotus Notes
Inheritance
Debian
Dictionary
Windows Mobile
Amazon Redshift
Oracle10g
Jmeter
Vagrant
Continuous Integration
Drupal 6
Inno Setup
Flask
Glsl
Parsing
Machine Learning
Adobe
Camera
Terraform
Serialization
Scripting
Primefaces
Amp Html
Design Patterns
Xcode
Yocto
Web Crawler
Llvm
Air
Julia
Robotframework
Pandas
Ssas
Sorting
Xmpp
Artifactory
Pycharm
Zend Framework2
Cmd
Operating System
Akka
Azure Functions
Grails
Erlang
Common Lisp
Timer
Dns
Windows 10
Navigation
Version Control
Omnet++
Batch File
.htaccess
Mapbox
Netty
Coldfusion
Nsis
Jquery
Algorithm
Ruby On Rails 3
Itext
Build
Symfony1
Fullcalendar
Visual Studio 2017
Spring Integration
Backbone.js
Asp.net
Windows Services
Matplotlib
Gcc
Odata
Visual Studio 2012
Internationalization
.net 4.0
Cloud
Data Structures
Google Cloud Storage
Coding Style
Windows
File Io
Swing
Vb6
Localization
Svn
Entity Framework 4
Encoding
Ios7
Coffeescript
Ms Word
Botframework
Rabbitmq
Ffmpeg
Css
Join
Certificate
Webview
Phpstorm
Azure Cosmosdb
Visual Studio 2010
Codeigniter
Hash
User Interface
Blockchain
Highcharts
Next.js
Animation
.net Core
F#
Git
Fortran
Vector
Websphere
Map
Linkedin
Dynamic
Csv
Error Handling
Sequelize.js
Jboss
Jersey
Discord.js
Openerp
Sails.js
Joomla
Sitecore
Ibm Mq
Eclipse Plugin
Command Line
For Loop
Doxygen
Interface
C++
Sml
Node.js
Postman
Nhibernate
Django
Cron
Virtualbox
Monitoring
Sql
File
Drupal
Silverstripe
System Verilog
Recursion
Outlook
Scala
Xamarin.android
Wso2


                

                        
						
                        
                                
                                        
                                                
                                                        
                                                                Copyright © 2024. All Rights Reserved by  - Fatal编程技术网