Hive ANALYZE queries take a lot of time

Tags: performance, hadoop, hive, query-tuning, apache-tez

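To speed up ETL queries on large tables, we run many analyze queries on these tables and their date columns at night. But these analyze column queries take a lot of memory and time. We are using Tez.

Is there any way to optimize the analyze queries, for example with some set commands?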

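If you load the table with INSERT OVERWRITE, statistics can be gathered automatically during the INSERT OVERWRITE query by setting hive.stats.autogather=true.

If the table is partitioned and partitions are loaded incrementally, you can analyze only the last partition instead of the whole table:

ANALYZE TABLE [db_name.]tablename [PARTITION(partcol1[=val1], partcol2[=val2], ...)]

See the examples here:

For ORC files, you can set hive.stats.gather.num.threads to increase parallelism.

See the full list of statistics settings here: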
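As a rough illustration of the "analyze only the last partition" advice, here is a minimal Python sketch that a nightly job could use to build the statement text. The helper name `analyze_statement`, the table `sales_db.orders`, the partition column `load_date`, and the column `order_date` are all hypothetical. The `COMPUTE STATISTICS`, `FOR COLUMNS`, and `NOSCAN` clauses come from the Hive ANALYZE syntax, not from the truncated syntax line in the answer above. The helper only produces a string; running it against Hive is left to whatever client the job already uses.

```python
from typing import Mapping, Optional, Sequence


def analyze_statement(
    table: str,
    partition: Optional[Mapping[str, str]] = None,
    columns: Sequence[str] = (),
    noscan: bool = False,
) -> str:
    """Build one HiveQL ANALYZE statement, limited to one partition if given."""
    # Assumption: NOSCAN and FOR COLUMNS are treated as mutually exclusive
    # options, so this sketch rejects the combination up front.
    if noscan and columns:
        raise ValueError("NOSCAN cannot be combined with FOR COLUMNS")

    stmt = f"ANALYZE TABLE {table}"
    if partition:
        # Assumption: partition values are string-typed and contain no quotes.
        spec = ", ".join(f"{col}='{val}'" for col, val in partition.items())
        stmt += f" PARTITION({spec})"
    stmt += " COMPUTE STATISTICS"
    if columns:
        # Restrict column statistics to the columns the ETL queries filter on.
        stmt += " FOR COLUMNS " + ", ".join(columns)
    if noscan:
        stmt += " NOSCAN"
    return stmt + ";"


if __name__ == "__main__":
    # Hypothetical names: analyze only the partition loaded tonight,
    # and only the date column used by the ETL queries.
    print(analyze_statement(
        "sales_db.orders",
        partition={"load_date": "2024-01-31"},
        columns=["order_date"],
    ))
```

Limiting the statement to one partition and to the columns that queries actually filter on keeps the scanned data small, which is the point of the advice above; whether that is enough for a given workload would need to be measured.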



