Sorting 配置单元-在索引或排序列中搜索，读取整个存储桶_Sorting_Hive_Bucket_Orc - Fatal编程技术网

Warning: file_get_contents(/data/phpspider/zhask/data//catemap/8/sorting/2.json): failed to open stream: No such file or directory in /data/phpspider/zhask/libs/function.php on line 167

Warning: Invalid argument supplied for foreach() in /data/phpspider/zhask/libs/tag.function.php on line 1116

Notice: Undefined index: in /data/phpspider/zhask/libs/function.php on line 180

Warning: array_chunk() expects parameter 1 to be array, null given in /data/phpspider/zhask/libs/function.php on line 181
Sorting 配置单元-在索引或排序列中搜索，读取整个存储桶_Sorting_Hive_Bucket_Orc - Fatal编程技术网

Sorting 配置单元-在索引或排序列中搜索，读取整个存储桶

sorting hive

Sorting 配置单元-在索引或排序列中搜索，读取整个存储桶,sorting,hive,bucket,orc,Sorting,Hive,Bucket,Orc,配置单元中的查询不使用排序，读取整个存储桶。这是正常的还是误解表: col_a; col_b; values; 规格：我的桌子是按“col_a”列扣好并分类的表具有ORC格式结果: 当我查询“colu_a”时，将读取整个存储桶当我索引“colu_b”并查询“col_b”时，读取的数据量超过一整桶表配置： inputFormat:org.apache.hadoop.hive.ql.io.orc.orInputFormat outputFormat:org.apache.ha

配置单元中的查询不使用排序，读取整个存储桶。这是正常的还是误解

表:

col_a; col_b; values;

规格：

我的桌子是按“col_a”列扣好并分类的
表具有ORC格式

结果:

当我查询“colu_a”时，将读取整个存储桶

当我索引“colu_b”并查询“col_b”时，读取的数据量超过一整桶

表配置：

inputFormat:org.apache.hadoop.hive.ql.io.orc.orInputFormat

outputFormat:org.apache.hadoop.hive.ql.io.orc.OrcOutputFormat

serializationLib:org.apache.hadoop.hive.ql.io.orc.OrcSerde

巴克特科尔斯：[col_a]

sortCols:col_a

订单：1

插入以填充表格并选择以获取值：

hive.enforce.sorting=true; hive.enforce.bucketing=true; FROM table_temp INSERT OVERWRITE TABLE table_sorted PARTITION (date=1) SELECT col_a, col_b DISTRIBUTE BY col_a SORT BY col_a; SELECT * from table_sorted where date=1 AND col_a=986123;

我的想法
我认为sort允许我们不读取整个bucket，而是允许我们访问特定行或一系列行。我还认为索引可以为我们提供一行或一个区间。我错了吗？顺便说一下，谢谢你抽出时间

[hive]相关文章推荐

Hive 如何更新/删除配置单元分区？ hive

Hive是以什么模式安装的？ hive

Hive 加载每n分钟接收一次的文件 hive apache-pig

Hive 如何在不舍入的情况下截断蜂巢中的两位数到四位数？ hive

Hive 外部表（配置单元）仅从文件中选择少数列 hive

Hive “ShowTableExtended”应该列出分区下的文件吗？ hive

'编译语句时出错：失败：HiveAuthzPluginException不支持权限类型All' hive

Hive 我们什么时候应该在配置单元中查找外部表和内部表？ hive

Hive 蜂箱压实工艺 hive

Hive 在配置单元中创建日期表 hive calendar

Hive SQOOP：在导出到postgress DB之前自定义输入数据 hive

Hive中的时间戳转换 hive

Hive 具有大WHERE条件的配置单元查询 hive

Hive 配置单元如何限制collect_集合中的条目数 hive

Hive 镶木地板格式的蜂巢桌负载 hive

Hive 直线查询输出采用JSON格式，而不是csv表 hive

Hive 如何将包含时间字符串值的csv文件加载到配置单元中的时间戳 hive

Hive 多个子查询表达式：配置单元 hive

当目标表较大时，Spark sql插入表创建.hive暂存目录 hive

tHiveConnection与Microsoft HDInsight分发版中作业结果文件夹和部署Blob的重要性 hive talend

随机文章推荐

Winforms 如何使用Windows窗体在窗口标题栏中绘制自定义按钮？ winforms winapi

Winforms Windows窗体设计时错误 winforms visual-studio-2008

更改wpf应用程序中托管winForms元素的可见性 winforms

Winforms F#和winform控件问题 winforms f#

Winforms 如何对动态加载的DataGridView进行数据绑定？ winforms

winforms中的cookies winforms cookies

Winforms树视图，递归检查子节点问题 winforms checkbox

按延迟顺序显示集合项-WinForms winforms

Winforms C更改winform左上角的图标 winforms

Winforms Datagrid以编程方式取消选择单元格 winforms

Winforms 我应该使用dataset类作为缓存来加速SQLite加载吗 winforms sqlite

Winforms 从网络共享运行.NET程序 winforms

Winforms EF：代码优先约束 winforms entity-framework

Winforms 秒表，更新日志窗口窗体 winforms visual-studio

ms winforms应用程序中ms Access的连接字符串 winforms ms-access

Winforms DesignMode Property是如何工作的？ winforms

Winforms Windows窗体：手动绘制组合框的SelectedItem winforms combobox

Winforms 诊断为什么Windows 10 IoT上的自定义外壳会出现黑屏 winforms shell

Winforms vc++；2015 RichTextBox的使用 winforms c++-cli

Winforms 当DropdownStyle为DropDown时如何清除组合框文本 winforms combobox

[sorting]相关推荐

Tags

Cloud Foundry Groovy Passwords Logging Asp.net Unit Testing Twitter Bootstrap 3 Odata Pagination Testing Google Plus Pip Ocaml Ruby Next.js Graph Timer Autocomplete Azure Cosmosdb Gis E Commerce Notepad++ Parallel Processing Three.js Jsf Continuous Integration Serialization Automated Tests Cron Jsf 2 Sql Server 2008 R2 Entity Framework 4 Sonarqube Here Api Wordpress Jsp Concurrency Dynamics Crm 2011 Glassfish Windows Phone Vim Abap Windows 8 Eclipse Cluster Computing Azure Sharepoint 2013 Jira Matlab Report Vue.js Rally Content Management System Swagger Magento2 C++ Mapreduce Grep X86 Compression Breeze Stored Procedures For Loop Go Jdbc Rdf Msbuild Cordova Cucumber Properties Pine Script Tfs Devexpress Clang Sharepoint 2010 Unicode Tableau Api Gitlab Autodesk Forge Types Html Text Symfony1 Typo3 Couchdb Amp Html Crystal Reports Couchbase Openerp Amazon Dynamodb Oracle Floating Point Mediawiki Nuget Axapta D3.js Yaml Xslt Dll Tomcat Encoding Push Notification Mono Events Reference Ckeditor Highcharts Sharepoint Odoo Openssl Ios7 Github Drop Down Menu Azure Service Fabric Cloud Opencl Vuejs2 Winapi Triggers Machine Learning Ffmpeg Pycharm Memory Management Encryption Visual Studio 2013 Ibm Midrange Doxygen Reflection Django Rest Framework Checkbox Excel Spring Batch Network Programming Svg .net Core Cmd Spring Karate Cocoa Cmake Keyboard Delphi Drupal 6 Programming Languages Vb6 Embedded Command Line Animation Apache Numpy Symfony Airflow Download Keycloak Gcc Ip Azure Active Directory Hyperlink Log4j Iphone Ssrs 2008 Pyspark Methods Date Google Chrome Extension Spring Security Elm Computer Vision Configuration Quickbooks Jquery Parse Platform Windows Runtime Shell Sockets Asp.net Mvc Excel Formula Jasper Reports Login Ms Word Xsd Tags Ruby On Rails 3.1 Youtube Ios8 Blazor Mobile Python Objective C Signalr Tcl Ios Testng Keras Jqgrid Oracle10g Sqlalchemy Azure Devops F# Plot Aurelia

Copyright © 2024. All Rights Reserved by - Fatal编程技术网