Hive 具有两列的配置单元分区

Hive 具有两列的配置单元分区,hive,Hive,这更像是一个理论问题 我有一个配置单元表,有两个分区列,分别是col1和col2 如果我写一个如下的查询,我会使用分区的好处吗 select count(*) from table A where col1='A' and col2 > '1' and col2 < '6' 我的第一个查询是否无法利用分区,有没有办法做到这一点?您要访问多少数据。你是如何比较时间的?第一个查询与第二个查询相比有70%的数据,在hueI中运行这个命令我说的是表的总大小。如果你只有很少的MB,你不会看到

这更像是一个理论问题 我有一个配置单元表,有两个分区列,分别是col1和col2 如果我写一个如下的查询,我会使用分区的好处吗

select count(*) from table A where col1='A' and col2 > '1' and col2 < '6'

我的第一个查询是否无法利用分区,有没有办法做到这一点?

您要访问多少数据。你是如何比较时间的?第一个查询与第二个查询相比有70%的数据,在hueI中运行这个命令我说的是表的总大小。如果你只有很少的MB,你不会看到太多的不同。另外,在那个时候,有多少其他查询正在运行也很重要。如果你真的想比较一下时间<代码>使用总资源的30%创建一个不同的队列
if you have high values of data then definitely the first query will give much better performance.
Also if you want to compare them I would suggest you run at least 3 time the same query and then calculate the average of them.
You make get some different because so many other jobs will run parallel, So might not get enough resource then first attempt will tale longer time. I mean say they so many reason.
if you have high values of data then definitely the first query will give much better performance.
Also if you want to compare them I would suggest you run at least 3 time the same query and then calculate the average of them.
You make get some different because so many other jobs will run parallel, So might not get enough resource then first attempt will tale longer time. I mean say they so many reason.