How to get the actual table size of an HBase table mapped in Hive
I am working with an HBase table in Hive, for example:
CREATE EXTERNAL TABLE `ddid_link_msisdn`(
`msisdn` string COMMENT 'from deserializer',
`ddid` string COMMENT 'from deserializer')
ROW FORMAT SERDE
'org.apache.hadoop.hive.hbase.HBaseSerDe'
STORED BY
'org.apache.hadoop.hive.hbase.HBaseStorageHandler'
WITH SERDEPROPERTIES (
'hbase.columns.mapping'=':key,fc:ddid',
'serialization.format'='1')
TBLPROPERTIES (
'hbase.table.name'='unifieddata:ddid_link_msisdn_ddid',
'transient_lastDdlTime'='1535099920')
Everything works fine, but...
When I run ANALYZE TABLE ddid_link_msisdn COMPUTE STATISTICS
and then ask for the table description with DESCRIBE EXTENDED ddid_link_msisdn,
I get:
... a lot of info ...etc etc...
{
totalSize=35566884,
numRows=427422,
rawDataSize=35139462,
COLUMN_STATS_ACCURATE=true, numFiles=1, transient_lastDdlTime=1591783620}
... other info ...
numRows is exactly the number of rows, which is no surprise, and totalSize or rawDataSize is about 35 MB. But if I look at HDFS, I see:
root@mid1-db138hd-12 [10:42am] ~> hdfs dfs -du -h /hbase/data/unifieddata
337 1011 /hbase/data/unifieddata/ddid_hub
265.7 M 797.2 M /hbase/data/unifieddata/ddid_link_cat_ddid
>> 45.2 M 135.5 M /hbase/data/unifieddata/ddid_link_msisdn_ddid <---
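To make the mismatch concrete, here is a minimal Python sketch that only redoes the arithmetic on the figures quoted above. It does not query Hive or HDFS. It assumes that `hdfs dfs -du -h` reports binary units (1 M = 1024 * 1024 bytes) and that the second column is the space consumed across all replicas. The constant and function names are made up for illustration.

```python
# Figures copied from the question; nothing here talks to Hive or HDFS.
HIVE_TOTAL_SIZE = 35_566_884     # totalSize from DESCRIBE EXTENDED, bytes
HIVE_RAW_DATA_SIZE = 35_139_462  # rawDataSize from DESCRIBE EXTENDED, bytes
HIVE_NUM_ROWS = 427_422          # numRows from DESCRIBE EXTENDED
HDFS_SIZE_MIB = 45.2             # first column of `hdfs dfs -du -h`
HDFS_CONSUMED_MIB = 135.5        # second column of `hdfs dfs -du -h`

# Assumption: the -h flag uses binary units.
BYTES_PER_MIB = 1024 * 1024


def to_mib(num_bytes: int) -> float:
    """Convert a byte count to MiB."""
    return num_bytes / BYTES_PER_MIB


# Hive's reported size, expressed in the same unit as the HDFS listing.
hive_total_mib = to_mib(HIVE_TOTAL_SIZE)

# How much larger the HBase directory is than Hive's totalSize.
gap_mib = HDFS_SIZE_MIB - hive_total_mib

# Ratio of consumed space to logical size in the HDFS listing.
replica_ratio = HDFS_CONSUMED_MIB / HDFS_SIZE_MIB

# Difference between totalSize and rawDataSize, spread over the rows.
per_row_difference = (HIVE_TOTAL_SIZE - HIVE_RAW_DATA_SIZE) / HIVE_NUM_ROWS

print(f"Hive totalSize:        {hive_total_mib:.2f} MiB")
print(f"HDFS directory size:   {HDFS_SIZE_MIB:.2f} MiB")
print(f"Gap (HDFS - Hive):     {gap_mib:.2f} MiB")
print(f"Consumed / size ratio: {replica_ratio:.2f}")
print(f"(totalSize - rawDataSize) / numRows: {per_row_difference}")
```

Two things stand out from these numbers. The ratio between the two HDFS columns is close to 3, which would be consistent with a replication factor of 3, so 45.2 M is the figure to compare against Hive. And totalSize exceeds rawDataSize by exactly numRows bytes, which suggests Hive's statistics are derived from the rows it read through the SerDe and not from the on-disk HBase files. That interpretation is a guess from the arithmetic, not something the output states.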