Hbase 如何在google data proc中从bigtable读取数据

Hbase 如何在google data proc中从bigtable读取数据,hbase,bigtable,google-cloud-dataproc,google-cloud-bigtable,Hbase,Bigtable,Google Cloud Dataproc,Google Cloud Bigtable,我正在尝试从Google cloud data proc中的Bigtable读取数据。 下面是我用来从Bigdtable读取数据的代码 PipelineOptions options = PipelineOptionsFactory.fromArgs(args).create(); options.setRunner(BlockingDataflowPipelineRunner.class); Scan scan = new Scan(); sca

我正在尝试从Google cloud data proc中的Bigtable读取数据。 下面是我用来从Bigdtable读取数据的代码

PipelineOptions options = PipelineOptionsFactory.fromArgs(args).create();
        options.setRunner(BlockingDataflowPipelineRunner.class);
        Scan scan = new Scan();
        scan.setFilter(new FirstKeyOnlyFilter());
        Pipeline p = Pipeline.create(options);
        p.apply(Read.from(CloudBigtableIO.read(new CloudBigtableScanConfiguration.Builder()
                .withProjectId("xxxxxxxx").withZoneId("xxxxxxx")
                .withClusterId("xxxxxx").withTableId("xxxxx").withScan(scan).build())))
                .apply(ParDo.named("Reading data from big table").of(new DoFn<Result, Mutation>() {

                    @Override
                    public void processElement(DoFn<Result, Mutation>.ProcessContext arg0) throws Exception {

                        System.out.println("Inside printing");
                        if (arg0==null)
                        {
                            System.out.println("arg0 is null");
                        } else
                        {

                            System.out.println("arg0 is not null");
                            System.out.println(arg0.element());
                        }

                    }

                }));

        p.run();
谁能告诉我我做错了什么。

这是一个不幸的错误。我们修复了底层实现,我们希望在下周左右发布新版本的客户端。我建议更改这一行:

System.out.println(arg0.element())

例如:

System.out.println(Bytes.toStringBinary(arg0.element().getRow());

很抱歉给你添麻烦

2017-03-21T12:29:28.884Z: Error:   (deec5a839a59cbca): java.lang.ArrayIndexOutOfBoundsException: 12338
    at org.apache.hadoop.hbase.KeyValue.keyToString(KeyValue.java:1231)
    at org.apache.hadoop.hbase.KeyValue.keyToString(KeyValue.java:1190)
    at com.google.bigtable.repackaged.com.google.cloud.hbase.adapters.read.RowCell.toString(RowCell.java:234)
    at org.apache.hadoop.hbase.client.Result.toString(Result.java:804)
    at java.lang.String.valueOf(String.java:2994)
    at java.io.PrintStream.println(PrintStream.java:821)
    at com.slb.StarterPipeline$2.processElement(StarterPipeline.java:102)