hbase-表快速导出到本地磁盘

HBase表可以使用Hadoop的MapReduce框架进行快速导出到本地磁盘。以下是步骤：

编写一个MapReduce作业，其中Mapper读取HBase表中的数据，Reducer将数据写入本地磁盘。
在Mapper中，使用TableInputFormat类设置输入HBase表，并将Scan对象设置为该表的扫描器。
在Reducer中，使用FileOutputFormat类将数据写入本地磁盘。
打包MapReduce作业，并在Hadoop集群中运行。

以下是一个示例Mapper和Reducer：

public class HBaseMapper extends TableMapper<Text, Text> {
  private Text outKey = new Text();
  private Text outValue = new Text();

  public void map(ImmutableBytesWritable row, Result value, Context context) throws IOException, InterruptedException {
    String key = new String(row.get(), "UTF-8");
    String val = new String(value.getValue(Bytes.toBytes("cf"), Bytes.toBytes("col")), "UTF-8");
    outKey.set(key);
    outValue.set(val);
    context.write(outKey, outValue);
  }
}

public class LocalFileReducer extends Reducer<Text, Text, NullWritable, Text> {
  private Text outValue = new Text();

  public void reduce(Text key, Iterable<Text> values, Context context) throws IOException, InterruptedException {
    for (Text value : values) {
      outValue.set(key.toString() + "," + value.toString());
      context.write(NullWritable.get(), outValue);
    }
  }
}

在运行作业之前，需要将HBase的配置添加到MapReduce的类路径中。可以使用以下命令：

export HBASE_CONF_DIR=/path/to/hbase/conf
export HADOOP_CLASSPATH=$HADOOP_CLASSPATH:$HBASE_CONF_DIR

然后，可以运行作业：

hadoop jar /path/to/mapreduce.jar input_table output_path

其中，mapreduce.jar是打包好的MapReduce作业，input_table是要导出的HBase表名，output_path是输出路径。