Hbase篇--Hbase和MapReduce結合Api

LHBlog發表於2018-01-16

一.前述

Mapreduce可以自定義Inputforma物件和OutPutformat物件，所以原理上Mapreduce可以和任意輸入源結合。

二.步驟

將結果寫會到hbase中去。

2.1 Main函式

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.hbase.mapreduce.TableMapReduceUtil;
import org.apache.hadoop.io.IntWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Job;
import org.apache.hadoop.mapreduce.lib.input.FileInputFormat;


/**
 * 分析hdfs 文字  統計單詞數量
 * 結果輸出到 hbase表
 * create 'wc','cf'
 * rowkey: 單詞        cf:count=單詞數量
 * @author root
 *
 */
public class WCDemo {

    /**
     * 
     * wc
     * 資料hbase表    rowkey  cell存放文字
     * 結果輸出到 hbase表
     * 
     */

    public static void main(String[] args) throws Exception {
        
        Configuration conf = new Configuration();
        
        conf.set("fs.defaultFS", "hdfs://node1:8020");//設定hdfs叢集nameservices名稱
        conf.set("hbase.zookeeper.quorum", "node4");
        
        Job job = Job.getInstance(conf);
        
        job.setJarByClass(WCDemo.class);
        
        job.setMapperClass(WCMapper.class);
        job.setMapOutputKeyClass(Text.class);
        job.setMapOutputValueClass(IntWritable.class);
        
//        job.setReducerClass();
        
        //addDependencyJars  本地方式執行： 設定為false
//        TableMapReduceUtil.initTableReducerJob("wc", WCReducer.class, job);
        TableMapReduceUtil.initTableReducerJob("wc",WCReducer.class, job,
                null, null, null, null, false);
        
        Path path = new Path("/user/wc");
        FileInputFormat.addInputPath(job, path);
        
        boolean flag = job.waitForCompletion(true);
        if(flag) {
            System.out.println("success~~");
        }
    }
    
}

2.2 Mapper函式（和正常的Mapper沒啥區別）

import java.io.IOException;

import org.apache.hadoop.io.IntWritable;
import org.apache.hadoop.io.LongWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Mapper;

public class WCMapper extends Mapper<LongWritable, Text, Text, IntWritable> {

    @Override
    protected void map(LongWritable key, Text value, Context context)
            throws IOException, InterruptedException {
        String[] words = value.toString().split(" ");
        
        for (String w : words) {
            context.write(new Text(w), new IntWritable(1));
        }
    }
}

2.3 Reduce函式（主要是把Put物件寫出去）

import java.io.IOException;

import org.apache.hadoop.hbase.client.Put;
import org.apache.hadoop.hbase.io.ImmutableBytesWritable;
import org.apache.hadoop.hbase.mapreduce.TableReducer;
import org.apache.hadoop.io.IntWritable;
import org.apache.hadoop.io.Text;

public class WCReducer extends
        TableReducer<Text, IntWritable, ImmutableBytesWritable> {

    @Override
    protected void reduce(Text text, Iterable<IntWritable> iterable,
            Context context) throws IOException, InterruptedException {
        
        int sum = 0;
        
        for (IntWritable i : iterable) {
            sum += i.get();
        }
        
        Put put = new Put(text.toString().getBytes());
        put.add("cf".getBytes(), "count".getBytes(), (sum+"").getBytes());
        
        context.write(null, put);
    }
}

MapReduce和Spark讀取HBase快照表
2023-09-30
Spark
黑猴子的家：HBase 自定義HBase-MapReduce案列一
2018-10-05
HBase學習之Hbase的邏輯結構和物理結構
2021-01-02
hbase region 合併
2022-07-04
Hbase基礎篇
2018-08-22
HBase Region合併分析
2018-09-15
HBase 基本入門篇
2020-04-06
HBase （三）之 API的使用
2020-09-26
API
hbase之hbase shell
2018-10-09
HBase2實戰：HBase Flink和Kafka整合
2019-01-09
Kafka
HBase學習的第四天--HBase的進階與API
2024-08-16
API
Hbase(二)Hbase常用操作
2020-11-14
一條資料HBase之旅，簡明HBase入門教程開篇
2018-06-15
HBase學習的第五天--HBase進階結尾和phoenix開頭
2024-08-17
hbase - [04] java訪問hbase
2024-03-28
Java
HBase 教程：什麼是 HBase？
2021-12-30
HBase
2024-08-06
快速理解HBase和BigTable
2018-10-30
Hbase架構和搭建
2024-11-17
架構
hbase 2.0.2 java api的簡單使用
2018-09-14
JavaAPI
HBase知識點總結
2021-11-23
Hbase問題小結(一)
2021-05-12
HBase 系列（五）——HBase常用 Shell 命令
2021-09-09
Hbase-原理-region合併和hfile的合併（大合併、小合併）
2020-11-27
Hbase單機部署 java連線Hbase
2020-11-09
Java
Hbase一：Hbase介紹及特點
2023-02-25
Hbase學習二：Hbase資料特點和架構特點
2023-02-25
架構
Hive和Hbase的區別
2023-01-10
Hive
HBase知識點集中總結
2020-06-03
HBase 資料儲存結構
2021-02-28
HBase概述
2018-05-10
hbase整理
2018-03-12
HBase實操：HBase-Spark-Read-Demo 分享
2021-09-09
Spark
HBase學習之二: hbase分頁查詢
2018-05-22
hbase與phoenix整合(使用phoenix操作hbase資料)
2019-03-17
spark與hbase
2018-11-19
Spark
HBase學習
2019-04-14
HBase vs Hive
2018-06-09
Hive
HBase進階
2020-10-26

Hbase篇--Hbase和MapReduce結合Api

相關文章