map/reduce實現排序

林六天發表於2014-07-07

原文網址 : https://www.cnblogs.com/6tian/p/3829169.html

  1 import java.io.IOException;
  2 
  3 import org.apache.hadoop.conf.Configuration;
  4 import org.apache.hadoop.conf.Configured;
  5 import org.apache.hadoop.fs.Path;
  6 import org.apache.hadoop.io.IntWritable;
  7 import org.apache.hadoop.io.LongWritable;
  8 import org.apache.hadoop.io.Text;
  9 import org.apache.hadoop.mapreduce.Job;
 10 import org.apache.hadoop.mapreduce.Mapper;
 11 import org.apache.hadoop.mapreduce.Partitioner;
 12 import org.apache.hadoop.mapreduce.Reducer;
 13 import org.apache.hadoop.mapreduce.lib.input.FileInputFormat;
 14 import org.apache.hadoop.mapreduce.lib.input.TextInputFormat;
 15 import org.apache.hadoop.mapreduce.lib.output.FileOutputFormat;
 16 import org.apache.hadoop.mapreduce.lib.output.TextOutputFormat;
 17 import org.apache.hadoop.util.Tool;
 18 import org.apache.hadoop.util.ToolRunner;
 19 public class Sort extends Configured implements Tool {
 20     /*
 21      * 排序
 22      * 輸入格式：每個資料佔一行
 23      * 輸出格式：
 24      * 1 21
 25      * 2 32
 26      * 3 62
 27      * 設計思路：
 28      * 使用reduce自帶的預設排序規則。MapReduce按照key值進行排序。如果Key值為Intwritable型別，則按照數字大小排序
 29      * 如果key值為Text型別，則按照字典順序對字串進行排序。
 30      * 注意：要重寫Partition函式。Reduce排序只能保證自己區域性的資料順序，並不能保證全域性的。
 31      * */
 32     public static class Map extends Mapper<LongWritable,Text,IntWritable,IntWritable>{
 33         private IntWritable line=new IntWritable();
 34         public void map(LongWritable key,Text value,Context context)throws IOException,InterruptedException{
 35             line.set(Integer.parseInt(value.toString()));
 36             context.write(line, new IntWritable(1));            
 37         }
 38         
 39     }
 40     
 41     public static class Reduce extends Reducer<IntWritable,IntWritable,IntWritable,IntWritable>{
 42         private IntWritable num=new IntWritable(1);
 43         public void reduce(IntWritable key,Iterable<IntWritable> values,Context context)throws IOException,InterruptedException{
 44             for(IntWritable var:values){
 45             context.write(num, key);
 46             num=new IntWritable(num.get()+1);
 47             }
 48         }
 49         
 50     }
 51     
 52     public static class Partition extends Partitioner<IntWritable ,IntWritable>{
 53 
 54         @Override
 55         public int getPartition(IntWritable key, IntWritable value, int numPartitions) {
 56             // TODO Auto-generated method stub
 57             System.out.println(numPartitions);
 58             int maxnum=65223;
 59             int bound=maxnum/numPartitions+1;
 60             for(int i=0;i<numPartitions;i++)
 61             {
 62                 if(key.get()>=bound*(i-1)&&key.get()<=bound*i)
 63                 {
 64                     return i;
 65                 }
 66             }
 67             return 0;
 68         }
 69         
 70     }
 71     
 72     public int run(String[] args)throws Exception{
 73         Configuration conf=new Configuration();
 74         Job job=new Job(conf,"Sort");
 75         job.setJarByClass(Sort.class);
 76         
 77         job.setOutputKeyClass(IntWritable.class);
 78         job.setOutputValueClass(IntWritable.class);
 79         
 80         
 81         job.setMapperClass(Map.class);
 82         job.setReducerClass(Reduce.class);
 83         job.setPartitionerClass(Partition.class);
 84         
 85         job.setInputFormatClass(TextInputFormat.class);
 86         job.setOutputFormatClass(TextOutputFormat.class);
 87         
 88         FileInputFormat.addInputPath(job, new Path(args[0]));
 89         FileOutputFormat.setOutputPath(job, new Path(args[1]));
 90         
 91         boolean success=job.waitForCompletion(true);
 92         return success?0:1;
 93     }
 94     
 95     public static void main(String[] args)throws Exception{
 96         int ret=ToolRunner.run(new Sort(), args);
 97         System.exit(ret);
 98     }
 99 
100 }

reduce實現filter,map 陣列扁平化等
2019-04-30
Filter陣列
JS Array.reduce 實現 Array.map 和 Array.filter
2018-12-08
JSFilter
在幕後看看Swift中的Map，Filter和Reduce的實現
2019-02-21
SwiftFilter
Hadoop Map Reduce 漫談
2018-10-30
Hadoop
forEach、map、reduce比較
2018-12-10
java8 實現map以value值排序
2018-09-13
Java排序
例項講解hadoop中的map/reduce查詢(python語言實現
2021-09-09
HadoopPython
python內建函式 map/reduce
2019-02-16
Python函式
JavaScript map和reduce的區別
2024-11-22
JavaScript
分散式計算與Map Reduce
2021-01-03
分散式
Map-Reduce資料分析之二
2018-11-19
map、reduce、filter、for...of、for...in等總結
2019-01-23
Filter
map切片排序
2022-02-08
排序
Python學習筆記 - filter，map，reduce，zip
2019-01-07
Python筆記Filter
python之高階函式map，reduce，filter用法
2018-08-11
Python函式Filter
python-python的sao操作 map reduce filter
2018-07-18
PythonFilter
陣列的 map, filter ，sort和 reduce 用法
2018-07-29
陣列Filter
javascript高階函式---filter---map---reduce
2020-10-25
JavaScript函式Filter
五、GO程式設計模式：MAP-REDUCE
2022-02-06
Go程式設計設計模式
JavaScript（1）高階函式filter、map、reduce
2021-06-30
JavaScript函式Filter
GO程式設計模式05：MAP-REDUCE
2020-12-30
Go程式設計設計模式
[譯] 圖解 Map、Reduce 和 Filter 陣列方法
2019-04-11
圖解Filter陣列
理解Swift高階函式之map, filter, reduce
2018-03-11
Swift函式Filter
【web前端】自己實現Array.reduce()
2018-08-06
Web前端
python中快速處理關鍵字map,reduce,filter
2020-10-07
PythonFilter
陣列的forEach,map,filter,reduce,reduceRight,every,some方法
2019-04-20
陣列Filter
JavaScript 4/30: 陣列的 map, filter 和 reduce 用法
2018-04-19
JavaScript陣列Filter
陣列的reduce操作+物件陣列的map操作
2024-07-08
陣列物件
Array.prototype.reduce 的理解與實現
2018-12-16
用whistle實現map local
2018-05-01
javascript實現Map結構
2021-09-09
JavaScript
python常用函式進階(2)之map,filter,reduce,zip
2019-07-29
Python函式Filter
【大資料】深入原始碼解析Map Reduce的架構
2020-09-23
大資料原始碼架構
map和set對vector排序
2024-04-20
排序
精讀《用 Reduce 實現 Promise 序列執行》
2018-10-29
Promise
Map-Reduce 思想在 ABAP 程式設計中的一個實際應用案例
2022-03-18
程式設計
[翻譯]map和reduce，處理資料結構的利器
2019-02-25
資料結構
Python 進階之路 (五) map, filter, reduce, zip 一網打盡
2019-02-11
PythonFilter
Golang Map實現（四） map 的賦值和擴容
2020-04-30
Golang賦值

map/reduce實現 排序

相關文章

map/reduce實現排序