配置步驟為

1. 將$HADOOP_HOME/contrib/fairscheduler/hadoop-fairscheduler-0.20.2-cdh3u3.jar複製到$HADOOP_HOME/lib資料夾中

2. 修改$HADOOP_HOME/conf/mapred-site.xml配置檔案

[html]view plaincopyprint?
			
			<property>  
		
			   <name>mapred.jobtracker.taskSchedulername>  
		
			   <value>org.apache.hadoop.mapred.FairSchedulervalue>  
		
			 property>  
		
			 <property>  
		
			     <name>mapred.fairscheduler.allocation.filename>  
		
			     <value>/home/hadoop/hadoop-0.20.2-cdh3u3/conf/fair-scheduler.xmlvalue>  
		
			  property>  
		
			  <property>  
		
			    <name>mapred.fairscheduler.preemptionname>  
		
			    <value>truevalue>  
		
			  property>  
		
			  <property>  
		
			    <name>mapred.fairscheduler.assignmultiplename>  
		
			    <value>truevalue>  
		
			  property>  
		
			  <property>  
		
			    <name>mapred.fairscheduler.poolnamepropertyname>  
		
			    <value>mapred.queue.namevalue>  
		
			    <description>job.set("mapred.queue.name",pool); // pool is set to either 'high' or 'low' description>  
		
			  property>  
		
			  <property>  
		
			    <name>mapred.fairscheduler.preemption.only.logname>  
		
			    <value>truevalue>  
		
			  property>  
		
			  <property>  
		
			    <name>mapred.fairscheduler.preemption.intervalname>  
		
			    <value>15000value>  
		
			  property>  
		
			  <property>  
		
			    <name>mapred.queue.namesname>  
		
			    <value>default,hadoop,hivevalue>  
		
			  property>

3. 在$HADOOP_HOME/conf/新建配置檔案fair-scheduler.xml

[html]view plaincopyprint?
			
			xml version="1.0"?>  
		
			<allocations>  
		
			<pool name="hive">  
		
			  <minMaps>90minMaps>  
		
			  <minReduces>20minReduces>  
		
			  <maxRunningJobs>20maxRunningJobs>  
		
			  <weight>2.0weight>  
		
			  <minSharePreemptionTimeout>30minSharePreemptionTimeout>  
		
			pool>  
		
			<pool name="hadoop">  
		
			  <minMaps>9minMaps>  
		
			  <minReduces>2minReduces>  
		
			  <maxRunningJobs>20maxRunningJobs>  
		
			  <weight>1.0weight>  
		
			  <minSharePreemptionTimeout>30minSharePreemptionTimeout>  
		
			pool>  
		
			<user name="hadoop">  
		
			    <maxRunningJobs>6maxRunningJobs>  
		
			user>  
		
			<poolMaxJobsDefault>10poolMaxJobsDefault>  
		
			<userMaxJobsDefault>8userMaxJobsDefault>  
		
			<defaultMinSharePreemptionTimeout>600defaultMinSharePreemptionTimeout>  
		
			<fairSharePreemptionTimeout>600fairSharePreemptionTimeout>  
		
			allocations>

4. 在叢集的各個節點執行以上步驟，然後重啟叢集，在即可檢視到排程器執行狀態，如果修改排程器配置的話，只需要修改檔案fair-scheduler.xml ，不需重啟配置即可生效。

5. 在執行hive任務時，設定hive屬於的佇列set mapred.queue.name=hadoop; (set mapred.job.queue.name=hadoop;)

設定hive的任務名稱set mapred.job.name=goldts;

設定任務的優先順序別set mapred.job.priority=HIGH;

6. 如果在執行MR JOB的時候出現XX使用者訪問不了YY佇列的話，就需要在mapred-queue-acls.xml裡配置相應的屬性，來對訪問許可權進行控制，比如：

[html]view plaincopyprint?
			
			<property>  
		
			  <name>mapred.queue.default.acl-submit-jobname>  
		
			  <value>*value>  
		
			  <description> Comma separated list of user and group names that are allowed  
		
			    to submit jobs to the 'default' queue. The user list and the group list  
		
			    are separated by a blank. For e.g. user1,user2 group1,group2.  
		
			    If set to the special value '*', it means all users are allowed to  
		
			    submit jobs. If set to ' '(i.e. space), no user will be allowed to submit  
		
			    jobs.  
		
			    It is only used if authorization is enabled in Map/Reduce by setting the  
		
			    configuration property mapred.acls.enabled to true.  
		
			    Irrespective of this ACL configuration, the user who started the cluster and  
		
			    cluster administrators configured via  
		
			    mapreduce.cluster.administrators can submit jobs.  
		
			  description>  
		
			property>  
		
			<property>  
		
			  <name>mapred.queue.default.acl-administer-jobsname>  
		
			  <value>*value>  
		
			  <description> Comma separated list of user and group names that are allowed  
		
			    to view job details, kill jobs or modify job's priority for all the jobs  
		
			    in the 'default' queue. The user list and the group list  
		
			    are separated by a blank. For e.g. user1,user2 group1,group2.  
		
			    If set to the special value '*', it means all users are allowed to do  
		
			    this operation. If set to ' '(i.e. space), no user will be allowed to do  
		
			    this operation.  
		
			    It is only used if authorization is enabled in Map/Reduce by setting the  
		
			    configuration property mapred.acls.enabled to true.  
		
			    Irrespective of this ACL configuration, the user who started the cluster and  
		
			    cluster administrators configured via  
		
			    mapreduce.cluster.administrators can do the above operations on all the jobs  
		
			    in all the queues. The job owner can do all the above operations on his/her  
		
			    job irrespective of this ACL configuration.  
		
			  description>  
		
			property>

配置hadoop 使用fair scheduler排程器

相關文章