Cloudera Certified Administrator for Apache Hadoop(CCAH認證)
Exam Sections and Blueprint
1. HDFS (17%)
- Describe the function of HDFS daemons
- Describe the normal operation of an Apache Hadoop cluster, both in data storage and in data processing
- Identify current features of computing systems that motivate a system like Apache Hadoop
- Classify major goals of HDFS Design
- Given a scenario, identify appropriate use case for HDFS Federation
- Identify components and daemon of an HDFS HA-Quorum cluster
- Analyze the role of HDFS security (Kerberos)
- Determine the best data serialization choice for a given scenario
- Describe file read and write paths
- Identify the commands to manipulate files in the Hadoop File System Shell
2. YARN and MapReduce version 2 (MRv2) (17%)
- Understand how upgrading a cluster from Hadoop 1 to Hadoop 2 affects cluster settings
- Understand how to deploy MapReduce v2 (MRv2 / YARN), including all YARN daemons
- Understand basic design strategy for MapReduce v2 (MRv2)
- Determine how YARN handles resource allocations
- Identify the workflow of MapReduce job running on YARN
- Determine which files you must change and how in order to migrate a cluster from MapReduce version 1 (MRv1) to MapReduce version 2 (MRv2) running on YARN
3. Hadoop Cluster Planning (16%)
- Principal points to consider in choosing the hardware and operating systems to host an Apache Hadoop cluster
- Analyze the choices in selecting an OS
- Understand kernel tuning and disk swapping
- Given a scenario and workload pattern, identify a hardware configuration appropriate to the scenario
- Given a scenario, determine the ecosystem components your cluster needs to run in order to fulfill the SLA
- Cluster sizing: given a scenario and frequency of execution, identify the specifics for the workload, including CPU, memory, storage, disk I/O
- Disk Sizing and Configuration, including JBOD versus RAID, SANs, virtualization, and disk sizing requirements in a cluster
- Network Topologies: understand network usage in Hadoop (for both HDFS and MapReduce) and propose or identify key network design components for a given scenario
4. Hadoop Cluster Installation and Administration (25%)
- Given a scenario, identify how the cluster will handle disk and machine failures
- Analyze a logging configuration and logging configuration file format
- Understand the basics of Hadoop metrics and cluster health monitoring
- Identify the function and purpose of available tools for cluster monitoring
- Be able to install all the ecoystme components in CDH 5, including (but not limited to): Impala, Flume, Oozie, Hue, Cloudera Manager, Sqoop, Hive, and Pig
- Identify the function and purpose of available tools for managing the Apache Hadoop file system
5. Resource Management (10%)
- Understand the overall design goals of each of Hadoop schedulers
- Given a scenario, determine how the FIFO Scheduler allocates cluster resources
- Given a scenario, determine how the Fair Scheduler allocates cluster resources under YARN
- Given a scenario, determine how the Capacity Scheduler allocates cluster resources
6. Monitoring and Logging (15%)
- Understand the functions and features of Hadoop’s metric collection abilities
- Analyze the NameNode and JobTracker Web UIs
- Understand how to monitor cluster daemons
- Identify and monitor CPU usage on master nodes
- Describe how to monitor swap and memory allocation on all nodes
- Identify how to view and manage Hadoop’s log files
- Interpret a log file
來自 “ ITPUB部落格 ” ,連結:http://blog.itpub.net/30089851/viewspace-2055292/,如需轉載,請註明出處,否則將追究法律責任。
相關文章
- Apache Hadoop管理認證培訓(Cloudera)ApacheHadoopCloud
- Cloudera hadoop認證總結CloudHadoop
- 透過Docker快速部署啟動Apache Hadoop及ClouderaDockerApacheHadoopCloud
- Portworx Certified Admin (PCA) 認證工程師火熱上線!PCA工程師
- Hadoop 許可權認證Hadoop
- nginx,apache的alias和認證功能NginxApache
- 【原創】Apache HTTP Server認證初探ApacheHTTPServer
- 透過Cloudera最高階別認證,再迎創新里程碑!Cloud
- AWS 認證解決方案架構師 – 助理級 (AWS Certified Solutions Architect – Associate)架構
- 【喜報】恭喜孫治榮獲Oracle Certified Master(OCM)認證大師稱號OracleAST
- Hadoop將死?Cloudera CEO怒懟GartnerHadoopCloud
- CCAH-CCA-500-4題:Where are Hadoop task log files stored?Hadoop
- Hadoop哪家強?Hortonworks、Cloudera、MapR案例集錦HadoopCloud
- cloudera learning7:Hadoop資源管理CloudHadoop
- hadoop認證須知和報名流程Hadoop
- Hadoop叢集機器的SSH認證配置Hadoop
- [翻譯-Shiro]-Apache Shiro Java認證指南ApacheJava
- What Is Apache HadoopApacheHadoop
- 建立本地yum軟體源,為本地Package安裝Cloudera Manager、Cloudera Hadoop及Impala做準備PackageCloudHadoop
- 基於MySQL的Apache使用者認證例項(轉)MySqlApache
- 周邊生態|StreamNative 聯合 Cloudera 開源 Apache Pulsar+Apache NiFi 整合處理器CloudApacheNifi
- Apache Hadoop Day5ApacheHadoop
- Apache Hadoop 入門教程ApacheHadoop
- 什麼是 Apache Hadoop?ApacheHadoop
- RHCE7認證學習筆記38——Apache配置與管理筆記Apache
- Cloudera和Hortonworks宣佈合併:對Hadoop的一記重創!CloudHadoop
- HTTP認證之基本認證——Basic(一)HTTP
- HTTP認證之基本認證——Basic(二)HTTP
- Apache Ranger——Hadoop ACL控制工具ApacheRangerHadoop
- Apache Hadoop Yarn 學習(二)ApacheHadoopYarn
- Apache Hadoop creator — Doug CuttingApacheHadoop
- Oracle中兩種認證方式:OS認證與口令檔案認證Oracle
- 從零自學Hadoop(01):認識HadoopHadoop
- Cloudera編譯好的各種hadoop,oozie等元件壓縮包URLCloud編譯Hadoop元件
- 學習三:基於Hadoop的Cloudera CDH3平臺安裝HadoopCloud
- 認證授權方案之JwtBearer認證JWT
- HTTP認證之摘要認證——Digest(一)HTTP
- Laravel 認證原理及完全自定義認證Laravel