CCA Spark and Hadoop Developer Exam (CCA175) Exam Syllabus

Posted on 2020-6-5 15:00:36


  • Number of Questions: 8–12 performance-based (hands-on) tasks on a Cloudera Enterprise cluster. See below for full cluster configuration
  • Time Limit: 120 minutes
  • Passing Score: 70%
  • Language: English


Exam Question Format
Each CCA question requires you to solve a particular scenario. In some cases, a tool such as Impala or Hive may be used. In most cases, coding is required.

Evaluation, Score Reporting, and Certificate
Your exam is graded immediately upon submission and you are e-mailed a score report within three days of your exam. Your score report displays the problem number for each problem you attempted and a grade on that problem. If you fail a problem, the score report includes the criteria you failed (e.g., “Records contain incorrect data” or “Incorrect file format”). We do not report more information in order to protect the exam content.

If you pass the exam, you receive a second e-mail within a week of your exam with your digital certificate as a PDF and your license number.

Audience and Prerequisites
There are no prerequisites for taking any Cloudera certification exam. The CCA Spark and Hadoop Developer exam (CCA175) follows the same objectives as Cloudera Developer Training for Spark and Hadoop, and the training course is excellent preparation for the exam.


Required Skills

Transform, Stage, and Store

Convert a set of data values in a given format stored in HDFS into new data values or a new data format and write them into HDFS.


  • Load data from HDFS for use in Spark applications
  • Write the results back into HDFS using Spark
  • Read and write files in a variety of file formats
  • Perform standard extract, transform, load (ETL) processes on data using the Spark API

Data Analysis

Use Spark SQL to interact with the metastore programmatically in your applications. Generate reports by using queries against loaded data.


  • Use metastore tables as an input source or an output sink for Spark applications
  • Understand the fundamentals of querying datasets in Spark
  • Filter data using Spark
  • Write queries that calculate aggregate statistics
  • Join disparate datasets using Spark
  • Produce ranked or sorted data

Configuration

This is a practical exam and the candidate should be familiar with all aspects of generating a result, not just writing code.


  • Supply command-line options to change your application configuration, such as increasing available memory
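
To make the command-line options mentioned above concrete, the small Python helper below assembles a `spark-submit` invocation with memory settings. It is a hedged sketch: the helper `build_submit_command`, the application file `my_app.py`, and the chosen values are hypothetical, while the flags themselves (`--master`, `--driver-memory`, `--executor-memory`, `--num-executors`, `--conf`) are standard `spark-submit` options. The snippet only prints the command and does not run it.

```python
import shlex


def build_submit_command(app, master="yarn", driver_memory="1g",
                         executor_memory="2g", num_executors=2, conf=None):
    """Assemble a spark-submit argument list (options precede the app file)."""
    cmd = [
        "spark-submit",
        "--master", master,
        "--driver-memory", driver_memory,
        "--executor-memory", executor_memory,
        "--num-executors", str(num_executors),
    ]
    # Arbitrary Spark properties are passed as repeated --conf key=value pairs.
    for key, value in sorted((conf or {}).items()):
        cmd += ["--conf", "{}={}".format(key, value)]
    cmd.append(app)
    return cmd


command = build_submit_command(
    "my_app.py",  # hypothetical application file
    executor_memory="4g",  # raise executor memory above the helper's default
    conf={"spark.sql.shuffle.partitions": "8"},
)
print(shlex.join(command))
```

On a machine where `spark-submit` is installed, the resulting list could be passed to `subprocess.run(command, check=True)`; the same options can also be given to the interactive shells such as `pyspark` and `spark-shell`.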

Exam delivery and cluster information

CCA175 is a remote-proctored exam available anywhere, anytime.

CCA175 is a hands-on, practical exam using Cloudera technologies. Each user is given their own CDH6 (currently 6.1.1) cluster pre-loaded with Spark 2.4.

All websites, including Google/search functionality, and access to Spark external packages are disabled. You may not use notes or other exam aids.


Posted on 2020-6-5 15:00:39
Just passing by, bumping the thread, hehe