溫馨提示×

溫馨提示×

您好,登錄后才能下訂單哦!

密碼登錄×
登錄注冊×
其他方式登錄
點擊 登錄注冊 即表示同意《億速云用戶服務(wù)條款》

CDH培訓——Cloudera Developer Training for Spark and hadoop

發(fā)布時間:2020-07-17 19:31:57 來源:網(wǎng)絡(luò) 閱讀:882 作者:IRENEN2007 欄目:大數(shù)據(jù)

Cloudera Developer Training for Spark and hadoop

Course Time2016627-30

Course Location:上海市 浦東新區(qū) 張江高科 伯克利工程創(chuàng)新中心

Contact us400-679-6113

QQ1438118790


CertificationCCA-175

Learn how toimport data into your Apache Hadoop closter and process it with sparkhive、flumesqoop、impala and other Hadoop ecosystem tools.


Audience and Prerequisites

This coursedesigned for developers and engineers who have programming experience. Apachespark examples and hands-on exercises are presented in Scala and Python, so theability to program in one of those languages is required. Basic familiaritywith the Linux command line is assumed. Basic knowledge of SQL is helpful. Priorknowledge of Hadoop is not required.


Course outlineDeveloperTraining for Spark and hadoop

  • Introduction to Hadoop and the Hadoop ecosystem

  • Hadoop architecture and HDFS

  • Importing relational data with Apache spoop

  • Introduction to impala and hive

  • Modeling and managing data with impala and hive

  • Data formats

  • Data partitioning

  • Capturing data with Apache flume

  • Spark basics

  • Working with RDDs in spark

  • Writing and deploying spark applications

  • Parallel programming with spark

  • Spark caching and persistence

  • Common patterns in spark data processing

  • Previewspark SQL


向AI問一下細節(jié)

免責聲明:本站發(fā)布的內(nèi)容(圖片、視頻和文字)以原創(chuàng)、轉(zhuǎn)載和分享為主,文章觀點不代表本網(wǎng)站立場,如果涉及侵權(quán)請聯(lián)系站長郵箱:is@yisu.com進行舉報,并提供相關(guān)證據(jù),一經(jīng)查實,將立刻刪除涉嫌侵權(quán)內(nèi)容。

AI