使用Alluxio(前Tachyon)來加速大資料計算

OReillyData發表於2017-04-17

講師:Bin Fan (Alluxio), Haoyuan Li (Alluxio)

09:00–12:30 週四, 2017-07-13

資料工程和架構

地點: 多功能廳2

觀眾水平: 中級


必要預備知識

Basic concept of Hadoop/Spark.


您將學到什麼

瞭解Alluxio是什麼,如何配置/執行Alluxio和如何構建簡單的應用程式以受益於Alluxio。


描述

在這個三個小時的教學課中, 我們將向參與者講授Alluxio基礎知識,演示Alluxio如何工作以及如何使用此係統幫助分散式計算引擎(如Spark或MapReduce)以記憶體速度共享資料。在上機環節裡, 講師將指導參與者部署和執行Alluxio,將外部儲存系統(如S3)掛載至Alluxio名稱空間,以及使用Alluxio命令列工具以及WebUI,最後使用通用計算引擎(例如,Apache Spark,Hadoop MapReduce)來搭建一個簡單的大資料應用,並使用這一應用從Alluxio來讀取和寫入資料。


講師介紹

Bin Fan (Alluxio)

Bin Fan is a software engineer at Alluxio and a PMC member of the Alluxio project. Prior to Alluxio, Bin worked at Google building next-generation storage infrastructure, where he won Google’s Technical Infrastructure award. Bin has a PhD in computer science from Carnegie Mellon University.


Haoyuan Li (Alluxio)

Haoyuan Li is founder and CEO of Alluxio (formerly Tachyon Nexus), a memory-speed virtual distributed storage system. Before founding the company, Haoyuan was working on his PhD at UC Berkeley’s AMPLab, where he cocreated Alluxio. He is also a founding committer of Apache Spark. Previously, he worked at Conviva and Google. Haoyuan holds an MS from Cornell University and a BS from Peking University.




Strata Data Conference北京站已經開啟註冊系統,閱讀原文可瀏覽截止到目前為止的講師名單和已經確認的議題,最優惠票價期截止到5月5日為止儘快註冊以確保留位

640?wx_fmt=png


相關文章