Home Big Data Managing Big Data with R and Hadoop

Managing Big Data with R and Hadoop

358
0
Managing Big Data with R and Hadoop

Learn how to manage and analyse big data using the R programming language and Hadoop programming framework.

PRACE Online Course Highlights
  • 5 weeks long
  • 4 hours per week
  • Learn for FREE, Ugpradable
  • Self-Paced
  • Taught by: Janez Povh, Biljana Mileva Boshkoska, Leon Kos
  • View Course Syllabus

Online Course Details:

This course will give you access to a virtual environment with installations of Hadoop, R and Rstudio to get hands-on experience with big data management. Several unique examples from statistical learning and related R code for map-reduce operations will be available for testing and learning.

Those with basic knowledge in statistical learning and R will better understand the methods behind and how to run them in parallel using map-reduce functions and Hadoop data storage. At the end of the course you will get access to RHadoop on a supercomputer at University of Ljubljana.

Why join the course?

This course will give you access to a virtual environment with installations of Hadoop, R and Rstudio to get hands-on experience with big data management. Several unique examples from statistical learning and related R code for map-reduce operations will be available for testing and learning.

Those with basic knowledge in statistical learning and R will better understand the methods behind and how to run them in parallel using map-reduce functions and Hadoop data storage. At the end of the course you will get access to RHadoop on a supercomputer at University of Ljubljana.

What topics will you cover?

  • Welcome to BIG DATA
  • Working with Hadoop
  • First steps in R and RHadoop
  • Statistical learning with RHadoop: clustering
  • Statistical learning with RHadoop: regression and classification

What will you achieve?

By the end of the course, you’ll be able to…

  • Explore basic functionality of Apache Hadoop and of RHadoop
  • Experiment how to achieve performance of modern supercomputing
  • Experiment regression, clustering and classification with RHadoop
  • Investigate basic functionality of Bash terminal window
  • Knowledge about statistical learning to instances of data provided by edcators
  • How to do big data management with RHadoop on real supercomputer provided by Universiy of Ljubljana

Take This Online Course