Download the webinar replay and presentation: R + Hadoop = Big Data Analytics

RHadoop is an open source project spearheaded by Revolution Analytics to grant data scientists access to Hadoop’s scalability from their favorite language, R.   It allows users to write general MapReduce programs, offering the full power and ecosystem of an existing, established programming language.  RHadoop is comprised of three packages.

  1. RHDFS, which provides file level manipulation for HDFS, the Hadoop file system
  2. RHBASE, which provides access to HBASE, the Hadoop database
  3. rmr, which allows you to write MapReduce programs in R

In this webinar, Antonio will provide a brief introduction to Hadoop and R.  He will describe how rmr allows R developers to program in the MapReduce framework, and provides for all developers an alternative way to implement MapReduce programs that strikes a delicate compromise between power and usability.  He’ll explain:

  • Its simplicity.  For example, you don’t need to replace the R interpreter with a special run-time—it is just a library
  • How rmr’s handful of functions (with a modest number of arguments and sensible defaults) can be combined in many useful ways
  • Using examples such as machine learning and statistics, he’ll show examples of the power of the API 
  • Ways you can contribute to the further development of the RHadoop project.

Complete the form below to download.

  •  
Legal   |   Contact Us © 2014 Revolution Analytics