Data Science & Hadoop Workflows at Scale With Scalding

Provided by:
0/10 stars
based on  0 reviews
Provided by:
Cost $29/mo
Start Date On demand
Data Science & Hadoop Workflows at Scale With Scalding

Course Details

Cost

$29/mo

Get access to the entire library of over 5000 courses on software engineering, technology and more. Different billing options are available.

Start Free Trial

Upcoming Schedule

  • On demand

Course Provider

Pluralsight online courses
Pluralsight's mission is to publish high quality online training courses for professional developers, IT admins and creative artists. Over 750,000 Learners have already made use of the 3,500+ Courses available in this extensive library. Pluralsight delivers world-class training that¹s easy to comprehend and quick to learn. That¹s the beauty of being taught by experts. Course authors are an elite group of tech and creative professionals, innovators and leaders. As a subscriber, you are ...
Pluralsight's mission is to publish high quality online training courses for professional developers, IT admins and creative artists. Over 750,000 Learners have already made use of the 3,500+ Courses available in this extensive library. Pluralsight delivers world-class training that¹s easy to comprehend and quick to learn. That¹s the beauty of being taught by experts. Course authors are an elite group of tech and creative professionals, innovators and leaders. As a subscriber, you are connected with authors through discussion boards for ongoing, real-time learning. Courses give you the experience and skills you need to succeed on the job and grow your career. With downloadable exercise files, you can follow along with the video to practice applying your new skills. Pre- and post-course assessments track your progress. When you successfully complete a course, we you receive a certificate and an official transcript to validate and build your online resume.

Provider Subject Specialization
Sciences & Technology
Business & Management
8 reviews

Course Description

This course teaches you how to use Scalding (a domain specific language) built on Scala and Cascading to build distributed applications on Hadoop. The course also focuses on the data science aspect using Algebird, an abstract algebra library for Scala, to solve real-world sketching/streaming problems on distributed systems. You will learn how to reason about a variety of problems, how to build and test locally, and how to deploy on Hadoop. You will also learn the algorithms used to solve problems at scale where performance, compute and memory resources, and the window of time you have to process streaming data are all challenges you'll have to overcome, and how you can use Scalding and Algebird to solve for these constraints. This course also covers some Scala basics to get you up to speed and looks into how you can monitor, visualize, and troubleshoot your application's workflow and performance problems. Watch this course if you wer... This course teaches you how to use Scalding (a domain specific language) built on Scala and Cascading to build distributed applications on Hadoop. The course also focuses on the data science aspect using Algebird, an abstract algebra library for Scala, to solve real-world sketching/streaming problems on distributed systems. You will learn how to reason about a variety of problems, how to build and test locally, and how to deploy on Hadoop. You will also learn the algorithms used to solve problems at scale where performance, compute and memory resources, and the window of time you have to process streaming data are all challenges you'll have to overcome, and how you can use Scalding and Algebird to solve for these constraints. This course also covers some Scala basics to get you up to speed and looks into how you can monitor, visualize, and troubleshoot your application's workflow and performance problems. Watch this course if you were considering, or already know how to use Pig, Hive, or any other DSL for Hadoop and not only wanted more power over your workflows, but also a DSL that is actively being developed to support up and coming execution frameworks like Apache Tez and Apache Spark with all the flexibility that a full functional programming language like Scala has to offer. If you're serious about learning how to build enterprise-grade applications on Hadoop, data science, and Lambda architectures, then this course is for you.
Reviews 0/10 stars
0 Reviews for Data Science & Hadoop Workflows at Scale With Scalding

Ratings details

  • 5 stars
  • 4 stars
  • 3 stars
  • 2 stars
  • 1 stars
  • 5 stars
  • 4 stars
  • 3 stars
  • 2 stars
  • 1 stars
  • 5 stars
  • 4 stars
  • 3 stars
  • 2 stars
  • 1 stars

Rankings are based on a provider's overall CourseTalk score, which takes into account both average rating and number of ratings. Stars round to the nearest half.

No reviews yet. Be the first!

Rating Details


  • 5 stars
  • 4 stars
  • 3 stars
  • 2 stars
  • 1 stars
  • 5 stars
  • 4 stars
  • 3 stars
  • 2 stars
  • 1 stars
  • 5 stars
  • 4 stars
  • 3 stars
  • 2 stars
  • 1 stars

Rankings are based on a provider's overall CourseTalk score, which takes into account both average rating and number of ratings. Stars round to the nearest half.