Data Manipulation at Scale: Systems and Algorithms

Provided by:
5/10 stars
based on  2 reviews
Provided by:
Cost FREE , Add a Verified Certificate for $79
Start Date Upcoming
Data Manipulation at Scale: Systems and Algorithms

Course Details

Cost

FREE,
Add a Verified Certificate for $79

Upcoming Schedule

  • Upcoming

Course Provider

Coursera online courses
Coursera's online classes are designed to help students achieve mastery over course material. Some of the best professors in the world - like neurobiology professor and author Peggy Mason from the University of Chicago, and computer science professor and Folding@Home director Vijay Pande - will supplement your knowledge through video lectures. They will also provide challenging assessments, interactive exercises during each lesson, and the opportunity to use a mobile app to keep up with yo...
Coursera's online classes are designed to help students achieve mastery over course material. Some of the best professors in the world - like neurobiology professor and author Peggy Mason from the University of Chicago, and computer science professor and Folding@Home director Vijay Pande - will supplement your knowledge through video lectures. They will also provide challenging assessments, interactive exercises during each lesson, and the opportunity to use a mobile app to keep up with your coursework. Coursera also partners with the US State Department to create “learning hubs” around the world. Students can get internet access, take courses, and participate in weekly in-person study groups to make learning even more collaborative. Begin your journey into the mysteries of the human brain by taking courses in neuroscience. Learn how to navigate the data infrastructures that multinational corporations use when you discover the world of data analysis. Follow one of Coursera’s “Skill Tracks”. Or try any one of its more than 560 available courses to help you achieve your academic and professional goals.

Provider Subject Specialization
Humanities
Sciences & Technology
4723 reviews

Course Description

Data analysis has replaced data acquisition as the bottleneck to evidence-based decision making --- we are drowning in it. Extracting knowledge from large, heterogeneous, and noisy datasets requires not only powerful computing resources, but the programming abstractions to use them effectively. The abstractions that emerged in the last decade blend ideas from parallel databases, distributed systems, and programming languages to create a new class of scalable data analytics platforms that form the foundation for data science at realistic scales. In this course, you will learn the landscape of relevant systems, the principles on which they rely, their tradeoffs, and how to evaluate their utility against your requirements. You will learn how practical systems were derived from the frontier of research in computer science and what systems are coming on the horizon. Cloud computing, SQL and NoSQL databases, MapReduce and the ecosyste... Data analysis has replaced data acquisition as the bottleneck to evidence-based decision making --- we are drowning in it. Extracting knowledge from large, heterogeneous, and noisy datasets requires not only powerful computing resources, but the programming abstractions to use them effectively. The abstractions that emerged in the last decade blend ideas from parallel databases, distributed systems, and programming languages to create a new class of scalable data analytics platforms that form the foundation for data science at realistic scales. In this course, you will learn the landscape of relevant systems, the principles on which they rely, their tradeoffs, and how to evaluate their utility against your requirements. You will learn how practical systems were derived from the frontier of research in computer science and what systems are coming on the horizon. Cloud computing, SQL and NoSQL databases, MapReduce and the ecosystem it spawned, Spark and its contemporaries, and specialized systems for graphs and arrays will be covered. You will also learn the history and context of data science, the skills, challenges, and methodologies the term implies, and how to structure a data science project. At the end of this course, you will be able to: Learning Goals: 1. Describe common patterns, challenges, and approaches associated with data science projects, and what makes them different from projects in related fields. 2. Identify and use the programming models associated with scalable data manipulation, including relational algebra, mapreduce, and other data flow models. 3. Use database technology adapted for large-scale analytics, including the concepts driving parallel databases, parallel query processing, and in-database analytics 4. Evaluate key-value stores and NoSQL systems, describe their tradeoffs with comparable systems, the details of important examples in the space, and future trends. 5. “Think” in MapReduce to effectively write algorithms for systems including Hadoop and Spark. You will understand their limitations, design details, their relationship to databases, and their associated ecosystem of algorithms, extensions, and languages. write programs in Spark 6. Describe the landscape of specialized Big Data systems for graphs, arrays, and streams
Data Manipulation at Scale: Systems and Algorithms course image
Reviews 5/10 stars
2 Reviews for Data Manipulation at Scale: Systems and Algorithms

Ratings details

  • 5 stars
  • 4 stars
  • 3 stars
  • 2 stars
  • 1 stars
  • 5 stars
  • 4 stars
  • 3 stars
  • 2 stars
  • 1 stars
  • 5 stars
  • 4 stars
  • 3 stars
  • 2 stars
  • 1 stars

Rankings are based on a provider's overall CourseTalk score, which takes into account both average rating and number of ratings. Stars round to the nearest half.

Sort By
Student profile image
Student profile image

Student

4/10 starsTaking Now
3 years, 2 months ago
Disorganised lectures, the lecture has no skill in explaining the materials so it can be understand easily, too much rambling and not getting to the important point , wasted lots of time! Lecture are not related whatsoever to the assignment
Was this review helpful? Yes0
 Flag
 profile image
 profile image

6/10 starsDropped
  • 2 reviews
  • 1 completed
3 years, 10 months ago
This course has appealing assignments and covers interesting topics. The course, however, has two fatal flaws. First, the lectures are a bit disjointed. While there is much to learn in the lectures, the lecturers style is a bit halting and scattered (it would have been much better presented if the lecturer had a script to read off of.) As is, the lectures are mediocre, which is unfortunate since the lecturer is clearly knowledgeable about the topics presented. Second, the assignments suffer from a lack of good error messaging and no support in the forums (aside from what you will find from other students, which can be very helpful at times.) The assignments themselves are a great approach to learning concepts (and you get to work with real data, like the Twitter data), but without good error messaging when you submit a script you pretty much end guessing where you are taking a wrong turn. I had high hopes for this course, ... This course has appealing assignments and covers interesting topics. The course, however, has two fatal flaws. First, the lectures are a bit disjointed. While there is much to learn in the lectures, the lecturers style is a bit halting and scattered (it would have been much better presented if the lecturer had a script to read off of.) As is, the lectures are mediocre, which is unfortunate since the lecturer is clearly knowledgeable about the topics presented. Second, the assignments suffer from a lack of good error messaging and no support in the forums (aside from what you will find from other students, which can be very helpful at times.) The assignments themselves are a great approach to learning concepts (and you get to work with real data, like the Twitter data), but without good error messaging when you submit a script you pretty much end guessing where you are taking a wrong turn. I had high hopes for this course, but it seems as though it fails on execution.
Was this review helpful? Yes0
 Flag

Rating Details


  • 5 stars
  • 4 stars
  • 3 stars
  • 2 stars
  • 1 stars
  • 5 stars
  • 4 stars
  • 3 stars
  • 2 stars
  • 1 stars
  • 5 stars
  • 4 stars
  • 3 stars
  • 2 stars
  • 1 stars

Rankings are based on a provider's overall CourseTalk score, which takes into account both average rating and number of ratings. Stars round to the nearest half.