  • This event has passed.

Data Product at Scale with Scala (12-week evening course)

June 27 @ 6:00 pm - September 14 @ 9:00 pm


Course Description
This part-time evening course will enable you to integrate data products at scale using Scala. We cover how to work with distributed systems to efficiently collect, analyze, and parallelize large quantities of varied data. The course is intended for data engineers who want to advance their careers with the latest technology and applications.

In this course you will:

  • Work with a Linux virtual machine
  • Deploy and manipulate data in the cloud
  • Write complex SQL queries
  • Design a database that conforms to the third normal form (3NF)
  • Identify embarrassingly parallelizable tasks
  • Describe and apply the MapReduce algorithm
  • Describe and apply Spark’s DataFrame abstraction
  • Perform machine learning on a cluster
  • Apply probabilistic data structures to handle high volume/velocity data
  • Build an end-to-end distributed data pipeline

This course is ideal for students who:

  • Are fluent in a programming language such as Python, C, or Java
  • Are familiar with data exploration and basic feature engineering
  • Are familiar with the fundamentals of machine learning models

Class Structure

This course is an active learning environment: you'll learn by doing. The focus will be on explaining concepts in your own words and applying them through programming.

Course Schedule

This is a 12-week course. Classes run from June 27 to September 14 and meet twice per week, on Tuesdays and Thursdays from 6:00 pm to 9:00 pm PDT.


44 Tehama Street
San Francisco, CA 94105 United States