Learn the basics of PySpark by working through a movie recommendation engine. You will be exposed to all steps from loading and exploring data to making predictions.
Course instructor: Vida Ha
Vida is currently a Solutions Engineer at Databricks. In her past, she worked on scaling Square’s Reporting Analytics System. She first began working with distributed computing at Google – where she improved search rankings of mobile specific web content and built and tuned language models for speech recognition using a year’s worth of Google search queries. She’s passionate about accelerating the adoption of Apache Spark to bring the combination of speed and scale of data processing to the mainstream.
99 Madison Ave., 15th Floor