Please join us for a free, IBM Sponsored, proof of technology for clients and practitioners on Apache Spark. This is a full day of education on Spark with hands on exercises instructed in person by Spark experts. The POT will provide a detail overview of Apache Spark. The exercises will be performed on Jupyter notebooks with publicly available datasets. Participants will use IBM’s fully managed free Cloud platform available for educational purposes.
Hands on Introduction to Apache Spark for Data Engineers, Data Scientist and Developers
Who should go:
Anyone interested in learning more about Apache Spark.
A working knowledge of Coding (Preferred Python and/or Scala), understand distributed computing, Spark and SQL.
What to expect:
Expect to spend a full day of lecture and hands on exercises attacking real-world data challenges using Apache Spark. In 8 hours you will learn the basic essentials of Apache Spark and why it’s important to your organization. This workshop will focus on data wrangling and machine learning.
For this event, you will need to bring your own laptop. Laptops will not be provided.
Full Day Agenda:
8:30am – 9:00am Breakfast, Socialize
9:00am – 10:00am Kickoff, Apache Spark Overview
10:00am – 11:00am Lab 1, Hello Spark – Hand on exercise
11:00 am – 12:00pm Apache Spark SQL Overview
12:00 pm – 1:00pm Lunch
1:00pm – 2:00pm Lab 2, Spark SQL – Hands on exercises
2:00pm – 300pm Overview of Data Science & Machine Learning w/ Apache Spark
3:00pm – 4:00pm Lab 3, Machine Learning w/ Spark – Hands on exercises
4:00pm – 4:30pm Wrap up – Feedback from attendees