This one-day Spark training is an introduction to the foundations of Spark and how to use it on the Databricks environment. Spark is a unified computing engine and a set of libraries for parallel processing of big data. Companies like Netflix, Yahoo and eBay have used Spark to achieve lightning-fast processing for large scale data, and more and more companies are implementing the tool.
You will leave this training with an understanding of the foundational concepts of Spark and its architecture. You will see use cases and learn how to write simple applications using Spark.
For this training, you’ll use your own computer and the databricks environment to work through examples — please do not forget to bring a laptop!
Lunch will be provided.
Who Should Attend?
Business Intelligence Analysts
**Knowledge of at least one programming language is highly recommended**
About the Instructor
Daniel Cadenas is a computer engineer with 18+ year of experience in Business Intelligence, Data Warehousing and Big Data. Currently, working as Big Data Engineer at Ultimate Software, he is involved in building applications using Spark and Hadoop (HDP). Daniel is also a Data Scientist enthusiast with hands-on experience with machine learning and deep learning. Daniel has taught different classes during his career such as DB2, Cognos, Microstrategy, Data Stage and others.
- Spark 2 Architecture
- Databricks Community Edition
- Spark RDD
- Spark SQL/DataFrames
- Spark Streaming
- Spark ML
- Performance and tuning considerations