4.65 out of 5
4.65
20 reviews on Udemy

Databricks Data Engineer Associate Certification Preparation

Updated Dec 2023 - Databricks Certified Data Engineer Associate V3
Databricks Lakehouse Platform and its tools
Build ETL pipelines
Process data incrementally
Create production pipelines
Create Dashboards in Databricks
Implement best security practices
Databricks
Databricks certification
Databrick Data engineer

Note: The course has been updated to include a number of lectures for version 3 certification for Databricks data engineer associate exam.

Whether you’re a seasoned data professional or just starting your journey, this course provides the perfect blend of theory and hands-on examples to ensure your success. With practical exercises and step-by-step guidance, you will learn how to navigate the Data Lakehouse architecture, explore the Data Science and Engineering workspace, and master the powerful Delta Lake.

A Certified Databricks Data Engineer unlocks endless possibilities in the world of data processing and analytics. In this comprehensive course, you will gain the knowledge and skills to harness the power of the Databricks Lakehouse Platform, empowering you to tackle real-world data challenges with confidence and efficiency.

Here’s a breakdown of the topics covered in this course:

  • Databricks Lakehouse Platform:

    • Databricks user interface

    • Notebooks

    • Connecting to repository / CICD

    • All purpose and job clusters

    • Accounts and workspaces

    • Data Lakehouse (architecture, descriptions, benefits)

    • Data Science and Engineering workspace (clusters, notebooks, data storage)

    • Delta Lake (general concepts, table management and manipulation, optimizations)

  • Data transformation with Apache Spark:

    • Relational entities (databases, tables, views)

    • Extracting data from files

    • Views, temporary views and CTEs

    • Creating tables, writing data to tables, cleaning data, combining and reshaping tables, SQL UDFs

    • Facilitating Spark SQL with string manipulation and control flow

    • passing data between PySpark and Spark SQL

    • Using Pyspark and SQL for various transformations such as count, count_if, removing duplicates, external tables, timestamps, JSON, structs, arrays, CASE WHEN and many more

  • Data management with Delta Lake:

    • Reading files using SQL in Databricks

    • Using CTAS

    • Table constraints, partitions, Operations, time travel,

    • Optimizing using z-ordering and vaccum

    • Delta cloning and external tables

  • Data pipeline with Delta live tables:

    • Structured Streaming (general concepts, triggers, watermarks)

    • Auto Loader (streaming reads)

    • Multi-hop Architecture (bronze-silver-gold, streaming applications)

    • Delta Live Tables (benefits and features)

    • Change Data Capture

  • Build production pipelines / Workloads:

    • Jobs (scheduling, task orchestration, UI, CRON)

    • Job notifications and history

    • Dashboards (endpoints, scheduling, alerting, refreshing)

  • Unity catalog and entity permissions:

    • Unity Catalog (benefits and features)

    • Entity Permissions (team-based permissions, user-based permissions)

These topics provide a comprehensive coverage of the Databricks Lakehouse Platform and its tools, allowing learners to gain a solid understanding of data engineering concepts and practices using Databricks.

You can view and review the lecture materials indefinitely, like an on-demand channel.
Definitely! If you have an internet connection, courses on Udemy are available on any device at any time. If you don't have an internet connection, some instructors also let their students download course lectures. That's up to the instructor though, so make sure you get on their good side!
4.7
4.7 out of 5
20 Ratings

Detailed Rating

Stars 5
12
Stars 4
6
Stars 3
0
Stars 2
0
Stars 1
2
f18eeba828053292c3a8028265609dce

Includes

6 hours on-demand video
Certificate of Completion

Warning: Undefined array key "student_url_profile" in /home/itcoursesitjobbo/public_html/wp-content/plugins/masterstudy-lms-learning-management-system/_core/lms/helpers.php on line 1403

Warning: Undefined array key "student_url_profile" in /home/itcoursesitjobbo/public_html/wp-content/plugins/masterstudy-lms-learning-management-system/_core/lms/helpers.php on line 1408