Resources
Hand-crafted technical documentation and learning paths.
Learning Paths
Cheatsheets
dbutils CheatsheetFilesystem, secrets, notebooks, widgets, libraries, jobs.PySpark Data ManipulationDiagnostics, cleaning, transformations, aggregations, windows, joins.Delta Lake MaintenanceSafe DDL, schema evolution, housekeeping, time travel, CDF, monitoring, cleanup.Delta Lake OptimizationOPTIMIZE, Z-ORDER, Liquid Clustering, partitioning, VACUUM, and health checks.Unity Catalog Admin & Security (SQL)Catalog/schema setup, grants, external locations, row/column security, SPNs, auditing.Soda CoreData quality checks, scans, metrics and monitoring for pipelines.PySpark Configurationspark.conf settings for performance, memory, SQL, Delta, Photon.
Certification Guides
Lakehouse FundamentalsPrep for the Lakehouse Fundamentals accreditation.Data Engineer AssociatePrep path, official courses, and practice exams.Data Engineer ProfessionalAdvanced prep with optimization and best practices.Associate Developer for Apache Spark (PySpark)Developer-focused prep for the PySpark certification.