Job Overview

Title:

Lead Azure Data Engineer

Key Responsibilities

• Design and implement streaming data pipelines integrating Kafka with Databricks using Structured Streaming.

• Architect and maintain a medallion architecture with well-defined bronze, silver, and gold layers.

• Implement efficient ingestion using Databricks Auto Loader for high-throughput data loads.

• Work with large volumes of structured and unstructured data, ensuring high availability and performance.

• Apply performance tuning techniques such as partitioning, caching, and cluster resource optimization.

• Collaborate with cross-functional teams (data scientists, analysts, business users) to build robust data solutions.

• Establish best practices for code versioning, deployment automation, and data governance.

Required Technical Skills:

• Strong expertise in Azure Databricks and Spark Structured Streaming

• 7+ years' experience in data engineering

• Structured Streaming output modes (append, update, complete)

• Checkpointing and state management

• Experience with Kafka integration for real-time data pipelines

• Deep understanding of medallion architecture

• Proficiency with Databricks Auto Loader and schema evolution

• Deep understanding of Unity Catalog and foreign catalogs

• Strong knowledge of Spark SQL, Delta Lake, and DataFrames

• Expertise in performance tuning (query optimization, cluster configuration, caching strategies)

• Must have experience defining data management strategies

• Excellent command of data governance and access management

• Strong in data modelling, data warehousing concepts, and Databricks as a platform

• Solid understanding of window functions

$514,963-$840,023 Annual

Celebal Technologies

Not Specified, Maharashtra, India