Job Title:
Lead Data Engineer with
Azure Databricks and Unity Catalog Location:
Charlotte, NC-3 days office
Duration:
Long Term Role Overview We are looking for an experienced Azure Databricks Engineer with strong expertise in cloud-based data engineering, ETL development, and distributed data processing. The ideal candidate should have solid hands-on experience with PySpark, Delta Lake, Azure Data Factory, and building scalable data pipelines on Azure. The engineer will work closely with business, Data Architects, and cross-functional teams to design, develop, and optimize data pipelines for enterprise‑grade analytics and reporting. Key Responsibilities Data Engineering & Pipeline Development Design, develop, and optimize ETL/ELT pipelines using Azure Databricks (PySpark).
Build scalable data ingestion workflows from various structured and unstructured sources.
Implement transformation logic, data cleansing, enrichment, and validation frameworks.
Work with Delta Lake to build medallion architecture (Bronze/Silver/Gold layers).
Develop reusable Databricks notebooks and jobs for production data workflows.
Azure Cloud & Integration Build and orchestrate pipelines using Azure Data Factory (ADF).
Integrate Databricks with other Azure services-ADLS, Azure SQL, Event Hub, Key Vault, Synapse.
Optimize compute environments (clusters, pools, autoscaling).
Implement DevOps processes using Git, CICD, Azure DevOps.
Performance, Quality & Governance Optimize PySpark jobs for performance and cost efficiency.
Implement best practices for data governance, security, and access control.
Troubleshoot production issues and perform root-cause analysis.
Conduct code reviews ensuring coding standards and data quality.
Collaboration & Documentation Work with Data Architects to define architecture and design patterns.
Prepare technical documents, solution diagrams, and runbooks.
Collaborate with business stakeholders to understand requirements and translate them into technical solutions. 3. Mandatory Skills Azure Databricks
- notebooks, jobs, workflows, Delta Lake.
PySpark
- dataframes, Spark SQL, optimization & debugging.
Azure Data Factory (ADF)
- triggers, pipelines, integration runtime.
Data Lake Storage (ADLS Gen2)
- folder structures, partitioning, security.
CI/CD
- Git (branching strategies), Azure DevOps pipelines.
SQL
- strong proficiency in writing optimized queries.
4. Good-to-Have Skills Azure Synapse Analytics
Azure Event Hub / Kafka
Azure Functions
DataBricks REST APIs
Streaming pipelines (Structured Streaming)
Experience with data modelling
Knowledge of Lakehouse architecture 5. Behavioral & Soft Skills Strong analytical and problem-solving skills.
Ability to work independently and in cross-functional teams.
Good communication skills for stakeholder interaction.
Comfortable working in Agile/Scrum models.
Educational Qualifications:
- Required
- Bachelor s degree in Computer Science, Information Technology, Computer Engineering or closely related or equivalent.
- Preferred
- Master s degree in Management Information Systems (MIS), Computer Science, Big Data or Analytics or equivalent.
Travel:
- Open to travel based-up on the nature of the engagement.
Thanks & Regards Srikanth Donkani Resource Manager (w): (E): 2260 Haggerty Road, Suite 285 Northville, MI 48167 Equal Employment Opportunity Reliable Software employment does not discriminate on the basis of race, religion, gender, sexual orientation, age or any other basis as covered by federal, state, or local law. Employment decisions are based solely on qualifications, merit and business needs.