Job Description
Databricks Architect
Must Have Technical/Functional Skills
Experience:
5+ years of hands-on data engineering experience, with at least 3 years focused on the Databricks/Spark ecosystem.
Databricks Expertise:
Deep, hands-on expertise with the Databricks Lakehouse Platform, including Delta Lake, Structured Streaming, Delta Live Tables, and cluster configuration/optimization.
Programming Mastery:
Expert-level proficiency in Python and PySpark. Advanced SQL skills are essential.
Data Warehousing Concepts:
Strong understanding of data modeling principles, including dimensional modeling (Kimball), data warehousing concepts, and ETL/ELT design patterns.
Cloud Proficiency:
Proven experience working with a major cloud provider (Azure, AWS, or GCP), particularly with cloud data storage (such as S3) and related services.
Software Engineering Mindset:
Experience with software engineering best practices, including version control (Git), code reviews, testing, and CI/CD.
Roles and Responsibilities
Data Pipeline Development:
Design, code, and deploy robust and scalable batch and streaming data pipelines using PySpark, Spark SQL, and Delta Live Tables to ingest data from sources such as Point-of-Sale (POS), e-commerce platforms, loyalty systems, and marketing clouds.
Data Modeling and Transformation:
Implement complex data transformations and business logic within the Medallion architecture (Bronze, Silver, Gold layers). Build and optimize the final "Gold" customer-dimension tables that will serve as the single source of truth.
Data Quality:
Implement data quality frameworks and cleansing routines to ensure the accuracy and trustworthiness of the Customer 360 data.
Performance Optimization:
Proactively monitor, debug, and tune Databricks jobs and Spark clusters for performance and cost-efficiency. Implement best practices for partitioning, caching, and data layout in Delta Lake.
Infrastructure as Code (IaC) & CI/CD:
Work with DevOps teams to manage Databricks environments, clusters, and job deployments using tools like Terraform and AWS DevOps/GitHub Actions. Champion and implement CI/CD best practices for data pipelines.
Data Governance and Security:
Implement data governance features within Databricks Unity Catalog, including data lineage tracking, access controls, and data masking to ensure compliance and security.
Collaboration:
Partner closely with Functional Consultants, Data Scientists, and Analytics Engineers to understand their data requirements and deliver well-structured, consumption-ready datasets.
Education: Bachelor's
Salary Range: $120,000 - $150,000 a year
Location: Marlborough, MA
Job Function: Technology
Role: Technical Architect
Job Id: 409979
Desired Skills: Data Warehouse
Desired Candidate Profile
Qualifications:
Bachelor of Computer Science
Technical Architect-Datawarehousing
Tata Consultancy Services (part of Tata group)
Marlborough, MA
$120,000 - $150,000 a year