Apply for this opportunity

This job application is on an outside website. Be sure to review the job posting there to verify it's the same.

Apply Offsite

Data Bricks Migration and Support engineer

Job

Tata Consultancy Services

Seattle, WA (In Person)

$130,000 Salary, Full-Time

Posted 2 days ago (Updated 15 hours ago) • Actively hiring

Expires 7/7/2026

See Job Scorecard

Review key factors to help you decide if the role fits your goals.

How is this calculated?

Pay Growth

out of 5

Not enough data

Not enough info to score pay or growth

Job Security

out of 5

Not enough data

Calculating job security score...

Total Score

out of 100

Average of individual scores

Were these scores useful?

Skill Insights

Compare your current skills to what this opportunity needs—we'll show you what you already have and what could strengthen your application.

Job Description

Data Bricks Migration and Support engineer (part of Tata group) 3.9 3.9 out of 5 stars Seattle, WA $120,000 - $140,000 a year Tata Consultancy Services 23,598 reviews $120,000 - $140,000 a year Must Have Technical/Functional Skills Successfully executed a data migration or modernization to Data Bricks, preferably IBM Data Stage to Data Bricks on AWS Should have Experience in handling Large Migrations to Data Bricks. Should have good analytical skills to compare the legacy and modern data platform end to end right from source to target. Good understanding of DataBricks implementation of Medallion layer architecture. Independently Lead and Managed large Data Bricks migrations.

CI/CD Integration:

Implement version control (e.g., Git) and automated deployment processes for Databricks assets Technical and architectural skills required are below. Core Data Engineering Languages Experience in Advanced SQL for building modular analytics workflows, utilizing advanced Common Table Expressions (CTEs), and writing high-performance queries inside Data Bricks SQL Analytics. Experience in Python or Scala to build, optimize, and debug complex data transformation scripts, custom functions, and machine learning pipelines. Big Data & Architecture Core Experience in Apache Spark Ecosystem for understanding cluster execution flow, memory allocation, driver/worker nodes, and handling data frames. Experience in Delta Lake Architecture to understand ACID transactions on object storage, data skipping, partition strategies, and automated data compaction. Databricks Platform Expertise Experience in Delta Live Tables (DLT) & Workflows for constructing and orchestrating production-ready, declarative streaming, and batch ETL pipelines. Experience in Unity Catalog for setting up data governance, column/row-level access control, and tracking end-to-end data lineage across workspaces. Experience in Auto Loader for implementing modern, incremental data ingestion patterns from cloud blob storage into the lakehouse.

Code Translation & Refactoring Pipeline Conversion:

Translate visual DataStage Parallel Jobs and Sequences into Python/PySpark scripts or Data bricks

Notebooks Legacy Refactoring:

Modernize legacy logic rather than applying "lift and shift" anti-patterns; adapt workflows to think in distributed DataFrames rather than DataStage stages.

Logic Mapping:

Map DataStage components—such as Aggregators, Joiners, Transformers, and Sort stages—to equivalent Spark operations

Testing & Reconciliation Validation & Reconciliation:

Build automated reconciliation frameworks to compare row counts, checksums, and aggregate sums between legacy DataStage outputs and new Databricks output

Data Cleansing:

Identify and resolve data type discrepancies, null-handling differences, and encoding issues during the extraction and loading phases Platform Orc hestration &

Governance Orchestration:

Replace DataStage sequence jobs with Databricks workflows ( or external orchestrators like Azure Data Factory/Airflow) to schedule and manage dependencies

Data Governance:

Enforce data lineage, security, and cataloging using Unity Catalog to ensure compliance in the new Lakehouse environment. GOOD TO Cloud Infrastructure & CI/CD Cloud Providers (AWS): Understanding underlying cloud object storage , identity access management (IAM), and network security configurations.

DevOps & Bundles:

Familiarity with Databricks Asset Bundles (DABs) and CI/CD tools to automate the deployment of workspaces and pipeline assets.

Legacy Assessment & Migration Mechanics Code Conversion & Translation:

The ability to parse legacy code structures and refactor them into Databricks-native code.

AI-Assisted Migration:

Skills in using AI coding assistants and open framework agent tools to analyze application interdependencies, automate schema mapping, and accelerate lift-and-shift workloads

Code Conversion & Translation:

The ability to parse legacy code structures from ETL pipelines, Informatica, data Stage preferred Experience working in Agile teams and understanding of data governance frameworks. Responsibilities Support post-migration environment from IBM DataStage to

Databricks Incident & Lifecycle Management CI/CD Deployment:

Support code deployments across Development, Test, and Production environments using

Databricks Repos and REST APIs Monitoring & Alerting:

Set up monitoring via Databricks System Tables and observability tools to catch job failures, data anomalies, or latency spikes early

Pipeline Maintenance & Orchestration Workflow Management:

Transition from DataStage job sequences to native data bricks workflows for scheduling, dependency tracking, and alerts

ETL Refactoring:

Troubleshoot and fix issues in generated PySpark or Spark SQL code that replaced legacy DataStage Transformer or Lookup stages

Streaming & Batch Integration:

Support ongoing data ingestion using data bricks autoloader to process files continuously from cloud storage

Performance Tuning & Cost Optimization Compute Management:

Monitor and configure serverless or classic clusters to prevent over-provisioning

Query Optimization:

Analyze Spark execution plans. Replace inefficient row-by-row processing logic (a common DataStage carryover) with vectorized operations and native Spark functions

Storage Optimization:

Maintain Delta Lake tables by enforcing layout optimization ($ZORDER$

Data Governance & Security Access Control:

Implement granular permissions, column-masking, and row-level filters using Data bricks unity catalog to replace DataStage's legacy security policies

Data Quality:

Utilize Delta Live Tables (DLT) to build pipelines with built-in, declarative data quality expectations and monitoring Additional Skills Excellent communication Skills Ability to collaborate with Legacy and Modernize application teams and stake holders

Base Salary Range :

$120,000 to $140,000

Per Annum TCS Employee Benefits Summary:

Discretionary Annual Incentive.

Comprehensive Medical Coverage:

Medical & Health, Dental & Vision, Disability Planning & Insurance, Pet Insurance Plans.

Family Support:

Maternal & Parental Leaves.

Insurance Options:

Auto & Home Insurance, Identity Theft Protection.

Convenience & Professional Growth:

Commuter Benefits & Certification & Training Reimbursement.

Time Off:

Vacation, Time Off, Sick Leave & Holidays.

Legal & Financial Assistance:

Legal Assistance, 401K Plan, Performance Bonus, College Fund, Student Loan Refinancing. #LI-SV2 #

LI-KUMARAN

Location Seattle, WA Job Function

TECHNOLOGY

Role Senior Engineer Job Id 416051 Desired Skills Data Migration Salary Range $120,000-$140,000 a year

Desired Candidate Profile Qualifications :

BACHELOR OF COMPUTER

SCIENCE