Data Architect with OCP

Job

Rivago infotech inc

Dallas, TX (In Person)

Full-Time

Posted 3 days ago (Updated 15 hours ago) • Actively hiring

Expires 6/12/2026

Apply for this opportunity

This job application is on an outside website. Be sure to review the job posting there to verify it's the same.

See Job Scorecard

Review key factors to help you decide if the role fits your goals.

How is this calculated?

Pay Growth

out of 5

Not enough data

Not enough info to score pay or growth

Job Security

out of 5

Not enough data

Calculating job security score...

Total Score

out of 100

Average of individual scores

Were these scores useful?

Skill Insights

Compare your current skills to what this opportunity needs—we'll show you what you already have and what could strengthen your application.

Job Description

Role :

Google Cloud Data Architect

IAM Data Modernization Location :
Dallas, TX / Charlotte, NC (Hybrid
4 days office) Highly Preferred OCP exp Project/Program Identity & Access Management (IAM) Data Modernization
migration of an on‑premises SQL data warehouse to a target‑state Data Lake on Google Cloud (Google Cloud Platform) , enabling metrics & reporting, advanced analytics, and GenAI use cases (natural language querying, accelerated summarization, cross‑domain trend analysis) leveraging PySpark‑based processing, cloud‑native DevOps CI/CD pipelines, and containerized deployments on OpenShift (OCP) to deliver scalable, secure, and high‑performance data solutions.

About Program/Project The IAM Data Modernization project involves migrating an on-premises SQL data warehouse to a target state Data Lake in Google Cloud Platform cloud environment.

Key highlights include:

Integration Scope:

30+ source system data ingestions and multiple downstream integrations

Capabilities:

Metrics, reporting, and Gen AI use cases with natural language querying, advanced pattern/trend analysis, faster summarizations, and cross-domain metric monitoring

Benefits:

Scalability and access to advanced cloud functionality
Highly available and performant semantic layer with historical data support
Unified data strategy for executive reporting, analytics, and Gen AI across cyber domains This modernization establishes a single source of truth for enterprise-wide data-driven decision-making. Required Skills DevOps / CI‑CD
Experience implementing CI/CD pipelines for data and analytics workloads
Familiarity with Git‑based source control, build automation , and deployment strategies Containers & Platform
Experience with OpenShift Container Platform (OCP) for deploying data workloads and services
Understanding of containerized architecture, scaling, and environment management
Proven ability to build CI/CD pipelines for data and infrastructure workloads
Experience managing secrets securely using Google Cloud Platform Secret Manager
Ownership of observability, SLOs, dashboards, alerts, and runbooks
Proficiency in logging, monitoring, and alerting for data pipelines and platform reliability Big Data & Processing
Hands‑on experience with PySpark for ETL/ELT, data transformation, and performance optimization
Solid understanding of distributed data processing concepts Data & Cloud Architecture
Strong experience designing data platforms on Google Cloud Platform (Google Cloud Platform)
Experience with Data Lakes, data warehousing, and large‑scale migration programs Data Lake Architecture & Storage
Proven experience designing and implementing data lake architectures (e.g., Bronze/Silver/Gold or layered models).
Strong knowledge of Cloud Storage (GCS) design, including bucket layout, naming conventions, lifecycle policies, and access controls
Experience with Hadoop/HDFS architecture, distributed file systems, and data locality principles
Hands-on experience with columnar data formats (Parquet, Avro, ORC) and compression techniques
Expertise in partitioning strategies , backfills, and large-scale data organization
Ability to design data models optimized for analytics and BI consumption Data Ingestion & Orchestration
Experience building batch and streaming ingestion pipelines using Google Cloud Platform-native services
Knowledge of Pub/Sub-based streaming architectures , event schema design, and versioning
Strong understanding of incremental ingestion and CDC patterns , including idempotency and deduplication
Hands-on experience with workflow orchestration tools (Cloud Composer / Airflow)
Ability to design robust error handling, replay, and backfill mechanisms Data Processing & Transformation
Experience developing scalable batch and streaming pipelines using Dataflow (Apache Beam) and/or Spark (Dataproc)
Strong proficiency in BigQuery SQL , including query optimization, partitioning, clustering, and cost control.
Hands-on experience with Hadoop MapReduce and ecosystem tools (Hive, Pig, Sqoop)
Advanced Python programming skills for data engineering, including testing and maintainable code design
Experience managing schema evolution while minimizing downstream impact Analytics & Data Serving
Expertise in BigQuery performance optimization and data serving patterns
Experience building semantic layers and governed metrics for consistent analytics
Familiarity with BI integration , access controls, and dashboard standards
Understanding of data exposure patterns via views, APIs, or curated datasets Data Governance, Quality & Metadata
Experience implementing data catalogs, metadata management, and ownership models
Understanding of data lineage for auditability and troubleshooting
Strong focus on data quality frameworks , including validation, freshness checks, and alerting
Experience defining and enforcing data contracts, schemas, and SLAs Good to have Security, Privacy & Compliance
Hands-on experience implementing fine-grained access controls for BigQuery and GCS
Experience with Sprint planning and helping team technically.
Strong stakeholder communication and solution‑architecture skills Qualifications

Experience:

[10-14]+ years in DevOps and Data Architecture, 5+ years designing on Pyspark/Google Cloud Platform/OCP at scale; prior on‑prem → cloud migration a must.

Education:

Bachelor s/Master s in Computer Science, Information Systems, or equivalent experience.

Certifications:

Google Cloud Professional Cloud Architect/DevOps/OCP (required or within 3 months).

Plus:

Professional Data Engineer, Security Engineer.

Similar remote jobs

Job
Field Service Lead (Commercial Generators)
SC
Southern Company
Durham, NC
Posted2 days ago
Updated15 hours ago
Job
Treasury Analyst
AT
AHU Technologies Inc
Washington
Posted2 days ago
Updated15 hours ago
Job
Corrections Utility Plant Operator SCI Coal Township
CO
Commonwealth of PA
Pennsylvania
Posted2 days ago
Updated15 hours ago
Job
Financial Analyst II - Real Estate and Operations
MS
Memorial Sloan Kettering Cancer Center
New York, NY
Posted2 days ago
Updated15 hours ago
Job
NICU & Infant Care Development Specialist
UO
University of Minnesota
Saint Paul, MN
Posted2 days ago
Updated15 hours ago

Similar jobs in Dallas, TX

Job
RS Avionics Installer II - Weekend Days
TS
The Structures Company
Dallas, TX
Posted2 days ago
Updated15 hours ago
Job
Sales Development Representative (Tech / B2B SaaS)
AS
a Snaphunt Client
Dallas, TX
Posted2 days ago
Updated15 hours ago
Job
Grill Cook
U
UnitedStates
Dallas, TX
Posted2 days ago
Updated15 hours ago
Job
Purchasing & Inventory Coordinator
GP
GRAHAM PERSONNEL SERVICES
Dallas, TX
Posted2 days ago
Updated15 hours ago
Job
Events Staffing Manager
A
A & Associates
Dallas, TX
Posted2 days ago
Updated15 hours ago

Similar jobs in Texas

Job
Dietary Aide Dishwasher
LC
Life Care
Haltom City, TX
Posted2 days ago
Updated15 hours ago
Job
Dental Assistant
CD
Castle Dental
Austin, TX
Posted2 days ago
Updated15 hours ago
Job
5-Axis CNC Programmer
LD
LAUNCH Defense
Midland, TX
Posted2 days ago
Updated15 hours ago
Job
Emergency Room Technician - PRN
AC
Altus Community Healthcare
Eagle Pass, TX
Posted2 days ago
Updated15 hours ago
Job
Clubline Concierge Representative
C
ClubCorp
Irving, TX
Posted2 days ago
Updated15 hours ago