Data Architect (Google Cloud) – IAM Data Modernization

Hybrid in Dallas, TX, US • Posted 1 day ago • Updated 1 day ago
Contract Corp To Corp
Contract Independent
Contract W2
No Travel Required
Hybrid
$65 - $70/hr

Job Details

Skills

  • GCP
  • IAM
  • Data Architect
  • Architect

Summary

Role: Google Cloud Data Architect – IAM Data Modernization with OpenShift Container Platform (OCP)

Location: Dallas, TX / Charlotte, NC (Hybrid – 4 days in office)

Contract

End Client: Banking

Implementation partner: ********

Experience: 12+ years

 

OCP experience highly preferred

 

Project/Program

Identity & Access Management (IAM) Data Modernization – migration of an on-premises SQL data warehouse to a target-state Data Lake on Google Cloud Platform (GCP), enabling metrics and reporting, advanced analytics, and GenAI use cases (natural language querying, accelerated summarization, cross-domain trend analysis). The program leverages PySpark-based processing, cloud-native DevOps CI/CD pipelines, and containerized deployments on OpenShift (OCP) to deliver scalable, secure, and high-performance data solutions.

 

About Program/Project

 

The IAM Data Modernization project involves migrating an on-premises SQL data warehouse to a target-state Data Lake in a Google Cloud Platform (GCP) environment. Key highlights include:

  • Integration Scope: 30+ source system data ingestions and multiple downstream integrations
  • Capabilities: Metrics, reporting, and GenAI use cases with natural language querying, advanced pattern/trend analysis, faster summarization, and cross-domain metric monitoring
  • Benefits:
    • Scalability and access to advanced cloud functionality
    • Highly available and performant semantic layer with historical data support
    • Unified data strategy for executive reporting, analytics, and GenAI across cyber domains

This modernization establishes a single source of truth for enterprise-wide data-driven decision-making.

 

Required Skills

 

DevOps / CI‑CD

  • Experience implementing CI/CD pipelines for data and analytics workloads
  • Familiarity with Git‑based source control, build automation, and deployment strategies

Containers & Platform

  • Experience with OpenShift Container Platform (OCP) for deploying data workloads and services
  • Understanding of containerized architecture, scaling, and environment management
  • Proven ability to build CI/CD pipelines for data and infrastructure workloads
  • Experience managing secrets securely using Google Cloud Platform Secret Manager (a retrieval sketch follows this list)
  • Ownership of observability, SLOs, dashboards, alerts, and runbooks
  • Proficiency in logging, monitoring, and alerting for data pipelines and platform reliability
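
For flavor on the Secret Manager bullet above, a minimal sketch of retrieving a secret with the google-cloud-secret-manager Python client; the project and secret IDs are hypothetical placeholders, not values from this posting:

    from google.cloud import secretmanager

    def get_secret(project_id: str, secret_id: str, version: str = "latest") -> str:
        """Fetch a secret payload from Google Cloud Secret Manager."""
        client = secretmanager.SecretManagerServiceClient()
        name = f"projects/{project_id}/secrets/{secret_id}/versions/{version}"
        response = client.access_secret_version(request={"name": name})
        return response.payload.data.decode("UTF-8")

    # Hypothetical usage: inject credentials at runtime rather than baking them into images
    db_password = get_secret("my-gcp-project", "warehouse-db-password")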

Big Data & Processing

  • Hands-on experience with PySpark for ETL/ELT, data transformation, and performance optimization (a minimal ETL sketch follows this list)
  • Solid understanding of distributed data processing concepts
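
To illustrate the PySpark expectation, a minimal ETL sketch under assumed inputs; the bucket paths, column names, and app name are hypothetical:

    from pyspark.sql import SparkSession
    from pyspark.sql import functions as F

    spark = SparkSession.builder.appName("iam-etl-sketch").getOrCreate()

    # Read raw extracts, deduplicate on a business key, and derive a date column
    raw = spark.read.parquet("gs://example-bucket/bronze/iam_events/")  # hypothetical path
    cleaned = (
        raw.dropDuplicates(["event_id"])
           .withColumn("event_date", F.to_date("event_ts"))
    )

    # Aggregate per day and identity domain for downstream reporting
    daily_counts = cleaned.groupBy("event_date", "domain").agg(F.count("*").alias("events"))

    # Write partitioned Parquet so consumers can prune by date
    daily_counts.write.mode("overwrite").partitionBy("event_date").parquet(
        "gs://example-bucket/silver/iam_event_counts/"
    )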

Data & Cloud Architecture

  • Strong experience designing data platforms on Google Cloud Platform (GCP)
  • Experience with Data Lakes, data warehousing, and large‑scale migration programs

 

Data Lake Architecture & Storage

  • Proven experience designing and implementing data lake architectures (e.g., Bronze/Silver/Gold or layered models)
  • Strong knowledge of Cloud Storage (GCS) design, including bucket layout, naming conventions, lifecycle policies, and access controls (a lifecycle-rule sketch follows this list)
  • Experience with Hadoop/HDFS architecture, distributed file systems, and data locality principles
  • Hands-on experience with columnar data formats (Parquet, Avro, ORC) and compression techniques
  • Expertise in partitioning strategies, backfills, and large-scale data organization
  • Ability to design data models optimized for analytics and BI consumption
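
As a small example of the lifecycle-policy point above, a sketch that ages out raw landing files with the google-cloud-storage client; the bucket name and 90-day window are assumptions for illustration:

    from google.cloud import storage

    client = storage.Client()
    bucket = client.get_bucket("example-datalake-bronze")  # hypothetical bucket

    # Delete raw landing objects after 90 days to control storage cost
    bucket.add_lifecycle_delete_rule(age=90)
    bucket.patch()  # persist the updated lifecycle configuration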

 

Data Ingestion & Orchestration

  • Experience building batch and streaming ingestion pipelines using Google Cloud Platform-native services
  • Knowledge of Pub/Sub-based streaming architectures, event schema design, and versioning
  • Strong understanding of incremental ingestion and CDC patterns, including idempotency and deduplication
  • Hands-on experience with workflow orchestration tools such as Cloud Composer / Airflow (a skeletal DAG sketch follows this list)
  • Ability to design robust error handling, replay, and backfill mechanisms
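
For the orchestration bullet, a skeletal Airflow DAG as it might run on Cloud Composer; the task IDs, commands, and schedule are placeholders:

    from datetime import datetime

    from airflow import DAG
    from airflow.operators.bash import BashOperator

    # A minimal daily ingestion skeleton (Airflow 2.4+; older versions use schedule_interval)
    with DAG(
        dag_id="iam_daily_ingest",
        start_date=datetime(2024, 1, 1),
        schedule="@daily",
        catchup=False,
    ) as dag:
        extract = BashOperator(task_id="extract", bash_command="echo extract")
        load = BashOperator(task_id="load", bash_command="echo load")
        extract >> load  # load runs only after extract succeeds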

 

Data Processing & Transformation

  • Experience developing scalable batch and streaming pipelines using Dataflow (Apache Beam) and/or Spark (Dataproc)
  • Strong proficiency in BigQuery SQL, including query optimization, partitioning, clustering, and cost control (a DDL sketch follows this list)
  • Hands-on experience with Hadoop MapReduce and ecosystem tools (Hive, Pig, Sqoop)
  • Advanced Python programming skills for data engineering, including testing and maintainable code design
  • Experience managing schema evolution while minimizing downstream impact
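
To ground the BigQuery bullet, a sketch of partition-plus-cluster DDL issued through the Python client; the dataset and column names are hypothetical:

    from google.cloud import bigquery

    client = bigquery.Client()

    # Partition by event date and cluster by a common filter column so typical
    # per-day, per-domain queries scan (and bill for) fewer bytes
    ddl = """
    CREATE TABLE IF NOT EXISTS analytics.iam_events
    PARTITION BY DATE(event_ts)
    CLUSTER BY domain AS
    SELECT * FROM staging.iam_events_raw
    """
    client.query(ddl).result()  # block until the DDL job completes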

 

Analytics & Data Serving

  • Expertise in BigQuery performance optimization and data serving patterns
  • Experience building semantic layers and governed metrics for consistent analytics
  • Familiarity with BI integration, access controls, and dashboard standards
  • Understanding of data exposure patterns via views, APIs, or curated datasets

 

Data Governance, Quality & Metadata

  • Experience implementing data catalogs, metadata management, and ownership models
  • Understanding of data lineage for auditability and troubleshooting
  • Strong focus on data quality frameworks, including validation, freshness checks, and alerting (a freshness-check sketch follows this list)
  • Experience defining and enforcing data contracts, schemas, and SLAs
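
For the freshness-check bullet, a minimal sketch of a quality gate against BigQuery; the table, timestamp column, and 6-hour SLA are assumptions:

    from datetime import datetime, timedelta, timezone

    from google.cloud import bigquery

    client = bigquery.Client()

    # Hypothetical freshness check: fail loudly if the newest row breaches the SLA
    sql = "SELECT MAX(ingest_ts) AS latest FROM analytics.iam_events"
    latest = next(iter(client.query(sql).result())).latest

    if latest is None or datetime.now(timezone.utc) - latest > timedelta(hours=6):
        raise RuntimeError("iam_events is stale: 6h freshness SLA violated")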

 

Good to Have

Security, Privacy & Compliance

  • Hands-on experience implementing fine-grained access controls for BigQuery and GCS
  • Experience with sprint planning and providing technical guidance to the team
  • Strong stakeholder communication and solution-architecture skills

 

Qualifications

  • Experience: 10–14+ years in DevOps and data architecture, including 5+ years designing with PySpark/Google Cloud Platform/OCP at scale; prior on-prem-to-cloud migration is a must.
  • Education: Bachelor’s/Master’s in Computer Science, Information Systems, or equivalent experience.
  • Certifications: Google Cloud Professional Cloud Architect/DevOps/OCP (required, or to be obtained within 3 months). Preferred: Professional Data Engineer, Security Engineer.
  • Dice Id: 91131106
  • Position Id: 8964568

Company Info

About Rivago infotech inc

Rivago Infotech Inc has been a leader in IT staffing and software development for over 5 years and is one of the largest diversity and development firms in the industry. We are known for our high-touch, customer-centric approach, offering our clients unmatched quality, responsiveness, and flexibility. Our clients appreciate our streamlined execution, highly efficient service, and exceptional talent management that go above and beyond traditional staffing services.

