Hi
Role: Data Architect
Duration: 6-12+ Months Contract
Location: San Jose, CA / McLean, VA (remote is okay)
Education (very important): only a B.Tech or B.E. in Computer Science is accepted.
Candidates must have demonstrated experience designing and owning an end-to-end data platform from scratch, including data ingestion, storage, processing, and consumption layers, and be able to clearly explain how these components interact.
There is a strong emphasis on scalability and performance engineering, including designing for large-scale, high-volume data systems, optimization strategies, and overall platform efficiency.
Distributed data processing is critical, especially Spark fundamentals. Candidates must demonstrate real hands-on experience, not just theoretical or surface-level exposure.
We are looking for architects who can create and evolve architecture documentation, think beyond existing processes, and communicate designs clearly, with strong architectural ownership and depth.
The Data Architect will be a key contributor to designing, evolving, and optimizing our company's cloud-based data architecture. This role requires a strong background in data engineering, hands-on experience building cloud data solutions, and a talent for communicating complex designs through clear diagrams and documentation.
Mandatory education (no exceptions): B.Tech or B.E. in Computer Science.
Core Responsibilities
Cloud Data Architecture Design & Strategy:
Design and implement secure, scalable cloud-based data pipelines, data warehouses, and data lakes.
Drive the selection and integration of cloud data services (e.g., storage, databases, analytics tools).
Develop comprehensive cloud data strategies in alignment with business goals.
Diagramming & Documentation:
Produce clear and informative visual diagrams (e.g., data flow diagrams, entity-relationship diagrams, system architecture diagrams) to guide implementation and knowledge sharing.
Maintain detailed documentation of data architecture, design decisions, and processes.
Hands-on Implementation & Optimization:
Actively contribute to the hands-on implementation of cloud data solutions.
Proactively identify and implement performance optimization strategies for cloud data systems.
Troubleshoot and resolve issues related to data pipelines, data quality, and data accessibility.
Required Qualifications
Bachelor's degree in Computer Science
Minimum of 5 years of hands-on data engineering experience using distributed computing approaches (Spark, MapReduce, Databricks).
Proven track record of successfully designing and implementing cloud-based data solutions in Azure.
Deep understanding of data modeling concepts and techniques.
Strong proficiency with database systems (relational and non-relational).
Exceptional diagramming skills with tools like Visio, Lucidchart, or other diagramming software.
Preferred Qualifications
Advanced knowledge of cloud-specific data services (e.g., Databricks, Azure Data Lake).
Expertise in big data technologies (e.g., Hadoop, Spark).
Strong understanding of data security and governance principles.
Experience with scripting and query languages (Python, SQL).
Additional Skills
Communication: Exemplary written and verbal communication skills to collaborate effectively with all teams and stakeholders.
Problem-solving: Outstanding analytical and problem-solving skills for complex data challenges.
Teamwork & Leadership: Ability to work effectively in cross-functional teams and demonstrate potential for technical leadership.