Role : Data Architect - Active Metadata
Location : Menlo Park, CA (Remote)
Experience : 12+ Years
Job Description
Role Overview:
We are seeking a visionary to architect a Self-Healing, Autonomous Data Fabric. You will replace legacy ETL with a "nervous system" where metadata is active, governance is computational, and data sharing is zero-copy.
Mandatory Skills:
- Active Metadata: Experience building closed-loop automation (e.g., metadata-triggered autonomous schema repair).
- Semantic Engineering: Mastery of RDF, OWL, and SHACL for ontology-first modeling and SPARQL reasoning.
- Production-level Open Policy Agent (OPA)/ Policy-as-Code(Zero-Trust) for dynamic, context-aware access control.
Other Technical Skills:
- Advanced Privacy: Implementation of Homomorphic Encryption (FHE) or SMPC for analytics on encrypted PII.
- Zero-Copy Architecture: Expertise in Delta Sharing for cross-cloud analytics without egress.
- Compute: Trino (GraalVM), StarRocks, DuckDB (WASM).
- Orchestration: Dagster, Airflow (Provider-level).
- Semantic Layer: Stardog, Apache Jena, GraphQL Federation.
- System Languages: Rust, Clojure, or Java.