Company Profile
Blackstraw.ai is an end-to-end technology services company specializing in Artificial Intelligence (AI) and Engineering solutions across Data Science, Data Engineering, LLM/GenAI and LLMOps. Founded in 2018, we help global enterprises across North America, Europe and Asia build and operationalize AI systems that create measurable business impact. Our mission is to make AI adoption simpler, faster and scalable through a blend of deep domain expertise, reusable accelerators and proven engineering practices.
With a 450+ strong team of engineers, data scientists and AI specialists, we partner with organizations to deliver real-world outcomes in areas such as predictive analytics, computer vision, natural language processing and Generative AI.
Headquartered in Florida (USA) with operations in the USA, Canada and India, Blackstraw.ai continues to empower global enterprises to unlock the true potential of AI.
Location: USA / Canada
Experience: 10 to 15 years
Employment Type: Full-time
Role Overview:
- The Architect will design the blueprint for multi-workspace Databricks environments on Google Cloud Platform.
- The Architect will design a unified data platform centered on BigQuery and Dataplex.
- The Architect will design modern Lakehouse and Mesh architectures that leverage the best of both worlds (Google Cloud Platform native and Databricks specific).
- The Architect is responsible for the "Medallion" data flow, security perimeters within Google Cloud, and seamless interoperability between Databricks and BigQuery.
Key Responsibilities
- Infrastructure Design: Architect scalable solutions on Google Cloud Platform (GCP) and manage storage integration with Google Cloud Storage (GCS).
- Data Modeling: Design highly performant Kimball dimensional models for enterprise business processes in Delta Lake schemas (Bronze/Silver/Gold) optimized for Google Cloud Platform's distributed storage.
- Governance & Security: Implement Unity Catalog for fine-grained access control, integrated with Google Cloud IAM and Secret Manager.
- Integration Strategy: Define patterns for data sharing between Databricks and BigQuery (using the BigQuery connector) for hybrid analytics.
- Performance Optimization: Establish best practices for Partitioning, Z-Ordering, and Liquid Clustering to minimize egress costs and maximize query speed.
Technical Requirements
- Platform: 10+ years of overall data architecture experience, with at least 4 years focused on Google Cloud Platform (BigQuery, GCS, Cloud Pub/Sub).
- Databricks Mastery: Deep expertise in Delta Lake, Unity Catalog, and Databricks SQL.
- Networking: Knowledge of Google Cloud Platform VPC Peering, Private Service Connect, and Shared VPCs for secure Databricks deployments.
- Orchestration: Experience designing workflows using Databricks Workflows, Google Cloud Workflows, or Cloud Composer (Airflow).
Soft Skills
- Ability to translate complex technical concepts into actionable insights.
- Strong problem-solving mindset with a bias for experimentation and innovation.
- Collaborative, proactive, and comfortable working in fast-paced environments.
We are an equal opportunity employer. Employment decisions are based on qualifications, merit, and business needs. We do not discriminate on any basis protected by applicable laws in the countries where we operate.