Senior AI Data Engineer

Herndon, VA, US • Posted 1 day ago • Updated 7 hours ago
Full Time
On-site
USD $165,000.00 - 180,000.00 per year
Fitment

Dice Job Match Score™

🎯 Assessing qualifications...

Job Details

Skills

  • Pivotal
  • Version Control
  • Continuous Integration
  • Continuous Delivery
  • Automated Testing
  • Meta-data Management
  • Data Governance
  • Access Control
  • Collaboration
  • Business Analysis
  • Business Analytics
  • Computer Science
  • Computer Engineering
  • Data Science
  • Python
  • PySpark
  • Apache Airflow
  • Generative Artificial Intelligence (AI)
  • Data Engineering
  • Workflow
  • Unity
  • Machine Learning Operations (ML Ops)
  • Cloud Computing
  • Amazon S3
  • Databricks
  • Amazon Web Services
  • Data Analysis
  • Data Processing
  • Apache Kafka
  • Streaming
  • Data Quality
  • Analytics
  • Open Source
  • Docker
  • Orchestration
  • Kubernetes
  • Vector Databases
  • Artificial Intelligence
  • Machine Learning (ML)
  • Law
  • Training
  • Recruiting
  • Industrial Security
  • DoD
  • Security Clearance

Summary

Why Karsun?

Join Karsun Solutions to grow your career with the company transforming possible for the US Government.

At Karsun, collaboration drives our community. We're committed to building an environment where team members from diverse backgrounds can innovate, learn and grow with us. Here at Karsun, the only limit to your potential is the limit of your curiosity.

Join Team Karsun, and Find Your Next!

Summary

We are seeking a highly skilled and motivated Sr. AI Data Engineer with a proven track record in building scalable data platforms and incorporating Generative AI into data engineering workflows. The ideal candidate will have deep expertise in Databricks capabilities-including Delta Lake and Unity Catalog-to power AI and machine learning initiatives. You will play a pivotal role in setting up and operationalizing MLOps directly within Databricks, while seamlessly integrating a variety of open-source tools to enhance data quality, workflow automation, and metadata generation.

What You'll Be Doing:
  • Databricks AI Solutions: Design, build, and maintain scalable data pipelines and workflows using Databricks to directly support AI/ML and analytics workloads. Leverage core capabilities like Delta Lake, Delta Live Tables, and Databricks Workflows to create high-performance data platforms.
  • MLOps Operationalization: Set up, establish, and operationalize MLOps practices directly within the Databricks environment, including version control, CI/CD for data pipelines, automated testing, and model deployment strategies.
  • Open-Source Integration: Utilize and integrate open-source tools such as Python, PySpark, and Apache Airflow for distributed data processing and workflow orchestration.
  • GenAI-Enhanced Workflows: Implement GenAI-enhanced workflows using LLMs to automate metadata generation, create data dictionaries, validate data quality, and track data lineage.
  • Architecture & Governance: Leverage medallion architecture (Bronze, Silver, Gold layers) following data lakehouse best practices. Integrate Unity Catalog for enterprise data governance and access control. Implement and operationalize best practices.
  • Data Preparation: Collaborate with AI/ML teams to curate, prepare, and serve high-quality datasets for model training and inference.

Required Qualifications:
  • BA or BS degree in Computer Science, Computer Engineering, Data Science, or a related field (Master's degree is a plus).
  • Open-Source Proficiency: 5+ years of strong proficiency in open-source languages and frameworks, specifically Python and PySpark, for distributed data processing. Strong knowledge of open-source data orchestration tools like Apache Airflow.
  • AI/Data Engineering: 5+ years of proven experience building large-scale data platforms, with at least 2+ years incorporating Generative AI into data engineering workflows.
  • Databricks Expertise: 3+ years of hands-on experience with the Databricks platform, specifically leveraging data engineering and AI features (Delta Lake, DLT, Workflows, Unity Catalog).
  • MLOps: Proven experience setting up, maintaining, and operationalizing MLOps frameworks within Databricks.
  • Cloud & Architecture: 3+ years of experience with AWS data services (e.g., S3, Glue, Lambda) and a deep understanding of data lakehouse architecture.
  • Certifications in Databricks Data Engineer Associate/Professional or AWS Data Analytics.

Preferred Qualifications:
  • Experience with open-source streaming data processing tools like Apache Kafka or Structured Streaming.
  • Familiarity with open-source data quality and analytics engineering tools such as dbt (data build tool), Great Expectations, or Sweetviz.
  • Experience with open-source containerization (Docker) and orchestration (Kubernetes) for data applications.
  • Understanding of vector databases and embedding pipelines for AI/ML applications.

Things to Know:

Commitment to Non-Discrimination

All qualified applicants will receive consideration for employment without regard to disability, status as a protected veteran or any other status protected by applicable federal, state, local, or international law.

Salary Range

The proposed salary range for this role is $165,000 to $180,000 USD. The salary range provided is a good faith estimate representative of all experience levels. Karsun considers several factors when extending an offer, including but not limited to, the role, function and associated responsibilities, a candidate's work experience, location, education/training, and key skills.

Third Party Resumes: Karsun does not accept unsolicited resumes through or from search firms or staffing agencies. All unsolicited resumes will be considered the property of Karsun and Karsun will not be obligated to pay a placement fee.

Clearance Information

This position requires the eligibility to obtain a security clearance. The Defense Industrial Security Clearance Office (DISCO), an agency of the Department of Defense, handles and adjudicates the security clearance process. More information about Security Clearances can be found on the US Department of State government website: ;br>
Location

To be considered for this role, you must reside in one of the following states: CA, CO, DC, FL, GA, IL, MD, NJ, NY, NC, OH, OK, PA, SC, TX, VA, WV.

Applicants must be authorized to work in the U.S. We may consider candidates currently in H-1B status who are eligible for transfer.
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.
  • Dice Id: RTX15a3f1
  • Position Id: 57a814f71d9de2996619a1791ce51bda
  • Posted 1 day ago
Create job alert
Set job alertNever miss an opportunity! Create an alert based on the job you applied for.

Similar Jobs

Herndon, Virginia

Today

Full-time

USD 160,000.00 - 190,000.00 per year

McLean, Virginia

Today

Full-time

McLean, Virginia

Today

Full-time

Reston, Virginia

Today

Easy Apply

Full-time

$80,000 - $120,000

Search all similar jobs