Data & Software Engineer

McLean, VA, US • Posted 1 hour ago • Updated 1 hour ago
Full Time
On-site

Job Details

Skills

  • Data Flow
  • Java
  • Data Security
  • Privacy
  • Regulatory Compliance
  • Extract
  • Transform
  • Load
  • Workflow
  • Orchestration
  • Cloud Computing
  • MySQL
  • PostgreSQL
  • Performance Tuning
  • Query Optimization
  • Analytical Skill
  • Git
  • Geospatial Analysis
  • Bash
  • Scripting
  • Data Processing
  • Machine Learning (ML)
  • Problem Solving
  • Conflict Resolution
  • Debugging
  • Data Quality
  • Data Migration
  • Data Engineering
  • Documentation
  • Design Patterns
  • Computer Science
  • Finance
  • Apache Spark
  • PySpark
  • Python
  • Pandas
  • NumPy
  • Docker
  • Amazon Web Services
  • Amazon S3
  • Step-Functions
  • SQL
  • NoSQL
  • Amazon DynamoDB
  • Unity
  • Operations Support Systems
  • Apache HTTP Server
  • Terraform
  • PostGIS

Summary

Overview

We are seeking a Data & Software Engineer to work with a small team building complex data flows for a custom application. The successful candidate will have advanced Python programming skills, familiarity with Java, an understanding of data security, privacy, governance, and compliance principles, and a demonstrated history of building production data pipelines and ETL workflows at scale. Candidates must have experience in the areas listed below.

What will you do?

  • Building end-to-end data pipelines leveraging Python
  • Using orchestration tools to deploy data pipelines, including configuring and updating Spark jobs
  • Containerizing and deploying applications in cloud environments like AWS
  • Working with MySQL and PostgreSQL, including performance tuning, schema design, and query optimization for complex analytical workloads
  • Leveraging industry-standard tools for code control (Git, IaC control, etc.)
  • Working with data catalogs, tracking data lineage, and handling a variety of data formats, including geospatial
  • Using Bash scripting for automation and data processing tasks
  • Integrating AI/ML services and models
  • Work with stakeholders to understand data requirements, assess feasibility, and design appropriate solutions with minimal oversight
  • Leverage strong problem-solving and debugging skills for data quality issues, pipeline failures, and performance bottlenecks
  • Leverage a background in large-scale data migration or platform modernization efforts
  • Contribute to data engineering documentation, best practices, and design patterns

Do you have what it takes?

  • Active TS/SCI with polygraph required.
  • Bachelor's degree in Computer Science, Engineering, Finance, or a related technical field, or equivalent practical experience.
  • Minimum of 5 years' experience with:
    Apache Spark & PySpark
    Advanced Python skills (including Pandas & NumPy)
    Docker, Podman
    AWS S3, Lambda & Step Functions
    Apache Iceberg, Airflow, etc.
    SQL (with Trino)
    NoSQL, DynamoDB
    Unity Catalog OSS, Apache Polaris
    Apache Superset
    Terraform or CloudFormation
    OpenLineage
    H3, PostGIS
  • Dice Id: RTL806649
  • Position Id: de7ef592029520e7e1531e49ef4fbe25