Data Engineer

    • Full Time


    • Software development
    • Mathematics
    • Optimization
    • Big data
    • Java
    • Database
    • Ceph
    • Apache Hadoop
    • Apache NiFi
    • Python

    Job Description

    Job ID: 2212770

    Location: MCLEAN, VA, US

    Date Posted: 2022-08-24

    Category: Information Technology

    Subcategory: Data Scientist

    Schedule: Full-time

    Shift: Day Job

    Travel: No

    Minimum Clearance Required: TS/SCI with Poly

    Clearance Level Must Be Able to Obtain: None

    Potential for Remote Work: No


    SAIC, a leading provider of systems development & deployment, targeting & intelligence analysis, systems engineering & integration, and training capabilities and solutions for the Intelligence Community, is seeking creative and dedicated professionals to fulfill their career goals and objectives while delivering mission excellence on programs of national importance. The data engineer position supports a big data ecosystem and is located in McLean, VA. Be part of a team while completing individual assignments to deliver the best possible solution to our customer!

    The data engineer supports the design, development, deployment, and maintenance of a sophisticated big data ecosystem critical to answering key intelligence questions. The successful candidate will support a wide variety of data processing, data-flow, data management, data modeling, and data optimization efforts critical to our customer's mission. You will identify needs associated with database design, optimization, and implementation to store big data datasets; orchestrate complex data flow patterns and data enrichment analytics from a diverse and constantly growing range of data sets; and build and test solutions to address mission requirements for real-time data ingest and analysis.

    Responsibilities include:
    • Manage and test new models for data processing and data flow patterns. Work with key stakeholders to identify and remediate issues related to broken data flows.
    • Leverage mathematics, computer science, and data science expertise to support analytic design, development, and implementations to support critical mission requirements.
    • Evaluate, demonstrate, and deploy hot/cold storage design patterns for cost optimization.
    • Maintain awareness of emerging technologies and advancements in database design and management, data science, and machine learning.

    Qualifications:
    • Active TS/SCI clearance with polygraph
    • Bachelor's degree
    • 5 years of experience in data science technologies and tools
    • Experience with distributed compute technologies such as Spark, Hadoop, or similar.
    • Experience with data flow management and orchestration tools such as NiFi, Airflow, or similar.
    • Experience with coding languages such as Python, Java, Spark, or similar.
    • Experience with implementing database technologies such as SQL, Mongo, or similar.
    • Experience evaluating, prototyping, and deploying big data database technologies such as Accumulo, HBase, Cassandra, Elastic, or similar.
    • Experience with containerization technologies such as Docker, Podman, Kubernetes, or similar.
    • Experience with distributed file systems such as Hadoop File System, Gluster, Ceph, or similar.
    • Experience with bucket storage technologies such as S3 or similar.

    Covid Policy: SAIC does not require COVID-19 vaccinations or boosters. Customer site vaccination requirements must be followed when work is performed at a customer site.