Job DescriptionSALARY RANGE $190,000 - $260,000/year.
DUTIES As part of the Secure the Enterprise initiative, develop capabilities to shift from the current manual system security evaluation and authorization process to a new model that emphasizes
automation, streamlined processes and approvals, continuous monitoring and assessment, and network data gathering across the entire life cycle of a project.
The Data Scientist is responsible for the following core activities:
- Algorithm Development: Create machine learning, statistical, data mining, and graph-based algorithms to interpret complex datasets.
- Model Selection: Evaluate multiple prototype algorithms and select a final model based on relevant performance metrics.
- Data Generation: Construct experiments or models to generate necessary data when standard training or example datasets are missing.
- Reporting and Visualization: Develop reports and visual tools that summarize data findings to provide clients with actionable, data-driven insights.
- Process Automation: Collaborate with subject matter experts to convert manual data analysis methods into automated analytic solutions.
- Operational Integration: Deploy prototype algorithms into production systems to ensure they are integrated effectively into analyst workflows.
Essential Duties and Responsibilities:- Leverage Python as a primary language to process data to accurately determine if resources and systems are secure
- Look for outliers to help track the progress of systems through the Risk Management Framework lifecycle.
- Identify which System a Resource belongs to determined by various attributes of the identified lost Resource against known potential System information.
- Design, develop, and maintain ETL pipelines to extract security and compliance data from multiple sources (network sensors, security tools, compliance databases), transform the data for analysis and reporting, and load it into target data repositories to support continuous monitoring and automated assessments.
- Support data engineering operations including data quality validation, pipeline monitoring, and optimization of data workflows to ensure reliable, scalable, and timely delivery of security-related data for Risk Management Framework automation and decision-making.
Required SkillsRequired:
Background in statistical analysis
Experience with building, tuning, and testing predictive models
Experience creating analytic charts and dashboards
Desired SkillsDesired:
Elasticsearch
RegEx
Machine learning
Natural Language Processing
Regression and predictive analysis
Python
MATLAB or R
SQL or Mongodb
Metric Database (Grafana/Graphite/InfluxDB)