Overview
Remote
USD 72,150.00 - 130,425.00 per year
Full Time
Skills
NMCI
Service Management
Backbone.js
Intranet
Cyber Security
Network Operations
Network Engineering
Service Desk
Operational Excellence
Requirements Elicitation
Functional Requirements
Operational Efficiency
Reporting
User Experience
Dashboard
Performance Metrics
KPI
Decision-making
Clarity
Design Review
Workflow
Change Management
Documentation
Process Flow
Specification Gathering
Training
Data Analysis
Process Improvement
Continuous Improvement
Innovation
Software Engineering
DoD
Scrum
Microsoft Azure
Analytical Skill
Conflict Resolution
Problem Solving
Communication
Collaboration
Agile
DevSecOps
Atlassian
JIRA
Confluence
Bitbucket
Risk Management Framework
RMF
STIG
Business Analysis
Certified Business Analysis Professional
Software Development
Software Development Methodology
DevOps
Incident Management
Reliability Engineering
Service Level
MEAN Stack
Recovery
Automated Testing
Scalability
Performance Testing
Job Details
More About the Role:
The NMCI Service Management Integration and Transport (SMIT) group at Leidos has an opening for a Site Reliability Engineering (SRE) Business Analyst to bridge the gap between business objectives and technical requirements for the Site Reliability Engineering (SRE) teams. Under the SMIT Contract, the Leidos team is responsible for the core backbone for the Navy-Marine Corps Intranet, including cybersecurity services, network operations, network engineering, service desk, seat support services, and data transport.
The SRE Business Analyst will work closely with internal and customer stakeholders across the engineering organization to understand business needs, gather requirements, and translate them into technical specifications that drive system reliability and performance. You will analyze existing processes, identify improvement opportunities, and support the implementation of solutions that align with the organization's goals for reliability and operational excellence. Your work will contribute to the development of robust and scalable services that operate reliably in production.
What You'll Get to Do:
Requirements Gathering and Analysis:
Collaborate with business stakeholders to identify and understand their needs and expectations regarding system reliability and performance.
Gather and document functional and non-functional requirements for SRE initiatives, ensuring alignment with business objectives.
Analyze existing processes and systems to identify gaps and areas for improvement in reliability and operational efficiency.
Data Analysis and Reporting:
Collect, analyze, and interpret data related to system performance, incident management, and user experience.
Develop dashboards and reports to provide insights into system reliability, performance metrics, and key performance indicators (KPIs) relevant to SRE efforts with our quality team.
Present findings and recommendations to stakeholders, enabling data-driven decision-making for reliability initiatives.
Collaboration with Technical Teams:
Work closely with SRE, development, and operations teams to translate business requirements into technical specifications and actionable tasks.
Facilitate communication between technical teams and business stakeholders to ensure clarity and alignment with designated SRE teams.
Participate in design reviews and assist in validating that solutions meet business requirements and reliability standards.
Process Improvement:
Identify opportunities for process improvements that enhance the reliability and efficiency of systems and workflows.
Lead initiatives to implement best practices in incident management, change management, and other operational processes to minimize downtime and enhance service quality.
Collaborate with teams to establish and refine service level objectives (SLOs) and service level indicators (SLIs) that reflect business priorities.
Documentation and Training:
Create and maintain documentation related to business requirements, process flows, and technical specifications for SRE teams in Jira and/or Azure DevOps.
Develop training materials and conduct training sessions for stakeholders to promote understanding of SRE practices and tools.
Continuous Improvement:
Stay up to date with industry trends and best practices related to site reliability engineering, data analysis, and process improvement.
Participate in continuous improvement efforts, contributing to a culture of learning and innovation within the SRE team.
You'll Bring These Qualifications:
Requires B.S. Degree and 4-8 years of prior relevant experience or Masters with 2-6 years of prior relevant experience (in IT, software development or related technical domain) and 3-5 years as a Business Analyst within a technical or software engineering environment.
Must be able to obtain a DoD 8570.01 IAT Level II Certification and maintain certification while supporting the SMIT Contract.
Experience with Agile and Scrum methodologies and tools like Jira, Confluence, Trello, or Azure DevOps.
Strong understanding of site reliability engineering or DevSecOps principles, practices, and methodologies.
Familiarity with monitoring and observability tools used in SRE.
Strong analytical and problem-solving skills, with the ability to synthesize complex information and provide actionable insights.
Ability to evaluate and prioritize business needs and align them with technical capabilities.
Skilled at working with geographically distributed teams.
Excellent communication skills, both written and verbal, with the ability to convey technical concepts to non-technical stakeholders.
Proven ability to collaborate effectively with cross-functional teams and build strong relationships with stakeholders.
Knowledge of Agile and DevSecOps/SRE concepts and best practices, with a desire to grow that knowledge.
Hand-on experience with Atlassian products (Jira, Confluence, Bitbucket, etc.).
Knowledge of the Risk Management Framework (RMF), DISA STIGs.
These Qualifications Would be Nice to Have:
Certification in business analysis (i.e. CBAP, CCBA) or related field.
Experience in the software development lifecycle (SDLC) and understanding of DevOps practices.
Knowledge of incident management and service reliability best practices.
Key Metrics of Success for the Team:
Improved system reliability, as measured by adherence to Service Level Objectives (SLOs) and reduced Mean Time to Recovery (MTTR).
Comprehensive and regularly updated automated test coverage for all critical systems and infrastructure components.
Timely identification and resolution of performance bottlenecks and failure points.
Increased scalability and performance of systems under high load due to effective performance testing.
NGEN
At Leidos, we don't want someone who "fits the mold"-we want someone who melts it down and builds something better. This is a role for the restless, the over-caffeinated, the ones who ask, "what's next?" before the dust settles on "what's now."
If you're already scheming step 20 while everyone else is still debating step 2... good. You'll fit right in.
Original Posting:
October 17, 2025
For U.S. Positions: While subject to change based on business needs, Leidos reasonably anticipates that this job requisition will remain open for at least 3 days with an anticipated close date of no earlier than 3 days after the original posting date as listed above.
Pay Range:
Pay Range $72,150.00 - $130,425.00
The Leidos pay range for this job level is a general guideline only and not a guarantee of compensation or salary. Additional factors considered in extending an offer include (but are not limited to) responsibilities of the job, education, experience, knowledge, skills, and abilities, as well as internal equity, alignment with market data, applicable bargaining agreement (if any), or other law.
The NMCI Service Management Integration and Transport (SMIT) group at Leidos has an opening for a Site Reliability Engineering (SRE) Business Analyst to bridge the gap between business objectives and technical requirements for the Site Reliability Engineering (SRE) teams. Under the SMIT Contract, the Leidos team is responsible for the core backbone for the Navy-Marine Corps Intranet, including cybersecurity services, network operations, network engineering, service desk, seat support services, and data transport.
The SRE Business Analyst will work closely with internal and customer stakeholders across the engineering organization to understand business needs, gather requirements, and translate them into technical specifications that drive system reliability and performance. You will analyze existing processes, identify improvement opportunities, and support the implementation of solutions that align with the organization's goals for reliability and operational excellence. Your work will contribute to the development of robust and scalable services that operate reliably in production.
What You'll Get to Do:
Requirements Gathering and Analysis:
Collaborate with business stakeholders to identify and understand their needs and expectations regarding system reliability and performance.
Gather and document functional and non-functional requirements for SRE initiatives, ensuring alignment with business objectives.
Analyze existing processes and systems to identify gaps and areas for improvement in reliability and operational efficiency.
Data Analysis and Reporting:
Collect, analyze, and interpret data related to system performance, incident management, and user experience.
Develop dashboards and reports to provide insights into system reliability, performance metrics, and key performance indicators (KPIs) relevant to SRE efforts with our quality team.
Present findings and recommendations to stakeholders, enabling data-driven decision-making for reliability initiatives.
Collaboration with Technical Teams:
Work closely with SRE, development, and operations teams to translate business requirements into technical specifications and actionable tasks.
Facilitate communication between technical teams and business stakeholders to ensure clarity and alignment with designated SRE teams.
Participate in design reviews and assist in validating that solutions meet business requirements and reliability standards.
Process Improvement:
Identify opportunities for process improvements that enhance the reliability and efficiency of systems and workflows.
Lead initiatives to implement best practices in incident management, change management, and other operational processes to minimize downtime and enhance service quality.
Collaborate with teams to establish and refine service level objectives (SLOs) and service level indicators (SLIs) that reflect business priorities.
Documentation and Training:
Create and maintain documentation related to business requirements, process flows, and technical specifications for SRE teams in Jira and/or Azure DevOps.
Develop training materials and conduct training sessions for stakeholders to promote understanding of SRE practices and tools.
Continuous Improvement:
Stay up to date with industry trends and best practices related to site reliability engineering, data analysis, and process improvement.
Participate in continuous improvement efforts, contributing to a culture of learning and innovation within the SRE team.
You'll Bring These Qualifications:
Requires B.S. Degree and 4-8 years of prior relevant experience or Masters with 2-6 years of prior relevant experience (in IT, software development or related technical domain) and 3-5 years as a Business Analyst within a technical or software engineering environment.
Must be able to obtain a DoD 8570.01 IAT Level II Certification and maintain certification while supporting the SMIT Contract.
Experience with Agile and Scrum methodologies and tools like Jira, Confluence, Trello, or Azure DevOps.
Strong understanding of site reliability engineering or DevSecOps principles, practices, and methodologies.
Familiarity with monitoring and observability tools used in SRE.
Strong analytical and problem-solving skills, with the ability to synthesize complex information and provide actionable insights.
Ability to evaluate and prioritize business needs and align them with technical capabilities.
Skilled at working with geographically distributed teams.
Excellent communication skills, both written and verbal, with the ability to convey technical concepts to non-technical stakeholders.
Proven ability to collaborate effectively with cross-functional teams and build strong relationships with stakeholders.
Knowledge of Agile and DevSecOps/SRE concepts and best practices, with a desire to grow that knowledge.
Hand-on experience with Atlassian products (Jira, Confluence, Bitbucket, etc.).
Knowledge of the Risk Management Framework (RMF), DISA STIGs.
These Qualifications Would be Nice to Have:
Certification in business analysis (i.e. CBAP, CCBA) or related field.
Experience in the software development lifecycle (SDLC) and understanding of DevOps practices.
Knowledge of incident management and service reliability best practices.
Key Metrics of Success for the Team:
Improved system reliability, as measured by adherence to Service Level Objectives (SLOs) and reduced Mean Time to Recovery (MTTR).
Comprehensive and regularly updated automated test coverage for all critical systems and infrastructure components.
Timely identification and resolution of performance bottlenecks and failure points.
Increased scalability and performance of systems under high load due to effective performance testing.
NGEN
At Leidos, we don't want someone who "fits the mold"-we want someone who melts it down and builds something better. This is a role for the restless, the over-caffeinated, the ones who ask, "what's next?" before the dust settles on "what's now."
If you're already scheming step 20 while everyone else is still debating step 2... good. You'll fit right in.
Original Posting:
October 17, 2025
For U.S. Positions: While subject to change based on business needs, Leidos reasonably anticipates that this job requisition will remain open for at least 3 days with an anticipated close date of no earlier than 3 days after the original posting date as listed above.
Pay Range:
Pay Range $72,150.00 - $130,425.00
The Leidos pay range for this job level is a general guideline only and not a guarantee of compensation or salary. Additional factors considered in extending an offer include (but are not limited to) responsibilities of the job, education, experience, knowledge, skills, and abilities, as well as internal equity, alignment with market data, applicable bargaining agreement (if any), or other law.
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.