Job Description:
***Corp to Corp resumes are accepted
Location Requirement: Onsite, hybrid, or remote. Candidate MUST be a CURRENT WI resident. NO RELOCATION ALLOWED. All work must be done within the State of Wisconsin. 100% remote work is an option; onsite work can be accommodated if the candidate prefers.
The Cloud Operations and Security Monitoring Engineer is responsible for the day-to-day reliability, performance, and security visibility of the organization's cloud environment, with a primary focus on Amazon Web Services (AWS). This role ensures cloud services are resilient, well-governed, cost-efficient, and continuously monitored for security threats and operational risks. The position partners closely with application and security teams to maintain stable operations while improving automation, observability, and incident response capabilities.
Key Responsibilities:
Cloud Operations
- Administer and support AWS infrastructure, including compute, storage, networking, and identity services.
- Monitor system health, availability, and performance across cloud workloads.
- Manage deployments, configuration changes, and environment provisioning using infrastructure-as-code and automation tools.
- Troubleshoot service disruptions, performance degradation, and integration issues.
- Optimize cloud resource utilization and cost management through rightsizing and lifecycle controls.
- Maintain backup, recovery, and disaster recovery readiness for critical systems.
Security Monitoring and Response
- Monitor cloud security telemetry, logs, and alerts to identify potential threats or anomalous behavior.
- Investigate security events and coordinate incident response activities.
- Maintain and tune detection rules, alert thresholds, and monitoring dashboards.
- Support vulnerability management, patching, and remediation tracking for cloud resources.
- Assist with security audits, compliance evidence collection, and control validation.
- Collaborate with security engineering and governance teams to strengthen cloud security posture.
Automation, Observability, and Continuous Improvement
- Implement automated monitoring, alerting, and remediation where feasible.
- Develop and maintain operational dashboards and reporting for system health and security status.
- Contribute to runbooks, standard operating procedures, and incident response documentation.
- Identify recurring operational or security issues and recommend long-term corrective actions.
- Participate in on-call rotation and post-incident reviews.
Required Skills: 5-7+ years of experience
- Experience operating and supporting workloads in AWS environments.
- Hands-on experience with cloud monitoring, logging, and alerting tools.
- Foundational understanding of networking, identity and access management, and system administration.
- Experience investigating incidents, troubleshooting outages, or responding to security alerts.
- Familiarity with scripting or automation (e.g., PowerShell, Python, or Bash).
Desired Skills:
- Experience with cloud security services, SIEM, or threat detection platforms.
- Understanding of compliance or regulatory frameworks (e.g., NIST, CIS, or similar standards).
- Experience configuring and hardening cloud services to align with organizational security and operational standards.