The Senior Cloud Operations Analyst is responsible for leading the management, optimization, and automation of cloud and on-premises infrastructure to ensure seamless operations and business continuity. This role includes driving improvements in observability, server and batch operations, and data center management while proactively identifying and resolving performance and reliability issues. The Senior Cloud Operations Analyst provides technical leadership, mentors team members, and consults with cross-functional teams to enhance operational excellence through best practices, process enhancements, and cutting-edge technologies.
Essential Tasks/Major Duties:
- Independently develop, implement, and maintain observability tools to monitor cloud and on-premises systems.
- Actively support infrastructure teams in the management and maintenance of server systems running on Windows and Linux.
- Create dashboards, alerts, and reports to track system health, performance, and availability.
- Analyze metrics and logs to identify trends, prevent potential issues, and optimize system performance.
- Act as the lead consultant with FinOps teams to monitor resource utilization and ensure cost-effective operations across cloud environments.
- Manage the lifecycle of cloud and on-premises servers, including provisioning, patching, configuration, and decommissioning.
- Troubleshoot and resolve server-related issues, ensuring minimal downtime. Implement and enforce server security policies and compliance requirements.
- Schedule, monitor, and manage batch processes to ensure timely execution of critical tasks.
- Identify and resolve batch failures or delays, coordinating with relevant teams to ensure smooth operations.
- Building new batch jobs for improved performance and resource utilization.
- Lead on-site and remote data center operations, ensuring proper functioning of hardware, power, cooling, and network infrastructure.
- Coordinate with vendors and service providers for hardware maintenance, replacements, and upgrades.
- Participate in on-call rotations to address system incidents and outages promptly.
- Conduct root cause analysis and implement solutions to prevent recurrence of issues.
- Document and communicate incident resolution processes to relevant stakeholders.
- Work closely with cross-functional teams, including DevOps, Networking, and Application Development, to implement and maintain system integrations.
- Maintain comprehensive documentation for configurations, processes, and incident resolutions.
- Provide training and support to team members and other departments.
Nonessential Tasks/Marginal Duties
Knowledge, Skills & Abilities:
- Bachelor s degree in computer science, Information Technology, or a related field, or equivalent experience.
- 5+ years of experience working with monitoring and observability tools (e.g., Datadog, PagerDuty).
- Certified PagerDuty Administrator or equivalent experience required.
- 5+ years of experience in cloud operations or server management roles.
- 5+ years of progressive server administration experience (Windows, Linux).
- 5+ years of experience in designing, implementing, and managing IT workload automation solutions to optimize scheduling, orchestration, and execution of enterprise workflows across on-prem and cloud environments.
- Experience leveraging artificial intelligence to drive innovation and solve complex problems. Demonstrated ability to utilize AI-driven solutions that optimize processes, enhance decision-making, or create transformative business outcomes.
- Demonstrated experience working with Infrastructure as Code (Terraform, CloudFormation, and Ansible).
- Experience leveraging artificial intelligence to drive innovation and solve complex problems. Demonstrated ability to utilize AI-driven solutions that optimize processes, enhance decision-making, or create transformative business outcomes.
- 5+ years working with cloud platforms (AWS, Azure, OCI).
- Certified AWS SysOps Administrator or equivalent experience required.
- Strong experience with data center infrastructure and best practices.
- Proficiency in scripting and automation tools (Python, Bash, PowerShell).
- Strong understanding of networking, security, and identity management in cloud environments.
- Working knowledge of security best practices and compliance standards.
- Working knowledge of agile methodologies.
- Excellent troubleshooting, problem-solving, and communication skills.
Salary/Rate: $50-$58/HR (depends on experience level). This is a contract position with candidates expected to work 40 hours/ week.
About The Company
Peterson Technology Partners (PTP) is an Equal Opportunity Employer committed to creating a transparent, inclusive, and human-centered hiring experience.
For more than 28 years, PTP has operated as one of the top IT staffing and recruiting firms in the USA built on trust, long-term partnerships, and technical excellence.
Based in the Chicago suburb of Park Ridge, IL, our team of more than 500 employees and consultants is dedicated to:
Helping every client make the best hiring decisions possible
Matching professionals with the right IT jobs and career opportunities
As part of that commitment, we believe in providing clear information about how our hiring technologies work and how your data is used. The following section outlines our AI-assisted interview process and your rights as a candidate.
AI-Assisted Interview Experience (Pete & Gabi Rebecca)
To provide a consistent, fair, and flexible experience for all candidates, we use AI-assisted tools to support parts of the interview process. This includes our proprietary AI platform Pete & Gabi, which includes AI recruiter Rebecca.
These AI hiring tools help us:
- Transcribe interviews
- Summarize candidate responses
- Generate job-related insights
- Streamline communication and scheduling
Please note that:
The AI does NOT make hiring decisions; all decisions are made by our human recruiters, hiring managers, or client partners.
The AI does not evaluate facial expressions, emotions, or physical traits; it is used only to support fairness, consistency, and efficiency.
If you prefer a non-AI interview format, we will gladly provide an alternative.
Technical or Case Interviews (Role-Dependent):
When applying for certain tech jobs, you may participate in:
- A technical interview
- A coding challenge
- A case study
- A client-specific assessment
We will always explain what to expect in advance so you can prepare with confidence.
Human Review & Selection:
Every candidate's profile including interviews, conversations, and assessments is reviewed by experienced recruiters and hiring leaders.
AI insights may assist with organization and evaluation, but final decisions are always human-driven.
Your Rights as a Candidate:
At PTP, every candidate has the right to:
Request a non-AI interview path
Ask how your data is being used
Request access to transcripts or interview recordings
Request deletion of your AI-recorded interview
Receive clear, timely communication
Our goal is to ensure you feel respected, informed, and supported throughout your experience.
Our Commitment:
For more than 28 years, PTP has focused on putting people first candidates, consultants, employees, and clients.
We're committed to a hiring process that is:
- Transparent
- Compliant
- Equitable
- Powered by innovative technology that enhances not replaces human judgment
Welcome to the future of hiring at Peterson Technology Partners.
We're excited to learn more about you.
Equal Employment Opportunity:
Peterson Technology Partners is an Equal Opportunity Employer. All qualified applicants will receive consideration without regard to race, color, religion, national origin, gender identity, sexual orientation, disability, veteran status, or any other protected characteristic.