Capacity Engineer

Remote • Posted 6 hours ago • Updated 6 hours ago
Full Time
No Travel Required
Remote
Depends on Experience
Fitment

Dice Job Match Score™

👾 Reticulating splines...

Job Details

Skills

  • Capaciti
  • Data Engineer

Summary

About the client:

Our client is a global technology consulting and digital solutions company that enables enterprises across industries to reimagine business models, accelerate innovation, and maximize growth by harnessing digital technologies.

 

Role--Capacity Engineer with Data Engineer

Location --Remote 

 

  • Data Engineering Stack: SQL, Python, Spark, Airflow for data processing and orchestration.  
  • Monitoring & Observability: Prometheus, Grafana, Datadog.
  • Chaos Engineering: Test system resilience under stress.
  • Infrastructure as Code: Terraform, Ansible, Harness.

 

Data Engineer with strong Site Reliability Engineering (SRE) expertise in capacity planning. This role ensures our infrastructure scales efficiently to meet user demand, balancing performance with cost. The engineer will forecast growth, analyze usage trends, and automate resource provisioning to prevent outages, over-provisioning, or under-provisioning. In addition, the role requires building robust data pipelines and analytical models to support forecasting and decision-making.
Key Responsibilities

·       Data Pipeline Development: Design and maintain ETL/ELT pipelines to collect, transform, and store infrastructure usage data.

·       Data Modeling: Build models to analyze system metrics and predict future resource needs.

·       Demand Forecasting: Analyze historical usage patterns to predict CPU, memory, and storage requirements.

·       Load Testing & Scaling: Simulate traffic spikes to identify bottlenecks and ensure systems scale linearly.

·       Cost Efficiency: Optimize resource allocation to avoid unnecessary costs while maintaining service availability.

·       Automation: Use Infrastructure as Code (IaC) tools like Terraform to automate scaling and provisioning.

·       Architecture Review: Collaborate with software teams to flag single points of failure and ensure resilient service design.

Tools & Techniques

·       Monitoring & Observability: Prometheus, Grafana, Datadog.

·       Chaos Engineering: Test system resilience under stress.

·       Infrastructure as Code: Terraform, Ansible, Harness.

·       Data Engineering Stack: SQL, Python, Spark, Airflow for data processing and orchestration.

 

Qualifications

·       Strong background in data engineering and SRE practices.

·       Hands-on experience with capacity planning, forecasting, and scaling.

·       Proficiency in IaC tools (Terraform, Ansible, Harness).

·       Experience with data pipelines, ETL/ELT frameworks, and big data tools.

·       Familiarity with monitoring/observability platforms (Prometheus, Grafana, Datadog).

·       Knowledge of chaos engineering and resilience testing.

·       Excellent collaboration and communication skills.


Capacity Engineer1SQL,Python,Data Engineer,SRE,capacityN/AFull TimeUnited States
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.
  • Dice Id: 10211255
  • Position Id: 237140-14311-
  • Posted 6 hours ago
Create job alert
Set job alertNever miss an opportunity! Create an alert based on the job you applied for.

Similar Jobs

Remote

Today

Easy Apply

Contract

Depends on Experience

Remote

Today

Full-time

USD 117,900.00 - 168,000.00 per year

Remote

25d ago

Easy Apply

Full-time, Third Party

Depends on Experience

Remote

Today

Full-time

USD 102,000.00 - 170,000.00 per year

Search all similar jobs