Job Title Infrastructure Development Engineer Job Summary We are looking for an Infrastructure Development Engineer to design, operate, and scale foundational datacenter services that power bare-metal, virtualization, and cloud-adjacent platforms. This role owns the automation to boot and manage critical services such as corporate IPAM/DDI, CMDB, and datacenter bootstrapping systems. You will work across hardware, networking, and platform teams to ensure infrastructure is discoverable, automated, reliable, and ready for self-service consumption. Key Responsibilities Build automation and tools in Python Develop Pythonbased tools and services for provisioning, configuration, monitoring, and selfservice workflows Automate repetitive operational tasks (imaging, deployments, health checks, remediation) and reduce manual intervention Integrate with internal and external APIs to orchestrate infrastructure workflows (compute, storage, network, cloud) Developers with experience in other programming languages, such as C++ or Java, will also be considered. Software Defined Network Services: IPAM, DDI & CMDB Own and operate the corporate IP Address Management (IPAM) and DDI (DNS, DHCP, IPAM) platforms. Design scalable IP allocation, DNS, and DHCP strategies across multiple datacenters and environments. Integrate IPAM/DDI systems with provisioning, bootstrapping, and CMDB workflows. Act as a steward of the CMDB , ensuring accuracy, consistency, and automation-driven updates. Define and enforce standards for asset discovery, lifecycle state, and dependency mapping.
Monitoring, observability, and reliability Implement and improve monitoring, alerting, and dashboards for infrastructure health (e.g., Prometheus, Grafana, ELK/Nagios or similar) Define and track key metrics (availability, latency, capacity, error rates), and drive improvements based on data Participate in incident response, perform root cause analysis, and implement longterm fixes and runbooks Required Skills & Experience Core Technical Skills Experience with bare-metal provisioning and hypervisor deployment. Hands-on experience with OpenStack, VMware, KubeVirt , or similar virtualization platforms. Deep understanding of IPAM, DNS, and DHCP at enterprise scale. Experience operating or integrating CMDB systems as a source of truth. Solid knowledge of datacenter networking concepts, including Fibre Channel. Proficiency with Linux systems and troubleshooting at hardware and OS layers. Automation & Systems Thinking Experience building infrastructure automation and onboarding pipelines. Familiarity with API-driven integrations and workflow orchestration. Ability to reason about infrastructure as a platform, not just individual systems. Collaboration & Ownership Comfortable working cross-functionally with hardware, network, storage, and SRE teams. Strong operational mindset with a focus on reliability, correctness, and supportability. Ability to drive ambiguous problems to clear, automated solutions.
Nice to Have Experience with large-scale internal platforms or infrastructure as a product . Background in SRE or reliability engineering. Exposure to self-service infrastructure platforms and developer enablement. Experience operating in multi-datacenter or hybrid environments. Server Bootstrapping & Provisioning automation Familiar with datacenter bootstrapping services, including PXE, imaging, and initial OS/hypervisor provisioning. Ensure seamless handoff from hardware arrival to production-ready infrastructure. Improve time-to-serve metrics for new racks, clusters, and testbeds. Impact In this role, you ll directly influence how fast and safely infrastructure is delivered to engineers. Your work will reduce manual effort, improve reliability, and create a clear, automated path from hardware to production-ready services.