Overview
Skills
Job Details
Need local candidate. C2C will work:
Below is the Job Description for AWS Infrastructure/Platform Engineer. This is an urgent requirement but please note that we need someone seasoned who is also good at managing their time and they can effectively communicate with engineers as well a Project managers/Scrum masters.
Title: Senior AWS Platform Engineer (Infrastructure & SRE)
Duration: 6 months Contract with possible extension
Location: Hybrid - 3 days a week in Naperville office
Interview Process: Immediate Interview on Call/video, followed by in person interviews
Note: Please screen candidates internally and send us only your top 2 candidates.
About the Role
We re hiring a handson ( 80% build/operate) AWS Platform Engineer to design, harden, and run our cloud platform with an emphasis on Amazon Connect contact centers, Lex V2 conversational bots, and Lambda (Python) based integrations. You ll own reliability and scalability endtoend instrumenting, automating, and governing a multiaccount AWS environment via Infrastructure as Code while enabling new capabilities (including Bedrockpowered AI) safely and at speed.
What You ll Do (Key Responsibilities)
Platform Engineering & Infrastructure
- Design, provision, and operate AWS foundations (multiaccount landing zone, VPCs, subnets, routing, security controls, IAM guardrails) using Terraform (and/or CDK) and automated change management.
- Build a paved road of reusable Terraform modules, CI/CD templates, and runbooks for serverless and containerized workloads (Lambda, API Gateway, Step Functions, EventBridge, ECS/EKS as needed).
- Implement robust secrets, encryption, and config management (KMS, Secrets Manager, SSM Parameter Store) and enforce leastprivilege IAM.
SRE, Reliability & Observability
- Define and manage SLIs/SLOs for platform and contact flows; implement error budgets and lead incident response, postmortems, and game days.
- Build endtoend visibility with CloudWatch, Datadog, and Operata (dashboards, traces, logs, RUM/synthetics, alerting), including custom Connect/Lex metrics and business KPIs.
- Engineer high availability and disaster recovery (multiAZ, crossRegion failover where needed); manage Lambda concurrency, coldstart mitigation, throttling, and backoff patterns.
Amazon Connect & Conversational AI
- Architect and implement Amazon Connect solutions: contact flows, queues/routing profiles, hours, quick connects, CTR pipelines, Contact Lens based analytics, recordings/retention, and realtime/nearrealtime reporting.
- Build and optimize Lex V2 bots (intents, slots, multiturn, locales), including Lambda (Python) fulfillment/validation hooks, session attributes, and error handling with strong observability.
- Integrate Connect & Lex with CRMs (e.g., Salesforce Service/Experience Cloud), ticketing, knowledge bases, and data platforms; instrument and continuously tune AHT, containment, and CSAT.
- Leverage AWS Bedrock (where appropriate) for generative call/chat assist, summarization, and routing via safe, auditable Lambda/Step Functions patterns.
Automation, CI/CD & Quality
- Own and improve CI/CD (GitHub Actions, Azure DevOps; optionally CodeBuild/CodePipeline) with automated tests, security scans (e.g., Sonarqube, AIkido), and progressive delivery for IaC and app code.
- Write clean, testable Python for Lambdas, tooling, and platform automation (unit/integration tests, packaging, type hints), and codify operational tasks (runbooks, CLI tools).
Security, Compliance & Cost
- Implement PCIaware integrations (e.g., payment flows, DTMF redaction/tokenization, vaulting) and ensure logging, retention, and audit readiness (CloudTrail, Config).
- Drive FinOps: tagging, budgets, rightsizing, lifecycle policies, and data egress controls especially for Connect recordings/CTR/Contact Lens in S3, Athena/Glue analytics, and Kinesis streams.
CrossFunctional Impact
- Partner with product, data, and engineering teams to ship reliable features; communicate clearly with both technical and business stakeholders on tradeoffs, incidents, and roadmaps.
- Support scalable integrations with thirdparty platforms (e.g., Vtex, Shopify) and PCIcompliant payment providers (e.g., Stripe).
MustHave Qualifications
- Deep, handson AWS expertise: Amazon Connect, Lex V2, Lambda (Python), API Gateway, Step Functions, EventBridge, S3, DynamoDB/RDS, CloudWatch, IAM, VPC networking, WAF/Shield, KMS.
- Proven experience designing and operating production Connect environments (contact flows, CTR/Contact Lens analytics, call recording/retention, reporting, routing, telephony integration).
- Lex V2 bot design and operation, including Lambdabased fulfillment/validation in Python (boto3), robust error handling, and conversation analytics.
- Strong Infrastructure as Code with Terraform (modules, workspaces, policyascode, drift detection) and GitOps workflows.
- SRE skills: SLIs/SLOs, incident management/oncall, postmortems, capacity/perf tuning, high availability, and DR patterns.
- Observability at scale with Datadog, CloudWatch, and/or Operata (metrics, traces, logs, alerts, synthetics).
- CI/CD automation (GitHub Actions, Azure DevOps) for both application and IaC pipelines.
- Experience integrating AWS services with Salesforce Service/Experience Cloud and with PCIcompliant payment platforms (e.g., Stripe).
- Strong debugging/rootcause skills across distributed systems; clear written and verbal communication with technical and nontechnical audiences.
- Participate in an oncall rotation with welldocumented runbooks and automated remediation.
Nice to Have
- Architectural leadership for largescale, multiRegion AWS platforms; AWS Control Tower/Organizations experience.
- Exposure to call center tech beyond Connect (SIP, SBCs, carrier management) and analytics tooling for contact centers.
- Experience with AI assistants/agentic patterns leveraging Bedrock; realtime agent assist and summarization in Connect.
- Containers/Kubernetes (ECS/EKS) and service mesh (where appropriate).
- AWS certifications (Solutions Architect, DevOps Engineer, SysOps).
- Fault Injection Simulator/chaos engineering, and security/compliance frameworks (SOC2, ISO 27001, PCIDSS).
Tech Stack (Representative)
AWS (Connect, Lex V2, Lambda, API GW, Step Functions, EventBridge, S3, DynamoDB/RDS, CloudWatch, IAM, KMS, WAF/Shield), Terraform, Python (boto3), GitHub Actions/Azure DevOps, Datadog, Operata, Salesforce (Service/Experience Cloud), Stripe, Vtex/Shopify.