Infrastructure architect

  • Lansing, MI
  • Posted 12 hours ago | Updated 12 hours ago

Overview

Hybrid
$60 - $100
Full Time

Skills

Linux System Administration
Ubuntu
HPC (High Performance Computing)
Slurm Workload Manager
Bash Scripting
Python
R Programming
Ansible
Puppet
Chef
Nextflow
Docker
Singularity
SAN Storage
NAS Storage
Qumulo
Network Appliance Clustered Servers
Mellanox Switches
Cloud Infrastructure (AWS/GCP/Azure)
Compute Engine
Storage Buckets
PostgreSQL
MySQL
Oracle
SQL Server
Virtualization
Disaster Recovery (DR)
Backup and Recovery Solutions
Automation Tools
System Monitoring
Log Analysis (IIS, Dynatrace)
Web.config Analysis
HL7 Messaging
Load Balancing
Network Configuration
Memory Management
Firewall Configuration
Failover Systems
Cloudflare
Forcepoint
Security Compliance
Storage Optimization
Infrastructure Architecture
CDC Applications (Preferred)
CI/CD (Basic Knowledge)
Storage Mount Strategies
rsync
Big Data Monitoring Tools

Job Details

Job description:

We are seeking an experienced infrastructure architect with an extensive background in high performance computing (HPC), Linux systems, storage solutions, and automation frameworks. The ideal candidate will be responsible for designing, implementing, and maintaining secure, scalable, and resilient HPC infrastructure and storage systems, supporting scientific computing environments, managing cloud and on-prem systems, and ensuring best practices in disaster recovery and data security.


Responsibilities:

  • Design, implement, and maintain HPC infrastructure and security

  • Manage SAN/NAS storage systems, backups, and virtualization infrastructure

  • Support enterprise backup, DR, and continuity plans

  • Configure and manage automation tools (Ansible, Puppet, Chef)

  • Maintain high-speed network storage systems (e.g., Mellanox switches, clustered NAS)

  • Support cloud infrastructure (compute engines, storage buckets)

  • Manage SQL and NoSQL databases (e.g., PostgreSQL, MySQL, Oracle)

  • Assist teams in utilizing computing and storage resources

  • Collaborate with labs and DTMB to manage computing infrastructure

  • Review system logs, monitor resource usage, and address anomalies

  • Participate in failover and DR planning/testing


Required skills & experience:

  • 10+ years of experience in Linux system administration (Ubuntu, CLI, firewalls, memory, VM, etc.)

  • 10+ years of experience with scripting languages (Bash, Python, R)

  • 10+ years of experience with HPC environment setup and maintenance

  • 10+ years of experience with workload managers (e.g., Slurm)

  • Deep knowledge of database setup and administration (PostgreSQL, MySQL, etc.)

  • Hands-on experience with network appliances and clustered storage

  • Experience in backup/recovery systems and disaster recovery planning

  • Familiarity with cloud infrastructure setup (AWS, Google Cloud Platform, or Azure preferred)

  • Experience in automation and configuration management (Ansible, Puppet, Nextflow)

  • Knowledge of containerization tools (Docker, Singularity)

  • Strong troubleshooting and log analysis skills (IIS, Dynatrace, etc.)

  • Experience in reviewing config files (e.g., web.config)

  • Exposure to HL7 messaging, Cloudflare, Forcepoint (rule sets like c86), junction configurations

  • Experience assisting with failover and DR implementation/testing

  • Familiarity with CDC-hosted applications is a plus

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.

About Efovinity Inc