Job Details
Position: Production Engineering - Data Services
Location: Plano, TX
Duration: 12-month contract
100% onsite in Plano, TX
Mandatory Skills:
o 9+ years in enterprise application support
o Managed File Transfer (MFT), MuleSoft, ETL/EDL, Dynatrace/Datadog.
Preferred Skills: Tableau, AWS or other cloud platforms, Terraform, general incident management capabilities.
Work Schedule: Shift-based (morning/afternoon/evening); between 7:00 AM and 7:00 PM, no night shifts.
On-Call Rotation: Once per month, includes weekend coverage.
Requirements:
Collaborative. Respectful. A place to dream and do. These are just a few words that describe what life is like at Toyota. As one of the world's most admired brands, Toyota is growing and leading the future of mobility through innovative, high-quality solutions designed to enhance lives and delight those we serve. We're looking for diverse, talented team members who want to Dream. Do. Grow. with us.
o AWS services (EC2, S3, RDS, Lambda, VPC, Route 53, Kubernetes)
o Infrastructure as Code: Terraform, CloudFormation
o Troubleshooting Linux/Windows systems
o AWS CLI, scripting (Python, Bash, PowerShell)
o AWS certifications (Solutions Architect preferred)
o Troubleshoot and resolve issues related to data ingestion, transformation, and querying. Support maintenance of Snowflake data warehouses.
o Provide support for Tableau Servers.
o Manage and maintain Kafka clusters, ensuring data streaming reliability and scalability.
o Troubleshoot Kafka-related issues and optimize performance.
o Troubleshoot and resolve issues related to big data frameworks.
o Monitor and manage EMR clusters, ensuring high availability and performance.
o Provide technical support for integration solutions using MFT protocols (SFTP, FTPS, AS2, HTTPS) and MuleSoft Anypoint Platform.
o Support the automation of file transfers and scheduling tasks using enterprise schedulers and tools.
o Diagnose and resolve API-related problems, working closely with development teams to implement fixes.
o Continuously monitor API performance and usage, using tools to identify and resolve potential issues before they impact users.
o Provide training and support to internal teams on API usage and best practices.
o Monitor the performance and health of APIs using tools such as Datadog and Dynatrace.
o Collaborate with the Major Incident Management team to address and resolve critical issues in the production environment.
o Participate in incident response activities, including root cause analysis and post-incident reviews.
o Maintain comprehensive documentation of support processes, incident resolutions, and cloud configurations.
o Collaborate on reviewing scalable architectures, implementing CI/CD pipelines, and troubleshooting application issues.
o Monitor system health, perform regular maintenance, and respond to incidents.
o Provide cross-skill training and support to team members on integration solutions and best practices.