Data Engineer
Location: Orlando, FL
Hybrid/Remote/Onsite: Hybrid 3-Days onsite
Duration: 6 month Contract with potential for extension
Must have strong DataStage experience with banking
Summary:
- Design, develop, and maintain ETL processes using IBM InfoSphere DataStage to transform and load file-based data (CSV, fixed-width, delimited) into SQL Server tables.
- Build end-to-end ETL workflows, including data staging, transformation, validation, and publishing to downstream systems.
- Perform source-to-target mapping and implement complex data transformation logic per business and technical requirements.
- Utilize various DataStage stages (e.g., Sequential File, Transformer, Lookup, Join, Aggregator, Sort, Funnel, Remove Duplicates) with attention to partitioning and parallel job design for performance optimization.
- Write and optimize SQL Server queries, stored procedures, and advanced T-SQL scripts as part of ETL workflows.
- Implement robust, restartable ETL jobs with parameterization, detailed logging, error handling, auditing, and reconciliation checks.
- Apply data quality controls (format, referential integrity, null/duplicate, threshold checks) and produce clear exception reports.
- Monitor and troubleshoot ETL jobs using DataStage Director/Operations Console and SQL Server tools; perform root-cause analysis and resolve defects.
- Tune ETL jobs and SQL queries for optimal performance, leveraging partitioning, sorting, and set-based logic.
- Participate in code reviews, testing, documentation, release management, and maintain clear operational procedures.
- Collaborate with business analysts, data modelers, QA, and production support teams to ensure reliable and auditable data pipelines.
- Leverage experience in UNIX/Linux shell scripting and job scheduling/orchestration tools (e.g., Control‑M, Autosys) as needed.
- Adhere to best practices for data governance, version control (Git), and secure handling of sensitive data.
- Deliver consistent, auditable, and performant file-to-table data loads, with issues easily diagnosable and traceable to requirements.
Required Skills:
- Strong hands-on IBM DataStage ETL development in a banking environment.
- Advanced SQL Server and T-SQL expertise.
- Deep understanding of file ingestion, parsing, and data quality controls.
- Experience with robust ETL job design (parameterization, logging, error handling, restartability).
- Excellent troubleshooting and communication skills.
Preferred:
- DataStage Parallel Jobs tuning.
- UNIX/Linux basics and shell scripting.
- Experience with orchestration tools and CI/CD practices.
- Familiarity with data warehousing concepts and data governance.
Education:
- Bachelor’s degree in Computer Science, Engineering, Information Systems, or equivalent experience.