Job Title: Senior Java PySpark Developer
Location: Irving, TX
Job Description:
We are seeking a seasoned Senior Lead Data Engineer with a dual-threat background in robust backend development and large-scale data processing. You ae a self-starter who thrives in high-complexity environments and processes the architecture maturity to take a project from a blank page to a production-ready ecosystem without hand-holding.
Responsibilities:
Design, develop, and maintain robust, scalable, and high-performance applications using Java, PySpark/Python.
Lead the architecture and design of complex software systems and ensure adherence to best practices.
Collaborate with product managers, designers, and other developers to define project requirements and deliverables.
Optimize and refactor existing codebases for performance, scalability, and maintainability.
Mentor junior developers and provide technical guidance to the team.
Integrate third-party APIs, libraries, and tools into applications.
Write unit tests, perform code reviews, and ensure high code quality.
Work with DevOps teams to deploy and monitor applications in production environments.
Stay updated with the latest trends and advancements in Java and Python and related technologies.
Qualifications:
10+ years of professional experience in Java, PySpark/Python development.
Strong understanding of Python frameworks such as Django, Flask, or FastAPI.
Expertise in building Java RESTful APIs and microservices architecture.
Solid understanding of software design patterns, algorithms, and data structures.
Hands-on experience with Big Data Environment.
Familiarity with CI/CD pipelines and version control systems (e.g., Git).
Strong debugging and problem-solving skills.
Excellent communication and collaboration skills.
Good to Have Skills:
Experience with Java and PySpark/Python development creating data processing pipeline and frameworks.
Knowledge on CICD process to code versioning and deployments in various environment.
Resource should have good knowledge on Big Data environments and distributed architecture.
Knowledge on Snowflake is an added advantage.