Data Engineer

Overview

Hybrid
$120,000 - $140,000
Full Time
No Travel Required

Skills

BigQuery
Data Management
Big Data
Data Lakes
Data Pipelines
Stakeholders

Job Details

About Rysun Labs Inc:

Rysun Labs (formerly KCS Krish Compusoft Services) is an AI, Data & Digital innovation partner of choice for enterprises globally. Rysun guides and accelerates the Data & AI strategy and Digital Transformation programs of Fortune 2000 enterprises and product startups to shape remarkable customer experiences and intelligent operations. The team delivers innovative, specialized solutions that help High-tech, Retail & Ecommerce, and Energy companies outperform the competition and lead change in their industries.

Rysun partners with Microsoft, Google, and AWS to bring the best of enterprise technology to its customers. Rysun believes in quality first and is CMMI Level 5, ISO 9001 & 27001 certified. The team has a growth mindset fueled by a strong culture of collaboration that unifies its global teams across India, the USA, South Africa (a proud Level 2 B-BBEE Contributor), and the UK.

Location: Milpitas, CA

Responsibilities:

  • Design and develop scalable data pipelines and data integration solutions to collect, store, and process large volumes of structured and unstructured data.
  • Collaborate with stakeholders to understand data requirements, translate them into technical specifications, and design appropriate data models and schemas.
  • Build Data Management Solutions for various data challenges on relational databases, data lakes, and data warehouses.
  • Take ownership of the Data Management solutions to be built and show the initiative to deliver enterprise-grade outcomes.
  • Develop and maintain data infrastructure, including data processing frameworks, job scheduling, and monitoring systems.
  • Create detailed documentation for the solutions built and contribute to growing the organization's Data Management knowledge base.
  • Ensure data quality and integrity by implementing data validation checks, data cleansing processes, and data governance practices.
  • Collaborate with IT and security teams to ensure compliance with data privacy regulations and data security standards.
  • Monitor and optimize data processing performance, identifying and resolving bottlenecks and performance issues.
  • Conduct code reviews, enforce coding standards, and promote a culture of high-quality software development within the team.

Requirements:

  • Bachelor's degree in Computer Science, Data Science, or a related field. A Master's degree is a plus.
  • Proven experience as a Lead Data Engineer or in a similar role, including hands-on experience with data pipeline development, data modeling, and data integration.
  • Strong communication skills to ideate on and evangelize initiatives so that they are successfully implemented across the organization, to collaborate effectively with cross-functional teams, and to present complex technical concepts to non-technical stakeholders.
  • Experience managing small agile teams with multiple parallel projects.
  • Experience with emerging data technologies and hands-on experience with PoCs and demo builds.
  • Strong experience in building solutions end-to-end, from infrastructure to final deliverable.
  • Proficiency in SQL and experience working with relational databases and SQL query optimization.
  • Experience with distributed computing frameworks (e.g., Hadoop, Spark) and big data technologies (e.g., HBase, Hive, Kafka) is highly desirable.
  • Knowledge of cloud platforms (e.g., AWS, Azure, Google Cloud Platform, Snowflake) and experience with cloud-based data services (e.g., S3, Redshift, BigQuery) is a plus.
  • Familiarity with data visualization tools and techniques (e.g., Tableau, Power BI) is beneficial.
  • Strong analytical and problem-solving skills, with the ability to analyze complex data sets and identify patterns and insights.
  • Ability to transition efficiently between multiple data management practices and tools.