Google Cloud Platform Data Engineer

  • Posted 60+ days ago | Updated 24 days ago

Overview

Remote
$100,000 - $120,000
Full Time

Skills

GCP
Google Cloud Paltform
Google Cloud
FHIR

Job Details

You will be part of CitiusTech s 250+ FHIR & HL7 certified professionals in our HPR vertical. In the HPR market, we have 150+ technology engagements, 80Mn clinical/patient records have been integrated and 350+ clinical applications have been developed so far. CitiusTech offers large health systems and provider services companies a powerful combination of healthcare technology services, platforms, and consulting services. Our deep understanding of healthcare technology and our product engineering approach to business solutions have enabled us to closely track industry evolution and build scalable solutions and platforms ahead of the market. We help organizations address complex technology challenges and leverage unique opportunities for digital innovation.

Must Have: Google Cloud Platform, Python, SQL, Pipeline Development

Job Description:

  • Develop data pipelines:

Create efficient and scalable data pipelines using Google Cloud Platform services such as Google Cloud Dataflow, BigQuery, and Cloud Storage.

Ensure the pipelines are reliable, maintainable, and optimized for performance.

  • Data ingestion and integration:

Implement data ingestion processes from various sources, such as databases, APIs, and streaming platforms, into Google Cloud Platform.

Perform data integration and transformation tasks to ensure data quality and consistency.

  • Data modeling and design:

Design and implement data models and schemas that support efficient data storage, retrieval, and analysis.

Utilize appropriate Google Cloud Platform tools and technologies to define and maintain the data structures.

  • Data processing:

Develop data processing and analysis workflows using Google Cloud Platform tools like Google Cloud Dataproc, Google Cloud Pub/Sub, and Google Cloud Functions.

Apply advanced analytics techniques to derive meaningful insights from large volumes of data.

  • Performance optimization and monitoring:

Identify and resolve performance bottlenecks in data processing pipelines and data storage systems.

Monitor the data infrastructure, analyze system metrics, and implement optimizations to ensure high availability and reliability.

  • Data governance and security:

Implement appropriate data governance practices, including data lineage, metadata management, and data access controls.

Ensure compliance with data privacy regulations and industry best practices for data security.

  • Collaboration and communication:

Collaborate with cross-functional teams, including data scientists, business analysts, and software developers, to understand data requirements and deliver solutions that meet business objectives.

Effectively communicate technical concepts and solutions to both technical and non-technical stakeholders.

Required Skills:

  1. Strong proficiency in Google Cloud Platform (Google Cloud Platform) services related to data engineering, such as Google Cloud Dataflow, Google BigQuery, Google Cloud Storage, Google Cloud Pub/Sub, and Google Cloud Dataproc.
  2. Proficiency in programming languages commonly used in data engineering, such as Python, Java, or Scala.
  3. Experience with data integration and ETL (Extract, Transform, Load) processes.
  4. Familiarity with data modeling and schema design principles.
  5. Knowledge of SQL and database systems.
  6. Understanding of distributed computing and parallel processing concepts.
  7. Experience with version control systems and CI/CD (Continuous Integration/Continuous Deployment) pipelines.

Desired Skills:

  1. Experience with other cloud platforms and services, such as AWS or Azure.
  2. Knowledge of data warehousing concepts and technologies.
  3. Familiarity with machine learning and data science workflows.
  4. Experience with data visualization tools, such as Google Data Studio or Tableau.
  5. Understanding of containerization technologies like Docker and orchestration frameworks like Kubernetes.
  6. Familiarity with data streaming platforms like Apache Kafka or Google Cloud Pub/Sub.
  7. Knowledge of data governance frameworks and practices.

About CitiusTech

We are a leading IT Healthcare Service provider. Our practice areas include-Health Information Systems, healthcare Software Engineering & Development, health cloud, Big Data, Mobile Health and many other technologies. Currently, CitiusTech s proprietary platform BI CLINICAL is deployed across 3,800 locations and around 20,000 hospitals across the US

We are also the Integration, Implementation and Healthcare Solution partners with many of the Fortune 500 Healthcare companies.

CitiusTech s global footprint includes the US (Princeton NJ, Seattle WA, Sarasota FL), Europe (London, Germany), Asia (Middle East, Singapore) and India (Mumbai, Airoli, Bangalore).

For more information on us, please visit our website

About CitiusTech