Hybrid in New York, New York
•
Today
KeyResponsibilities Develop,test,andmaintainscalabledataprocessingpipelinesusingScalaandApacheSpark.ImplementdatatransformationandETLworkflowstohandlestructuredandunstructureddata.UtilizePythonfordataprocessing,scripting,andintegratingSparkworkflows.OptimizeperformanceofSparkapplicationsthroughtuning,partitioning,andcachingstrategies.Participateincodereviews,designdiscussions,andensureadherencetobestpractices.Documentworkflows,architecture,andsolutionsforinternalknowledgesharing.RequiredQualific
Easy Apply
Contract, Third Party
0+
