Overview
Remote
Depends on Experience
Full Time
Skills
AI
Job Details
Key Responsibilities:
- Design and implement robust feature engineering pipelines to extract meaningful signals from structured and unstructured email data.
- Build and fine-tune deep learning and traditional ML models to detect malicious intent in emails (e.g., phishing, malware).
- Collaborate with security analysts and data engineers to define threat patterns and data labeling strategies.
- Use MLflow to manage the complete machine learning lifecycle including experimentation, reproducibility, deployment, and monitoring.
- Optimize Spark-based data processing pipelines to efficiently handle large-scale datasets.
- Conduct offline and online evaluations of model performance using appropriate metrics.
- Stay updated on the latest advancements in AI/ML and apply them to continuously enhance the detection models.
Required Skills and Experience:
- Proven expertise in feature engineering and model development for NLP, anomaly detection, or security domains.
- Strong programming skills in Python with hands-on experience in PyTorch for model development.
- Experience with Apache Spark (PySpark) for distributed data processing.
- Deep understanding of MLflow for model tracking, versioning, and deployment.
- Strong background in machine learning theory, model evaluation, and productionization.
- Experience working with large datasets, especially from enterprise platforms like Google Workspace.
- Knowledge of common threat detection techniques and email-based attack vectors is a plus.
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.