BigID Developer (Python + NLP spaCy)

  • Remote or San Ramon, CA
  • Posted 8 hours ago | Updated 1 hour ago

Overview

Remote
On Site
Hybrid
BASED ON EXPERIENCE
Full Time
Contract - Independent
Contract - W2

Skills

BIGID APP FRAMEWORK
REGEX
PYTHON
AWS
GCP
AZURE
SPACY
REGULAR EXPRESSIONS
NLP
REST API
JSON
NOSQL
BIGID STUDIO
DATA SECURITY

Job Details

Job Title: BigID Developer (RegEx & Python)
Location: San Ramon, CA / Remote
Duration: 3+ Months
Exp. Level: 6-8 years

Position Summary:

We are seeking a highly skilled BigID Developer with 4-6 years of experience to join our data privacy and discovery team. The ideal candidate will have expertise in the BigID platform, Regular Expressions (RegEx), and Python scripting, plus hands-on experience with NLP frameworks such as spaCy. You will play a key role in automating data classification, building intelligent data models, and ensuring compliance with global privacy regulations.


Key Responsibilities:

  • Implement and customize BigID for enterprise-wide data discovery, classification, and privacy enforcement.
  • Create and optimize RegEx patterns for custom data identification rules.
  • Develop and integrate Python scripts for automation, data parsing, and API workflows.
  • Use spaCy NLP models for intelligent entity recognition, sensitive data detection, and unstructured data classification.
  • Extend BigID capabilities using custom logic and advanced data classification techniques.
  • Connect and configure diverse data sources (databases, data lakes, SaaS, file systems) in BigID for scan orchestration.
  • Collaborate with data governance and security teams to align with compliance frameworks (GDPR, CCPA, HIPAA).
  • Monitor BigID job performance, resolve system issues, and fine-tune NLP-based classification logic.
  • Prepare detailed documentation, workflows, and operational handbooks.

Required Skills & Experience:

  • 6-8 years of professional experience in data engineering, data governance, or privacy engineering roles.
  • 2+ years of hands-on experience with BigID implementation, policy configuration, and data source integration.
  • Strong command of Regular Expressions (RegEx) for sensitive data discovery patterns.
  • Advanced Python scripting skills including data manipulation, API integration, and automation.
  • Proficiency with spaCy or similar NLP tools (e.g., NLTK, Transformers) for entity recognition and unstructured data processing.
  • Familiarity with REST APIs, JSON, and data ingestion pipelines.
  • Experience working with structured/unstructured data across cloud and on-prem platforms (e.g., AWS S3, Azure Blob, Google Cloud Platform, SQL/NoSQL databases).

Nice to Have:

  • Experience with BigID App Framework or BigID Studio for building custom connectors or workflows.
  • Exposure to AI/ML-driven data classification or custom NLP models.
  • Cloud platform certifications or hands-on experience (AWS, Azure, Google Cloud Platform).
  • Working knowledge of IAM tools, data security policies, and privacy-enhancing technologies.

Education & Certification:

  • Bachelor's Degree in Computer Science, Information Systems, Engineering, or related field.
  • Preferred:
    • BigID Certified Professional (if available)
    • Python certifications (PCEP, PCAP)
    • Data privacy certifications (e.g., CIPP/US, CIPT)

About My IT LLC