Overview
Skills
Job Details
We are seeking a contract Data Mining Engineer with deep hands-on experience in scraping structured and unstructured data from public websites. The ideal candidate will be comfortable working with a Python-based stack (Anaconda environment) and modern tools for headless browser automation, parsing, and persistence. The engagement will require strong problem-solving skills, particularly in navigating site protections (e.g., bot detection, captchas), handling scale and performance, and ensuring robust, repeatable extraction of data
Absolute Requirements (Must-Have Experience)
Web Scraping / Data Mining
o Demonstrated success scraping data from public websites, including those with dynamic content and bot detection mechanisms
o Ability to structure scraped data for downstream processing or database storage
Linux Troubleshooting
o Comfortable operating and debugging in a Linux environment o Able to work with bash scripts, cron jobs, logs, and environment configuration
Python Expertise
o Experience working within Anaconda environments
o Proficiency with core scraping and data processing libraries:
selenium (including undetected-chromedriver)
beautifulsoup
numpy