Python Scraper Engineer

$$

About Us

At Structurata, we deliver premier workforce data encompassing people, contacts, organizations, jobs, and intent signals. Renowned for providing the most up-to-date and accurate lead generation data on the market, our solutions empower cutting-edge recruiting and sales platforms, AI-driven models, custom audience creation, and much more. With seamless delivery through APIs, and data feeds, Structurata ensures our customers access the data they need, when and how they need it. We are a Ukrainian based company and a Ukrainian based team.

 

As a Scraper Engineer, you will:

  • Monitoring Duties
    • Review monitoring reports to ensure crawler health and data quality
    • Analyze logs to identify unexpected events, proxy issues, and schema changes. 
  • Implement New Crawler Services
    • Build and deploy crawler services for new public data sources we are wishing to crawl
    • Develop work email enrichment capabilities to enhance data coverage
  • Handle Crawler Outages of P1 Services
    • Investigate and resolve LinkedIn crawler issues, including proposing solutions when APIs are deprecated
    • Respond to monitoring alerts and investigate issues (e.g., significant decreases in profiles crawled)
    • Troubleshoot and restore critical data pipelines with urgency

 

Required Skills & Experience

  • 3+ years of experience in Python development with a strong understanding of good code design principles
  • Solid proficiency in PostgreSQL (PSQL), including query optimization and database design
  • Experience with Celery for distributed task queue management
  • Proven experience building ETL applications that can scale with billions of events
  • Experience with web scraping technologies, proxy management, and handling anti-bot measures
  • Strong debugging skills and ability to analyze logs and monitoring systems
  • Excellent problem-solving skills and ability to work independently or collaboratively

 

Preferred Skills

  • Experience with Flower for Celery monitoring
  • Familiarity with LinkedIn APIs and other professional data sources
  • Knowledge of firewall configuration and IP tables management
  • Experience with Docker and containerized deployments

Required skills experience

Python 3 years
Celery 1 year
PostgreSQL 3 years
ETL 1 year
Web Scraping 1 year

Required languages

English B1 - Intermediate
Ukrainian B2 - Upper Intermediate
Published 13 April
46 views
ยท
6 applications
See stats of candidates who applied for this job ๐Ÿ‘€
To apply for this and other jobs on Djinni login or signup.
Loading...