Python Scraper Engineer
$$
About Us
At Structurata, we deliver premier workforce data encompassing people, contacts, organizations, jobs, and intent signals. Renowned for providing the most up-to-date and accurate lead generation data on the market, our solutions empower cutting-edge recruiting and sales platforms, AI-driven models, custom audience creation, and much more. With seamless delivery through APIs, and data feeds, Structurata ensures our customers access the data they need, when and how they need it. We are a Ukrainian based company and a Ukrainian based team.
As a Scraper Engineer, you will:
- Monitoring Duties
- Review monitoring reports to ensure crawler health and data quality
- Analyze logs to identify unexpected events, proxy issues, and schema changes.
- Implement New Crawler Services
- Build and deploy crawler services for new public data sources we are wishing to crawl
- Develop work email enrichment capabilities to enhance data coverage
- Handle Crawler Outages of P1 Services
- Investigate and resolve LinkedIn crawler issues, including proposing solutions when APIs are deprecated
- Respond to monitoring alerts and investigate issues (e.g., significant decreases in profiles crawled)
- Troubleshoot and restore critical data pipelines with urgency
Required Skills & Experience
- 3+ years of experience in Python development with a strong understanding of good code design principles
- Solid proficiency in PostgreSQL (PSQL), including query optimization and database design
- Experience with Celery for distributed task queue management
- Proven experience building ETL applications that can scale with billions of events
- Experience with web scraping technologies, proxy management, and handling anti-bot measures
- Strong debugging skills and ability to analyze logs and monitoring systems
- Excellent problem-solving skills and ability to work independently or collaboratively
Preferred Skills
- Experience with Flower for Celery monitoring
- Familiarity with LinkedIn APIs and other professional data sources
- Knowledge of firewall configuration and IP tables management
- Experience with Docker and containerized deployments
Required skills experience
| Python | 3 years |
| Celery | 1 year |
| PostgreSQL | 3 years |
| ETL | 1 year |
| Web Scraping | 1 year |
Required languages
| English | B1 - Intermediate |
| Ukrainian | B2 - Upper Intermediate |
Published 13 April
46 views
ยท
6 applications
See stats of candidates who applied for this job ๐
๐
Average salary range of similar jobs in
analytics โ
Loading...