Scraping Engineer (outstaff team) (offline)

About the company

GroupBWT is a consulting firm specializing in data management and the construction of data platforms. Our approach combines classical data warehousing with robust visualization tools, ETL processes, and business intelligence.

 

The platforms we create are designed to collect, analyze, distribute, and leverage internal data, providing actionable insights and value to our clients in the Retail, Manufacturing, Financial, and Market Research industries. Unlike off-the-shelf software, our solutions are not one-size-fits-all. We believe every business is unique, and its data management system should be too.

 

Project Overview:

 

In this role, you will be an integral part of our dedicated outstaff team working on a key project for one of our clients in the real estate domain. The project involves extracting public data from government-based web resources and passing it into a data analysis pipeline, and we are looking for a dedicated and talented Scraping Engineer to help us achieve success.

 

Job Description:

 

As a Scraping Engineer in our dedicated outstaff team, you will have the unique opportunity to work closely with our client, understanding their specific needs and delivering tailored scraping solutions. Your role will involve:

 

Key Responsibilities:

 

- Collaborating directly with the client to gather project requirements and understand data needs.

- Designing and implementing web scraping solutions using Scrapy, BeautifulSoup, and other relevant technologies.

- Developing and maintaining scraping scripts and pipelines, ensuring data accuracy and reliability.

- Monitoring scraping processes, troubleshooting issues, and optimizing for efficiency.

- Communicating regularly with the client to provide updates and gather feedback.

- Adhering to project timelines and delivering high-quality results.

- Collaborating with other team members to ensure seamless integration of scraped data into the client's systems.

- Collaborating with team members and project stakeholders to develop the most efficient project plan.

- Leveraging your expertise in web scraping and data extraction to provide insights and recommendations for optimizing the project's data acquisition strategy.

 

Requirements:

 

- Bachelor's degree in Computer Science, Information Technology, or a related field (or equivalent work experience).

- 2+ years of commercial experience as a Scraping Engineer (Data Extraction Engineer or similar) working with Scrapy.

- Proven experience in web scraping and data extraction using Scrapy.

- Strong proficiency in working with databases, including SQL and RDBMS.

- Proficiency in Python programming and scripting.

- Familiarity with data cleaning, transformation, and normalization techniques: XPath, RegEx, NLP.

- Experience in reverse engineering and analysis of web applications.

- Experience with the following tools and technologies: RabbitMQ, PM2, Docker, SQLAlchemy.

- Strong attention to detail and a commitment to delivering high-quality results.

- Basic Linux system administration skills.

- English level: upper-intermediate.

 

Nice to Have:

 

- Experience with Airflow.

- Experience with FastAPI or Flask frameworks.

- Knowledge of data visualization tools and techniques.

- Familiarity with NoSQL databases.

- Proven experience in overcoming anti-bot protection systems.

 

Benefits:

 

- 20+ paid vacation days.

- Paid sick leave.

- Health expense reimbursement.