Strong Middle/Senior Data Engineer ASAP (IRC210472)

Job Description

Must have:

 

Strong combined experience (3-5 years) working with Python for analytics and/or data pipelines.

In-depth knowledge of SQL, optimization, and monitoring.

Practical experience with PySpark (DataFrames, pandas API, clustering).

Strong programming experience in Python.

Practical experience with Apache Spark, Hive.

Experience working with Apache Airflow and creating DAGs.

Hands-on experience working with infrastructure in Git, using Jenkins and Docker. Experience creating pipelines.

Understanding of the data ingestion process.

Ability to document technical solutions and define development tasks accurately.

Desire to work collaboratively with your teammates to come up with the best solution to a problem.

Good English level (written/spoken) for daily communication within a distributed team and with business stakeholders.

Nice to have:

 

Experience with AWS cloud

Experience with Databricks (usage, migration)

Experience working on AI/ML projects with distributed infrastructure (Airflow-SageMaker)

 

Job Responsibilities

Building out data pipelines.

Creating and supporting validation pipelines for continuous monitoring of the quality of data.

Helping pinpoint and fix issues in data quality

Collaborating with other outstanding engineers to power the most exciting digital experiences on the market.

Participating in code review sessions.

Following the client's standards for code and data quality.

 

Department/Project Description

The client is an American multinational association that is involved in the design, development, manufacturing, worldwide marketing, and sales of apparel, footwear, accessories, equipment, and services. The company is a proven leader in its industry and is constantly working to create innovative products and services.

 

The project is an interactive, AI-driven deployment engine that provides daily inventory replenishment signals to identify the optimal and accurate level of inventory required at the RSC and to serve digital consumers with speed.