Senior/Regular Data Engineer (Python, Spark, Hadoop)
Responsibilities
• Build the infrastructure required for optimal extraction, transformation, and loading of data from a wide variety of data sources using SQL and Azure 'big data' technologies;
• Implement data flows connecting operational systems, BI systems, and the big data platform;
• Build real-time, reliable, scalable, high-performance, distributed, fault-tolerant systems;
• Clean and transform data into a usable state for analytics; build and maintain the data dictionary;
• Create data tools that help analytics and data science team members in their ML work;
• Design and develop code, scripts, and data pipelines that leverage structured and unstructured data (an illustrative sketch follows this list);
• Implement measures to address data privacy, security, and compliance.
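As a purely illustrative sketch (not part of the job ad itself), the following minimal PySpark job shows the kind of batch extract-transform-load pipeline the responsibilities above describe. The storage paths, container names, and column names are hypothetical placeholders, not details from the role.

```python
# Illustrative only: a minimal PySpark batch ETL sketch.
# All paths, container names, and columns below are hypothetical.
from pyspark.sql import SparkSession
from pyspark.sql import functions as F

spark = SparkSession.builder.appName("example-etl").getOrCreate()

# Extract: read raw events from a (hypothetical) data-lake location.
raw = spark.read.json("abfss://raw@examplelake.dfs.core.windows.net/events/")

# Transform: basic cleaning into an analytics-ready shape.
clean = (
    raw.dropDuplicates(["event_id"])
       .filter(F.col("event_id").isNotNull())
       .withColumn("event_date", F.to_date("event_timestamp"))
       .select("event_id", "user_id", "event_type", "event_date")
)

# Load: write a partitioned dataset for downstream BI and ML consumers.
(clean.write
      .mode("overwrite")
      .partitionBy("event_date")
      .parquet("abfss://curated@examplelake.dfs.core.windows.net/events/"))

spark.stop()
```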
Skills
Must have
HiveQL, Scala, Java, Apache HBase, Python, Kafka Streams, Big Data, Apache Kafka, Hadoop
• Experience designing data and analytics architectures in the Microsoft Azure cloud;
• Experience with Big Data technologies such as Spark, Hadoop, Hive, HBase, and Kafka (a streaming sketch follows this list);
• Fluency in several programming languages such as Python, Scala, and Java, with the ability to pick up new languages and technologies quickly;
• Experience with data warehousing, data ingestion, and data profiling;
• Demonstrated teamwork, strong communication skills, and a collaborative approach to complex engineering projects.
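As another hedged, illustrative sketch of the kind of Kafka-plus-Spark integration this stack implies, the snippet below reads a stream of JSON events from a hypothetical Kafka topic with Spark Structured Streaming and writes it to storage with checkpointing. Broker addresses, the topic name, schema, and paths are all assumptions for illustration.

```python
# Illustrative only: Spark Structured Streaming reading from Kafka.
# Requires the spark-sql-kafka connector package on the classpath.
# Broker, topic, schema, and paths are hypothetical placeholders.
from pyspark.sql import SparkSession
from pyspark.sql import functions as F
from pyspark.sql.types import StructType, StructField, StringType, TimestampType

spark = SparkSession.builder.appName("example-stream").getOrCreate()

schema = StructType([
    StructField("event_id", StringType()),
    StructField("user_id", StringType()),
    StructField("event_timestamp", TimestampType()),
])

# Read JSON events from a (hypothetical) Kafka topic and parse them.
events = (
    spark.readStream
         .format("kafka")
         .option("kafka.bootstrap.servers", "broker:9092")
         .option("subscribe", "events")
         .load()
         .select(F.from_json(F.col("value").cast("string"), schema).alias("e"))
         .select("e.*")
)

# Write the parsed stream to storage; checkpointing gives fault tolerance.
query = (
    events.writeStream
          .format("parquet")
          .option("path", "/data/curated/events")
          .option("checkpointLocation", "/data/checkpoints/events")
          .start()
)
query.awaitTermination()
```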
Nice to have
BS in Computer Science or a related STEM field.