Commit Offshore

Python Engineer


Core Responsibilities

  • Design and implement batch and streaming data pipelines using Apache Spark.
  • Build and evolve a scalable AWS-based data lake architecture.
  • Develop and maintain real-time data processing systems (event-driven pipelines).
  • Own performance tuning and cost optimization of Spark workloads.
  • Define best practices for data modeling, partitioning, and schema evolution.
  • Implement monitoring, observability, and data quality controls.
  • Contribute to infrastructure automation and CI/CD for data workflows.
  • Participate in architectural decisions and mentor other engineers.

Required Qualifications

Experience

  • 3+ years of experience in Data Engineering.
  • Strong hands-on experience with Apache Spark (including Structured Streaming).
  • Experience building both batch and streaming pipelines in production environments.
  • Proven experience designing AWS-based data lake architectures (S3, EMR, Glue, Athena).

Streaming & Event-Driven Systems

  • Experience with event streaming platforms such as Apache Kafka or Amazon Kinesis.

Data Architecture & Modeling

  • Experience implementing lakehouse formats such as Delta Lake.
  • Strong understanding of partitioning strategies and schema evolution.

Performance & Reliability

  • Experience using the Spark UI and Amazon CloudWatch for profiling and optimization.
  • Strong understanding of Spark performance tuning (shuffle, skew, memory, partitioning).
  • Proven track record of cost optimization in AWS environments.

DevOps & Platform Engineering

  • Experience with Docker and CI/CD pipelines.
  • Experience with Infrastructure as Code (Terraform, AWS CDK, or similar).
  • Familiarity with monitoring and observability practices.

Nice to Have

  • Experience in the financial domain.
  • Experience running Spark workloads on Kubernetes.
  • Experience implementing data quality frameworks or metadata/lineage systems.

Required languages

English B2 - Upper Intermediate
Python, Apache Spark, AWS, Docker, CI/CD pipelines
Published 10 March