Lead Big Data Engineer (with Python experience)
We are looking for an experienced data engineer with design and
development experience in automating scalable and high-performance data processing systems (batch and/or streaming) on the cloud (AWS preferably).
What You’ll Do
● Design our data models for optimal storage and retrieval on the cloud and to meet critical product and business requirements
● Build scalable and highly-performant distributed data processing systems as we migrate to the cloud
● Work closely with our business stakeholders to flesh out and deliver on requirements in an agile manner
● Set and evolve data standards and best practices
● Contribute to the data architecture and align it with the business and technology
● Adhere to and enforce software development best practices on the cloud in areas including but not limited to CI/CD, code reviews, automated testing, operational excellence, data quality etc.
What You Should Have
● 5+ years of experience with Big Data
● 4+ years programming experience in Java Python
● 3+ years experience using an enterprise cloud-based solution
● 3+ years implementing data processing pipelines on the cloud (batch and/or streaming)
● 3+ years of experience with Spark, Kafka
● Advanced understanding of SQL, relational and NoSQL databases required
● Experience with various data access patterns, streaming technology, data quality, data modeling, data performance, and cost optimization