Spark, Scala, Java, Kubernetes, Cassandra or HBase, S3 or HDFS. Worked with Hive, Presto or Impala. Preferred to have work on Production env. With PB of data storage.
All candidate should be strong in coding

Data engineer can be specialized in either NoSQL data stores (Cassandra, HBase etc) or Spark & Spark streaming - need both specializations
3+ years of professional experience with Big Data systems, pipelines and data processing
Hands on experience Big Data, data ingestion, data processing using Spark, Spark Streaming, Flink, HIVE, Kafka, Hadoop, HDFS, S3

Hands-on experience with design and development with NoSQL technologies Cassandra, HBase or similar scalable Key valueStores and time series data stores like Druid, influx or similar
Understanding on various distributed file formats such as Apache AVRO, Apache Parquet and common methods in data transformation
Confirmed understanding of design and development of large scale, high throughput and low latency applications is a plus
Understanding and experience with Micro Services is desired
Excellent problem solving and programming skills
Experience with containerization technologies like Kubernetes, Docker, Mesos, Marathon is desirable
Experience with CI/CD, debugging and monitoring applications and big data jobs is desirable

About EPAM Systems

EPAM Systems is a leading global provider of digital platform engineering and software development services, with more than 36,700+ employees worldwide.

Company website:

DOU company page:

Job posted on 19 November 2020
1 view

Для отклика на эту и другие вакансии на Джинне войдите или зарегистрируйтесь.
  Receive new jobs in Telegram