Lead of SRE for ML / BI Infrastructure

Your expertise:

  • Experience of 1 year or more in a similar role, including team management experience or a strong desire to develop as a technical leader
  • DevOps / SRE Experience: Minimum of 2 years in a DevOps or SRE role
  • Experience with Big Data Technologies like Apache Hadoop, Spark, or Kafka
  • Good skills in one or more programming languages like Python, Java, or Go
  • Experience with Cloud-Based Infrastructure (AWS, GCP, Azure)
  • Experience with CI / CD
  • Proficiency in Linux / Unix Administration
  • Proficiency in Containerization and Orchestration Tools (Docker, Kubernetes)
  • Proficiency in Tools for Infrastructure Automation (Terraform, Ansible, Puppet, Chef)
  • Experience with SQL and NoSQL Databases (Cassandra, MongoDB, HBase)
  • Experience in Performance Tuning and Optimization
  • Good Analytical and Technical Troubleshooting Skills
  • Understanding of Web Technologies (REST, Cloud-Based Applications)
  • Proficient Understanding of Git
  • Written English (read & write)
  • Solid communication and coaching abilities
  • Excellent planning and project management skills
  • Excellent leadership skills, ability to inspire a team and build collaboration across departments

Will definitely be a plus:

  • Understanding of Data Warehousing Solutions (Google BigQuery, Amazon Redshift, Snowflake)
  • Knowledge of Data Processing Pipelines and ETL Processes
  • Understanding of service level indicators (SLIs), objectives (SLOs), and agreements (SLAs)
  • Experience with Logging and Management Systems (AWS CloudWatch, ELK)
  • Understanding of Monitoring Principles and Tools (NewRelic, Zabbix, Prometheus, Grafana)
  • Understanding of Network Protocols and Services (TCP / IP, HTTP, DNS, etc.)
  • Skills in High Availability and Disaster Recovery Strategies
  • Knowledge of System Security, Backup, and Recovery Processes

What’s in it for you?

  • Opportunity to deal with top-notch technologies and approaches in a world-leader product company with millions of customers
  • Opportunity to make a difference for online privacy, freedom of speech, and net neutrality
  • Decent market rate compensation depending on experience and skills
  • Developed corporate culture: no micromanagement, culture based on principles of truth, trust, and transparency
  • “You build it, you own it” mentality in most contexts
  • Support of personal and professional development
    • coverage of costs of external trainings, conferences, professional literature
    • support of experienced colleagues
    • in-house events and trainings
    • regular knowledge sharing in teams
    • English classes and speaking clubs
  • Life-balance support
    • truly flexible schedule, no time-tracking at all
    • 25 working days of vacation
    • 5 days of paid sick leave per month (if necessary) without providing a medical certificate
    • generous maternity leave program
  • Professionally strong environment, friendly and open atmosphere, ability to influence the product development and recognition for it

You will be involved into:

  • Support reliability of sites and services in tandem with team members and other teams
  • Help BI and ML teams to support software and services in the dev and production environment
  • Improve security and reliability of services, count capacity before high load periods
  • Manage incidents and requests from on-duty staff and other teams
  • Document all changes in the company wiki and help the team avoid repetitive tasks
  • Design and improve the methodology of work in a team and help team members do their best
  • Make presentations for best practices for other teams and help other teams do their work in the best way

About the company and project:

ZONE3000 is proud to represent its partnership with Namecheap (www.namecheap.com). Namecheap was founded in 2000 on the idea that all people deserve value-priced domains delivered through stellar service. Today Namecheap is a leading ICANN-accredited domain name registrar and web hosting company with over 13 million customers and 17 million domains under management — and we’re just getting started.

 

129 views
·
17 applications
71% read
·
12% responded
Last responded more than a month ago
78 views
·
11 applications
55% read
·
0% responded
To apply for this and other jobs on Djinni login or signup.