Lead Performance Engineer (with profiling experience) (IRC274726)
Job Description
- 5+ years of proven experience in performance testing for microservice architectures.
- Strong hands-on experience with Kubernetes (kubectl command line).
- Experience defining the test approach and test plan for new services.
- Skilled in testing distributed services built on message broker architectures (RabbitMQ/Kafka), with emphasis on validating message flow, throughput, fault tolerance, and performance under load.
- Strong hands-on experience with application profiling and monitoring (.NET-based experience is a great plus).
- Proficiency in using Prometheus and Grafana for real-time performance analysis and reporting.
- Familiarity with application performance management, e.g. Dynatrace (preferred).
- Knowledge of Redis and MongoDB performance monitoring techniques (nice to have).
- Experience in KSQL monitoring (an advantage).
- Ansible knowledge for automated monitoring setup and configuration (nice to have).
- Strong analytical skills with the ability to diagnose and resolve complex performance issues.
- Excellent communication skills and ability to collaborate with both technical and non-technical stakeholders.
Job Responsibilities
As a Performance Engineer, you should be able to independently define a test strategy and test plan for a new service, run various performance test scenarios, review application performance, improve the scalability and resiliency of the applications (different modules), and enhance the current observability stack.
Key Responsibilities:
- Design and execute performance testing strategies for microservice architectures to ensure high reliability and scalability.
- Prepare and validate test environments that include Kubernetes deployments and Linux and Windows VMs.
- Monitor the performance of RabbitMQ, ensuring efficient message handling and queue management.
- Utilize Prometheus and Grafana to track, visualize, and analyze application performance metrics.
- Collaborate with development and operations teams to ensure optimal system performance and availability.
- Recommend and implement performance tuning for caching solutions, including Redis.
- Support monitoring of distributed data systems such as Kafka and MongoDB.
- Leverage tools like Dynatrace and KSQL to analyze system performance in real time.
- Assist in performance tuning and monitoring of Kubernetes-based applications.
- Develop and maintain Ansible playbooks for automated deployment of monitoring tools and performance optimization scripts.
Department/Project Description
Our client's talented and diverse workforce develops disruptive robotic solutions for warehouse automation and supply chain transformation, drives results, and focuses on customer satisfaction. Our modular and scalable solutions utilize existing space and high-speed robotics combined with proprietary software and technology. Since its founding in 2007, the company has grown to over 700 employees in North America and supports an impressive, high-profile customer base. We seek candidates who are passionate about our solution and approach problem solving with a creative mind and a commitment to excellence. If you seek a challenge and want to impact the way the world distributes products from manufacturers to store shelves, we invite you to join our team.