Senior ML Engineer
Job Description
We are a comprehensive SaaS platform that provides complete device operational intelligence, allowing security teams to effortlessly detect vulnerabilities, assess risks, and deploy cutting-edge mitigations. Our universal vulnerability repository is the first in the industry; it enables organisations, as well as IT and security solutions, to stay steps ahead of cyber threats, ensuring robust security in the face of evolving cyber threats.
๐ Desired Skills and Experience
โ Strong commitment to code quality, particularly in Python.
โ In-depth knowledge of LLMs (e.g., GPT, Claude, Mistral) with hands-on experience in prompt engineering, fine-tuning, or RAG pipelines.
โ Solid understanding of NLP fundamentals: tokenization, embeddings, and
transformer-based models.
โ Proficiency with Python and key ML/NLP frameworks (e.g., HuggingFace,
Sentence-Transformers, LangChain, LangGraph).
Strong grasp of:
โ Scalable and maintainable software architecture
โ Object-oriented programming and design patterns
โ Automated testing (unit, integration, and smoke tests)
โ Familiarity with Git and command-line tools.
โ Excellent communication skills and professional-level English.
โ Ability to learn quickly and work independently across time zones.
โ Minimum of 5+ years of experience in machine learning, including at least 2 years in the generative AI space.
Nice to Have:
โ Interest or background in cybersecurity.
โ Experience with:
โ Task orchestration tools (e.g., Airflow, Argo Workflows)
โ Cloud platforms (preferably Google Cloud Platform)
โ SQL databases (e.g., PostgreSQL)
โ Graph databases (e.g., ArangoDB, Neo4J)
โ Infrastructure as code (Terraform)
โ Containerization and orchestration (Docker, Kubernetes)
๐ฏ Responsibilities
โ Design and implement Retrieval-Augmented Generation (RAG) pipelines using state-of-the-art LLMs (e.g., OpenAI, Vertex AI, Cohere).
โ Build pipelines for extracting structured information from unstructured sources (PDFs, HTML, etc.) using LLMs and advanced prompt engineering techniques.
โ Conduct R&D in the cybersecurity domain using NLP techniques to match and analyze vulnerability data.
โ Develop evaluation datasets and pipelines to benchmark the quality and correctness of LLM outputs.
โ Craft effective prompts, fine-tune LLMs, and experiment with embedding and similarity methods.
โ Design and implement HTTP APIs for both internal and customer-facing
applications.
โ Research the capabilities and limitations of current LLMs and apply findings to enhance RAG and other LLM-based services.
โ Write unit and integration tests to ensure reliability and maintainability.
โ Participate in sprint planning and collaborate with non-technical stakeholders to gather and translate business requirements.
โ Review code, designs, and technical deliverables, providing constructive feedback.