ML Data Engineer
Who we are
Are you passionate about innovating at the intersection of technology and personal security? We recognize that the human voice is a unique personal identifier, increasingly susceptible to sophisticated fraud, including the threat of deepfakes. We're leading the way in developing cutting-edge authentication, fraud prevention, and deepfake detection. Our mission is to provide seamless and secure digital experiences, safeguarding the most personal aspect of our identity: our voice. Here, you'll be part of a team driven by values of Innovation, Customer Advocacy, Excellence, and Impact. We're not just creating a safer digital landscape by fortifying trust and integrity with those we serve, were also building a dynamic, supportive workplace where your contributions make a real difference.
Headquartered in Atlanta, GA, backed by world-class investors such as Andreessen-Horowitz, IVP, and CapitalG.
We are looking for an ML Data Engineer to join the Speech Research team to help us in the development of our deepfake detection products. This candidate will work in a world-class team of researchers and will play a key role in developing the next generation of voice security suite. This role will touch various use cases at the forefront of the Pindrop vision.
What you`ll do
- Design and maintain datasets for ML model training including data collection, data augmentation and storage
- Develop and maintain tools for data collection, augmentation, and visualization
- Contribute to research packages to train and test ML models
- Conduct experimental studies on various audio processing tasks to assess accuracy and quality of pre-existing models
Contribute to the research teams activities including regular research reviews and code reviews
Who you are
- You are a creative problem solver who is excited about building amazing products that add real value to millions of people
- You are enthusiastic about audio processing as well as transferring research ideas into products
- You can explain complex topics in simple terms, and you love collaborating and building strong relationships with colleagues and stakeholders
- You are a self-starter and excel in a fast-paced dynamic environment that often includes ambiguity
- You have a proven track record of successful and timely project delivery
- You are resilient in the face of challenges, change, and ambiguity
- You are optimistic and believe that you can make a problem into a solution
- You are resourceful, excited to uncover innovative solutions and teach yourself something new when needed
- You take accountability, do the things you say youll do, under-promise and over-deliver
You are nimble and adaptable when priorities change and continue to see the forest through the trees
Your skill-set:
- 3+ years of professional experience as a data engineer in Deep Learning, preferably, in automatic speech recognition, natural language processing, speaker recognition or model compression or in general machine learning
- Experience in the preparation of data for ML model training and evaluation
- Strong Python skills required
- Experience with ML frameworks: TensorFlow, Keras, PyTorch
- Masters degree or PhD in a quantitative field (Computer Science, Mathematics, Engineering, Artificial Intelligence, etc.) or equivalent experience required
- Preferably, you have experience on working with audio/visual data and data collection
- Preferably, you have experience in JavaScript, HTML, CSS or React JS
- Preferably, you have knowledge of biometrics, authentication, fraud, or customer experience concepts
- Preferably, you have experience communicating with internal and external customers around proof-of-concepts, issues, etc.