Incora

Senior Data Scientist

Incora Verified Employer Responds Quickly

Role Summary

We are looking for a Senior Data Scientist to lead the data intelligence layer of an AI-driven platform. This role focuses on extracting insights from structured and unstructured documents, building document processing and OCR pipelines, improving semantic retrieval systems, identifying information gaps in datasets, and maintaining high standards of data quality across end-to-end data workflows.
 

Key Responsibilities

  • Design and improve OCR and document intelligence pipelines that convert complex PDFs and supporting files into structured, machine-readable data.
  • Develop semantic retrieval workflows that identify relevant historical records and support recommendation systems based on similarity and contextual matching.
  • Create data-driven methods to detect missing, incomplete, or inconsistent information within document-based datasets.
  • Define and monitor data quality standards, validation rules, completeness checks, and anomaly detection mechanisms across data pipelines.
  • Analyze structured and unstructured datasets to uncover patterns affecting operational performance, classification quality, and decision outcomes.
  • Collaborate with machine learning engineers to prepare high-quality datasets for model training, validation, and iterative improvement.
  • Develop labeling strategies, feature definitions, and dataset enhancement workflows for early-stage and production use cases.
  • Evaluate the performance of retrieval systems, document extraction pipelines, and information coverage.
  • Work closely with engineering and product teams to operationalize insights through dashboards, tools, and data-driven workflows.
  • Communicate analytical findings, experiment results, and actionable insights clearly to technical and non-technical stakeholders.
     

Qualifications

  • Bachelor’s or Master’s degree in Data Science, Computer Science, Statistics, Artificial Intelligence, or a related technical field.
  • 5+ years of experience in data science, applied NLP, document intelligence, or analytics-focused roles. 
  • Strong proficiency in Python and practical experience with data science libraries and NLP tooling. 
  • Experience building document processing or OCR pipelines and working with semi-structured or noisy datasets.
  • Strong understanding of embeddings, semantic search, retrieval evaluation, and similarity-based recommendation systems.
  • Experience with data cleaning, validation, feature engineering, and dataset quality management.
  • Ability to design experiments and translate analytical findings into product or model improvements.
  • Strong SQL skills and solid analytical problem-solving abilities.
     

Nice to Have

  • Experience working with AI systems in regulated or compliance-sensitive environments.
  • Familiarity with multilingual NLP applications or systems that support multiple languages.
  • Experience with vector databases or retrieval-augmented architectures.
  • Exposure to annotation workflows and human-in-the-loop review processes.
  • Experience building dashboards, defining operational KPIs, or supporting data-driven reporting.

    We offer:
  • Flexible working schedule
  • Fully remote work
  • Compensation for medical, educational and sports activities
  • Professional growth opportunities
  • Internal knowledge-sharing talks
  • Awesome team events and activities
  • Paid vacation and sick leave

Required languages

English B2 - Upper Intermediate
Ukrainian Native
NLP, Python, OCR, SQL, AI/ML
Published 17 March
13 views
·
1 application
To apply for this and other jobs on Djinni login or signup.
Loading...