Senior Data Scientist Offline

Altss is a leading intelligence platform for Limited Partners in alternative assets, providing the most comprehensive and up-to-date LP data available. We blend OSINT, LLMs, and proprietary parsing to power smarter investor discovery and analytics. As we scale, we’re seeking a Senior Data Scientist to lead our data enrichment, entity resolution, and data quality efforts.

 

Your Role

  • Architect and lead development of entity resolution and record-linkage systems across tens of millions of LP, GP, and fund records.
  • Design and implement advanced NLP solutions for extracting, normalizing, and validating unstructured data (web, filings, news, LinkedIn, PDFs, etc.).
  • Oversee and continuously improve automated data QA, anomaly detection, and pipeline health monitoring.
  • Mentor and collaborate with a team of data engineers and parsing specialists; review code, set best practices, and drive technical strategy.
  • Research and integrate new OSINT sources, LLM tools, and enrichment pipelines for deeper coverage and higher data reliability.
  • Prototype and productionize innovative data enrichment features using LLMs, graph analytics, and custom ML models.
  • Partner with product and engineering teams to translate business needs into scalable, production-ready solutions.

     

What You Bring

  • 5+ years experience as a data scientist or ML engineer, ideally with experience in OSINT, unstructured data, or large-scale data platforms.
  • Deep expertise in Python, NLP (spaCy, HuggingFace, NLTK, etc.), and entity resolution at scale.
  • Strong background in building and scaling data/ML infrastructure (airflow, Prefect, or similar).
  • Advanced SQL skills; experience with large datasets (Postgres, BigQuery, Snowflake, etc.).
  • Demonstrated leadership: have built, mentored, or led small teams or technical projects.
  • Hands-on with production LLMs, data pipelines, and graph/relationship modeling.
  • Bonus: Experience in alternative investments, financial data, or OSINT.
  • Preferred location: Ukraine or Eastern Europe.

     

What We Offer

 

  • 100% remote with flexible hours—outcomes > time spent.
  • Greenfield ownership over data architecture; chance to build systems from scratch.
  • Direct impact: shape the backbone of a platform competing with Preqin, PitchBook, and Dakota.
  • High-agency, collaborative culture. Direct access to founders, zero bureaucracy.
  • Fast, transparent hiring process.
Python, NLP, Data Cleaning, QA, NER, BeautifulSoup, Data Scraping & Processing, Web-scraping, Data Scraping

The job ad is no longer active

Look at the current jobs Data Science →

Loading...