Senior Data Scientist
Altss is a leading intelligence platform for Limited Partners in alternative assets, providing the most comprehensive and up-to-date LP data available. We blend OSINT, LLMs, and proprietary parsing to power smarter investor discovery and analytics. As we scale, we’re seeking a Senior Data Scientist to lead our data enrichment, entity resolution, and data quality efforts.
Your Role
- Architect and lead development of entity resolution and record-linkage systems across tens of millions of LP, GP, and fund records.
- Design and implement advanced NLP solutions for extracting, normalizing, and validating unstructured data (web, filings, news, LinkedIn, PDFs, etc.).
- Oversee and continuously improve automated data QA, anomaly detection, and pipeline health monitoring.
- Mentor and collaborate with a team of data engineers and parsing specialists; review code, set best practices, and drive technical strategy.
- Research and integrate new OSINT sources, LLM tools, and enrichment pipelines for deeper coverage and higher data reliability.
- Prototype and productionize innovative data enrichment features using LLMs, graph analytics, and custom ML models.
- Partner with product and engineering teams to translate business needs into scalable, production-ready solutions.
What You Bring
- 5+ years of experience as a data scientist or ML engineer, ideally with a background in OSINT, unstructured data, or large-scale data platforms.
- Deep expertise in Python, NLP (spaCy, HuggingFace, NLTK, etc.), and entity resolution at scale.
- Strong background in building and scaling data/ML infrastructure (Airflow, Prefect, or similar).
- Advanced SQL skills; experience with large datasets (Postgres, BigQuery, Snowflake, etc.).
- Demonstrated leadership: have built, mentored, or led small teams or technical projects.
- Hands-on experience with production LLMs, data pipelines, and graph/relationship modeling.
- Bonus: Experience in alternative investments, financial data, or OSINT.
Preferred location: Ukraine or elsewhere in Eastern Europe.
What We Offer
- 100% remote with flexible hours—outcomes > time spent.
- Greenfield ownership over data architecture; chance to build systems from scratch.
- Direct impact: shape the backbone of a platform competing with Preqin, PitchBook, and Dakota.
- High-agency, collaborative culture. Direct access to founders, zero bureaucracy.
- Fast, transparent hiring process.
Python, NLP, Data Cleaning, QA, NER, BeautifulSoup, Data Scraping & Processing, Web-scraping, Data Scraping
The job ad is no longer active