Senior Parsing and Data Extraction Engineer

Altss is the fastest-growing, AI-driven investor intelligence platform for alternative asset classes. We extract and structure data on LPs, funds, deals, and key people globally, at a scale and depth unmatched in the industry.

What You'll Do

  • Build advanced parsers for large-scale, real-time data extraction from diverse sources: websites, PDFs, filings, news, databases, LinkedIn, and more.
  • Architect robust, resilient scraping systems capable of bypassing sophisticated anti-bot and geo-blocking measures.
  • Develop and deploy entity resolution algorithms to link extracted data across sources (e.g., people, firms, deals).
  • Leverage OSINT methodologies to uncover “hidden” data and extract insights not available via standard APIs or databases.
  • Collaborate with LLM/NLP engineers to automate structuring, cleaning, and validation of parsed data at scale.
  • Continuously monitor, QA, and improve pipelines for speed, accuracy, and reliability.
  • Mentor and lead junior team members (if desired), helping set best practices and high engineering standards.

Who You Are

  • Proven experience building industrial-grade parsing/scraping infrastructure—handling millions of records and high data velocity.
  • Expert in Python (Scrapy, Playwright, Selenium, Requests, BeautifulSoup, etc.), or similar modern scraping stacks.
  • Hands-on experience with headless browsers, proxies, captcha-solving, geo-rotation, and anti-bot techniques.
  • Deep understanding of HTML/XML/JSON structure, regex, and automated data cleaning.
  • Experience with data lakes/warehousing (PostgreSQL, ClickHouse, or similar), and orchestrating ETL/ELT pipelines.
  • Knowledge of OSINT, data enrichment, and cross-entity resolution is a major plus.
  • Familiarity with LLM/NLP workflows for data extraction and normalization is a strong plus.
  • Highly autonomous, outcome-oriented, and able to move fast in a lean, globally distributed team.

Bonus Points For

  • Prior work on investor, finance, or B2B datasets.
  • Contributions to open-source scraping, data extraction, or OSINT tools.
  • Strong background in security, privacy, or compliance in data collection.

The job ad is no longer active
