Senior Parsing and Data Extraction Engineer Offline
Altss is the fastest-growing, AI-driven investor intelligence platform for alternative asset classes. We extract and structure data on LPs, funds, deals, and key people globally, at a scale and depth unmatched in the industry.
What You'll Do
- Build advanced parsers for large-scale, real-time data extraction from diverse sources: websites, PDFs, filings, news, databases, LinkedIn, and more.
- Architect robust, resilient scraping systems capable of bypassing sophisticated anti-bot and geo-blocking measures.
- Develop and deploy entity resolution algorithms to link extracted data across sources (e.g., people, firms, deals).
- Leverage OSINT methodologies to uncover “hidden” data and extract insights not available via standard APIs or databases.
- Collaborate with LLM/NLP engineers to automate structuring, cleaning, and validation of parsed data at scale.
- Continuously monitor, QA, and improve pipelines for speed, accuracy, and reliability.
Mentor and lead junior team members (if desired), helping set best practices and high engineering standards.
Who You Are
- Proven experience building industrial-grade parsing/scraping infrastructure—handling millions of records and high data velocity.
- Expert in Python (Scrapy, Playwright, Selenium, Requests, BeautifulSoup, etc.), or similar modern scraping stacks.
- Hands-on with headless browsers, proxies, captcha-solving, geo-rotation, and anti-bot techniques.
- Deep understanding of HTML/XML/JSON structure, regex, and automated data cleaning.
- Experience with data lakes/warehousing (PostgreSQL, ClickHouse, or similar), and orchestrating ETL/ELT pipelines.
- Knowledge of OSINT, data enrichment, and cross-entity resolution a major plus.
- Familiar with LLM/NLP workflows for data extraction/normalization is a strong plus.
Highly autonomous, outcome-oriented, and able to move fast in a lean, globally distributed team.
Bonus Points For
- Prior work on investor, finance, or B2B datasets.
- Contributions to open-source scraping, data extraction, or OSINT tools.
- Strong background in security, privacy, or compliance in data collection.
The job ad is no longer active
Look at the current jobs Data Science →
📊
$2800-5000
Average salary range of similar jobs in
analytics →
Similar jobs
Countries of Europe or Ukraine
Countries of Europe or Ukraine
Countries of Europe or Ukraine