Senior Data Parsing/ OSINT Automation Engineer Offline

$$$$

Altss is the world’s most advanced LP/investor intelligence database for alternative assets, trusted by top-tier VC, PE, and investment banks. We leverage OSINT, proprietary NLP, and distributed web-scale automation to deliver verified, real-time data that outperforms legacy players like PitchBook, Preqin, and FINTRX.

 

We're building Palantir-level infrastructure—no fluff, no legacy tech, no compromises.

 

The Mission

Build and scale the best data acquisition and parsing engine in the industry—crawling, extracting, and verifying every relevant datapoint on investors, funds, deals, and companies globally.

 

What You’ll Do

  • Architect and deploy massively parallel web scraping and data extraction pipelines across hundreds of sources (websites, filings, PDFs, APIs, Social media, etc.).
  • Reverse-engineer and bypass anti-bot systems (CAPTCHAs, bans, rotating proxies, browser fingerprinting).
  • Maintain high-throughput, self-healing crawlers with automated error handling, logging, and QA.
  • Enrich, deduplicate, and structure data using NLP and advanced entity resolution.
  • Build modular, scalable infrastructure with rapid onboarding for new data sources.
  • Collaborate with LLM, backend, and product teams to deliver new features and continuous data coverage.
  • Own the parsing stack: From code to deployment, performance monitoring, and compliance.
  • Continuously research and implement new OSINT/data mining methods and tools.
  •  

Your Background

  • 5+ years Python (requests, Scrapy, Playwright/Selenium, pandas, async frameworks)
  • Built and scaled large distributed scraping systems (preferably from adtech, alt data, cybersecurity, or OSINT background)
  • Mastery of proxies, anti-bot, headless browsers, session management
  • Hands-on with PostgreSQL/ClickHouse, data pipelines, cloud (Azure/AWS), Docker
  • Experience with NLP/regex for data cleaning, extraction, and enrichment
  • Proven ability to adapt fast, ship independently, and document your work
  • Bonus: OSINT tools (Maltego, Social Links, etc.), Prefect/Airflow, experience with entity resolution, or dark web sources

     

Why Altss?

  • Work directly with founders building the next category-defining investor data platform
  • Zero bureaucracy: ship fast, own what you build, see your impact every week
  • Solve real, hard technical problems (think Palantir, not startup MVPs)
  • Competitive pay, top-tier team, and global ambition
  • Remote, flexible, results-focused environment (Ukraine-based preferred, global team)

Required skills experience

Data Science in Python, Python, Data Scraping, Web Scrapping, ETL, OSINT, data en, NLP, Regex, data mining

The job ad is no longer active

Look at the current jobs Data Engineer →

Loading...