Data Engineer (Clay, Web Scraping)
About Us
Axia is a deal origination firm that connects investment banks and private equity firms with qualified deal flow. We're a bootstrapped team of 20: lean, fast-moving, and growing.
The Role
You'll own our entire data sourcing process: finding, extracting, enriching, and delivering the data that powers our deal origination engine. You'll work heavily with Clay, APIs, web scraping, and third-party data providers to build reliable pipelines that feed our outreach and research teams.
What You'll Do
- Own the full data sourcing process, from identifying sources to delivering clean, enriched datasets to the team
- Build and maintain Clay workflows: design multi-step Clay tables that enrich, filter, and score leads at scale
- Integrate third-party APIs: connect data providers, enrichment tools, and internal systems into automated pipelines
- Build and maintain web scrapers: extract structured data from websites, directories, and public sources
- Ensure data quality: validate, deduplicate, and normalize data before it reaches outreach and research teams
- Evaluate new data sources: research and test new providers, tools, and methods to improve coverage and accuracy
- Document processes: maintain clear documentation so workflows are repeatable and not dependent on one person
What We're Looking For
Must-have:
- Strong experience with Clay (building complex tables, enrichment workflows, API integrations within Clay)
- Proficiency working with REST APIs: authenticating, paginating, and transforming responses
- Web scraping experience with Python (BeautifulSoup, Scrapy, Playwright) or similar tools
- Comfortable cleaning and transforming messy data into structured, usable formats
- Professional English: clear written communication for async collaboration
- Some availability during US Eastern Time hours (full overlap not required)
Nice-to-have:
- Experience with data enrichment providers (Apollo, ZoomInfo, Clearbit, PeopleDataLabs, etc.)
- SQL and database experience (PostgreSQL, BigQuery, or similar)
- Familiarity with no-code/low-code automation tools (Make, Zapier, n8n)
- Background in B2B data, lead generation, or deal sourcing
- Experience with Python scripting beyond scraping (pandas, data pipelines)
Compensation
- Monthly payment based on experience
- Full-time contractor engagement
- Long-term role with growth potential as our data operations scale
Location
Remote. Some overlap with US Eastern Time (ET) is preferred, but full overlap is not required.
Required skills experience
| Clay | 1 year |
| Claude Code | 6 months |
| Web Scraping | 1 year |
| API Integration | 1 year |
| Data Scraping | 1 year |
| Prompt Engineering | 1 year |
Required languages
| English | C2 - Proficient |