Python Data Engineer / ETL Developer โ Real Estate Intelligence Platform
We are building a real estate intelligence platform focused on New York City property data, property owners, CRM history, contacts, public records, lead scoring, and opportunity detection.
This is not a simple Excel cleanup job. We need a serious Python Data Engineer / ETL Developer who can help us turn large real estate datasets into a clean, structured database that will power our internal software platform.
What We Are Building
We are building a platform that combines real estate data, CRM history, owner information, contact data, public records, lead scoring, opportunity queues, and future AI automation.
The system will organize:
- Property profiles
- Owner and company profiles
- Phone and email contact points
- CRM notes and follow-up history
- Public record data
- Source trace
- Relationship mapping
- Lead scoring
- Opportunity queues
Future AI and automation features
Responsibilities
- Work with large CSV and Excel files
- Clean and normalize real estate data
- Build Python ETL pipelines
- Import structured data into PostgreSQL
- Create relationships between properties, owners, companies, contacts, and CRM activity
- Build deduplication and merge logic
- Build source trace logic
- Build data quality reports
- Help prepare data for dashboards, APIs, and future public records integrations
- Work closely with the technical lead and product team
Help create a clean data foundation for the platform
Required Skills
- Python
- Pandas
- PostgreSQL
- CSV / Excel processing
- ETL pipelines
- Data cleaning and normalization
- REST API experience
- Ability to work with large datasets
- Strong attention to detail
- Ability to structure messy data into a reliable database
Clear communication and ability to work independently
Bonus Skills
- Real estate data experience
- NYC property data experience
- NYC Open Data / Socrata API
- ACRIS
- DOB permits / violations
- PLUTO / MapPLUTO
- Address normalization
- Entity resolution and deduplication
- Data quality reporting
- CRM data experience
Lead generation data experience
Ideal Candidate
You are detail-oriented, reliable, and able to think carefully through messy data. You do not just write scripts. You understand that clean data needs to support a real business system, including dashboards, CRM workflows, lead scoring, and future AI automation.
How to Apply
Please send:
- Your experience with Python and data engineering
- Examples of similar data projects
- Your PostgreSQL experience
- Your experience with CSV, Excel, ETL, and data cleaning
- Your experience with real estate data or public data, if any
- Your experience with APIs, especially public data APIs
- Your hourly rate or monthly salary expectation
- Your availability and preferred working hours
- Your location and time zone
A short explanation of how you would approach this project
We are looking for a long-term team member, not a one-time freelancer.
Required skills experience
| Python | 3 years |
| Python Pandas | 3 years |
| PostgreSQL | 3 years |
| ETL | 3 years |
| Data Engineering | 3 years |
| Google Sheets / Excel | 3 years |
| REST API | 3 years |
| Data Modeling | 3 years |
| Web Scrapping | 3 years |
| SQL | 3 years |
Required domain experience
| PropTech / Real Estate | 3 years |
Required languages
| English | C2 - Proficient |