Big Data Engineer (Forward Deployed Engineer) Offline
In our project, we help organizations turn big and complex multi-modal datasets into information-rich geo-spatial data subscriptions that can be used across a wide spectrum of use cases. We turn petabytes of raw data into clear, actionable insights by applying advanced analytics to multi-sourced data, enabling customers to gain comprehensive understanding of organizations, events, and behaviors across land, sea, air, cyber and space domains.
We are seeking a Big Data Engineer (Forward Deployed Engineer) to work on-site in Kyiv, supporting critical geospatial analytics and natural language processing initiatives. This hybrid role combines hands-on big data engineering with product development work, focusing on analyzing massive datasets to surface actionable intelligence patterns and behavioral insights.
As a forward-deployed engineer, you will be embedded directly with the operations team, working on cutting-edge projects that transform petabytes of structured and unstructured geospatial data into strategic intelligence. You'll play a crucial role in developing and optimizing the data fusion capabilities that reduce task force response times from days to hours, while collaborating closely with multidisciplinary teams supporting mission-critical applications.
- RESPONSIBILITIES
- Design and implement scalable big data processing pipelines for ingesting and analyzing petabytes of multi-modal geospatial datasets
- Develop and optimize data fusion algorithms that automatically identify relationships and surface hidden patterns in near real-time
- Build and maintain robust data engineering infrastructure supporting behavioral analytics and anomaly detection at unprecedented scale
- Collaborate with data scientists and analysts to translate complex analytical requirements into production-ready data processing solutions
- Implement and optimize natural language processing workflows for unstructured data analysis and entity relationship extraction
- Ensure data quality, governance, and security compliance across all data processing workflows
- Document data architectures, processing standards, and operational procedures to support knowledge transfer and audit readiness
- Participate in incident response and troubleshooting for data pipeline and processing issues
- Develop custom attribution and modeling technologies for real-time threat detection and opportunity identification specific to regional requirements
- Build integration layers connecting the data engine with tactical mission networks and front-end visualization tools used by local partners
- Implement behavioral analytics capabilities that enable rapid identification of meaningful activities, patterns, and anomalies in complex regional datasets
- Support product development initiatives by prototyping new analytical capabilities and validating them against real-world operational scenarios
- Present analytical findings, insights, and recommendations to both technical and executive audiences, translating complex data relationships into actionable intelligence
- Conduct on-site client workshops and training sessions on the analytical capabilities and data products
- Optimize data processing performance for tactical decision-making timelines, ensuring sub-hour response times for critical intelligence queries
SKILLS
- Big Data Technologies: 4+ years hands-on experience with distributed computing frameworks (Apache Spark, Hadoop, Kafka, Flink)
- Programming Proficiency: Expert-level skills in Python, Scala, or Java for large-scale data processing; SQL expertise for complex analytical queries
- Geospatial Analytics: Experience with geospatial data formats (GeoTIFF, SHP, KML, NetCDF) and processing libraries (GDAL, PostGIS, GeoPandas, Shapely)
- Cloud Platforms: Proficiency with cloud-based big data services (Azure Data Factory, Databricks, HDInsight, or AWS equivalents)
- Data Engineering: Strong understanding of ETL/ELT pipelines, data warehousing concepts, and real-time streaming architectures
- Database Systems: Experience with both SQL (PostgreSQL, SQL Server) and NoSQL (MongoDB, Cassandra, Elasticsearch) database technologies
- DevOps & Infrastructure: Familiarity with containerization (Docker, Kubernetes), CI/CD pipelines, and infrastructure-as-code principles
- Performance Optimization: Proven track record of optimizing big data workloads for speed, cost, and reliability at petabyte scale
- Real-time Analytics: Required hands-on experience building low-latency data processing systems for near real-time behavioral analytics and alerting
- Geospatial Intelligence: Required understanding of GEOINT workflows, temporal analysis, and multi-source intelligence fusion methodologies
- Data Visualization Integration: Required experience integrating data engines with tactical visualization tools and mission planning software
- Multi-source Data Fusion: Required proven ability to correlate and analyze disparate data sources (satellite imagery, communications data, social media, sensor networks)
- Client Presentation Skills: Required experience presenting complex technical findings to diverse audiences including C-level executives, leadership, and technical teams
- Stakeholder Management: Required ability to manage multiple client relationships simultaneously while delivering high-quality technical solutions
- Security Awareness: Required understanding of data security best practices, particularly in sensitive operational environment
BENEFITS:
- Fair Compensation: Enjoy a competitive salary and bonuses that recognize your hard work.
- Work-Life Balance: Choose from flexible work arrangements, whether you prefer working from home, the office, or a mix of both.
- Grow with Us: Take advantage of opportunities for professional growth through training, certifications, and attending exciting conferences.
- Care for Your Well-being: Benefit from comprehensive health benefits, wellness programs, and other perks designed to support your overall well-being.
The job ad is no longer active
Look at the current jobs Data Engineer Kyiv→