Data Engineer (IRC257697)
Job Description
- Proficiency in Python for data processing and automation.
- Strong SQL skills for querying and manipulating data.
- Minimum of 3 years of experience in SQL and Python programming languages, specifically for data engineering tasks.
- Experience with cloud platforms, preferably Azure (Azure Data Factory, Azure Databricks, Azure SQL Database, etc.).
- Experience with Spark and Databricks or similar big data processing and analytics platforms
- Experience working with large data environments, including data processing, data integration, and data warehousing.
- Experience with data quality assessment and improvement techniques, including data profiling, data cleansing, and data validation.
- Familiarity with data lakes and their associated technologies, such as Azure Data Lake Storage, AWS S3, or Delta Lake, for scalable and cost-effective data storage and management.
- Experience with NoSQL databases, such as MongoDB or Cosmos, for handling unstructured and semi-structured data.
Additional Skillsets (Nice to Have):
- Familiarity with Agile and Scrum methodologies, including working with Azure DevOps and Jira for project management.
- Knowledge of DevOps methodologies and practices, including continuous integration and continuous deployment (CI/CD).
- Experience with Azure Data Factory or similar data integration tools for orchestrating and automating data pipelines.
- Ability to build and maintain APIs for data integration and consumption.
- Experience with data backends for software platforms, including database design, optimization, and performance tuning.
Job Responsibilities
- Design, develop, and maintain scalable data pipelines and ETL processes.
- Collaborate with cross-functional teams to understand data requirements and deliver high-quality data solutions.
- Implement data quality checks and ensure data integrity across various data sources.
- Optimize and tune data pipelines for performance and scalability.
- Develop and maintain data models and schemas to support data mesh architecture.
- Work with cloud platforms, particularly Azure, to deploy and manage data infrastructure.
- Participate in Agile development processes, including sprint planning, stand-ups, and retrospectives.
- Monitor and troubleshoot data pipeline issues, ensuring timely resolution.
- Document data engineering processes, best practices, and standards.
Department/Project Description
Our Client is one of the biggest global manufacturing companies operating in the fields of industrial systems, worker safety, health care, and consumer goods. The company is dedicated to creating the technology and products that advance every business, improve every home and enhance every life.
As a Data Engineer for our Data Mesh platform, you will design, develop, and maintain data pipelines & models, ensuring high-quality, domain-oriented data products. You will collaborate with cross-functional teams and optimize data processes for performance and cost efficiency. Your expertise in big data technologies, cloud platforms, and programming languages will be crucial in driving the success of our Data Mesh initiatives.