Job Description
- Work with analytics delivery lead and Lead Data Engineer to understand the business needs and build impactful analytics solutions.
- Contribute to cross-functional problem-solving sessions with your squad (data scientists specially) and provide tangible solutions using your expertise in data engineering and domain data knowledge.
- Data onboarding and access process – work with business analysts and analytics delivery lead to manage stakeholders that are affected by changes.
- Use state of the art technologies to acquire, ingest and transform big datasets.
- Design and build high-performance, secure, and scalable data pipelines to support data science projects following software engineering best practices.
- Map data to hypotheses, curate, wrangle, prepare data and feature engineering to be used in machine learning models.
- Design and develop the data and analytics platform selecting the right technologies for each problem at hand (big-data stack, SQL, no-SQL, etc.)
- Build modular pipeline to construct features and modelling tables.
- Build a sense of trust and rapport that creates a comfortable & effective workplace and experience in working as part of an agile squad.
Job Requirements
- Degree in computer science, engineering, mathematics, or equivalent experience.
- 2+ years of previous commercial experience as a leader in a data-driven role.
- 2+ years of hands-on experience building data pipelines in production and ability to work across structured, semi-structured and unstructured data.
- 2+ years of experience in ML pipeline for streaming/batch workflow.
- Ability to write clean, maintainable, and robust code in Python.
- Understanding and expertise of software engineering concepts and best practices.
- Knowledge of testing frameworks and libraries.
- Experience with analytics (descriptive, predictive, EDA), feature engineer, algorithms, anomaly detection, data quality assessment and python visualization libraries – e.g. matplotlib, seaborn or other.
- Comfortable with notebook and source code development – Jupyter, Pycharm/VScode.
- Hands-on experience of technologies like Python, Spark/Pyspark, Hadoop/MapReduce/HIVE, Pandas etc.
- Familiarity with query languages and database technologies, CI/CD, testing and validation of data and software.
- Tech stack and activities that you would use and preform on a daily basis.
Apply Via The Following Link
Leave a Reply