Data Engineer

  • Location:

    Manchester

  • Sector:

    Life Sciences

  • Salary:

    Negotiable

  • Contact:

    Jessica Charles

  • Contact email:

    Jessica.Charles@volt.eu.com

  • Job ref:

    BBBH2246_1636538453

  • Published:

    over 2 years ago

  • Expiry date:

    2021-12-31

Role: Data Engineer

I've partnered with a small biopharmaceutical company that has developed a platform providing clinical drug screens to optimise treatment decisions for patients with cancer, alongside a biobank of (PDC) models that enables them to offer screening to support pharmaceutical drug development. They're looking for a Data Engineer to build products as the company scales up globally, and to help shape, scope and implement backend solutions and platform products that give clients the data they need to make impactful decisions in the biopharmaceutical space.

Responsibilities:

Building backend automation pipelines for key products
Scraping, cleaning and pre-processing data
Creating pipelines for data science models
Building cloud infrastructure
Building deployment infrastructure
Building DevOps infrastructure
Writing QA pipelines
Maintaining NoSQL databases

Requirements:

3+ years of relevant, contemporary Python experience in a production environment
3+ years of experience with relevant data science frameworks (NumPy, pandas, scientific Python)
1+ years of experience working with NoSQL databases (especially MongoDB)
Advanced Python skills and experience working in Linux-based environments, including shell scripting
Advanced NumPy/pandas ability (scikit-learn)
Advanced MongoDB ability with experience crafting query pipelines, especially via PyMongo
Experience in data modelling (creating ERDs, relationship modelling)
Proficiency with deployment tools, build tools, package management (pip) and CI/CD pipelines
Proficiency in building containerized environments (Docker, Kubernetes, Nextflow)
Proficiency in working with AWS and Azure
Proficiency in version control tools such as Git
Experience with R or interacting with bioinformatics pipelines and working with NGS data
Experience with ML libraries such as PyTorch, scikit-learn, TensorFlow, etc.


If this sounds of interest, please reach out to me at 01737 236729 /