Senior Data Engineer

  • Antibody
  • Remote job

Job description

Red Sky is a key part of the Tar Heel Capital Pathfinder fund (www.thcpathfinder.com), which was launched in late 2016. Our idea is to build projects with a global perspective and a particular emphasis on modern IT solutions. Every year we introduce new, revolutionary products to the market. We support great ideas and promote talent. We help build innovative startups not only by providing financial resources but, above all, through substantive support.


We operate by the #venture_building formula - we create projects based on the experience of experts, including teams of programmers, designers, project managers, HR, administration, and marketing - ready to develop and support promising startups such as NaturalAntibody.



 Meet NaturalAntibody 

NaturalAntibody is a company specializing in the development of computational methods for antibody-based drug discovery. Our goal is to understand the biology of antibody molecules, their therapeutic context, and how such knowledge can be translated into improved antibody therapy design. We pursue this goal by collecting, generating, and analyzing antibody data, with the end goal of applying our findings to antibody discovery.


NaturalAntibody is seeking a Senior Data Engineer to work on its computational antibody drug discovery product portfolio.

 About the role 

As a Senior Data Engineer, you will design and work on our data and analytics stacks. You will contribute to our data stack by analyzing our existing databases and creating novel datasets. You will use this data to improve our analytics stack by creating computational models that address pertinent needs in antibody-based therapy development. The work combines software development and research, so you should be well suited to tackling open-ended challenges independently. You will collaborate closely with leading drug discovery experts in the pharmaceutical industry, so communication and teamwork skills are very important.


 Your tasks 

Here are just a few examples of potential tasks or activities in this role:

  • Designing Big Data pipelines in line with good practices such as IaaC, High Availability, and Security.
  • Data collection, curation, and maintenance for the existing data stack and novel databases.
  • Analysis and benchmarking of the existing models in the analytics stack.
  • Development of novel computational models for antibody drug discovery.
  • Research into antibody biology and the therapeutic context of antibodies.
  • Liaising with clients from the industry.


 Requirements

The successful candidate should have:

  • Expertise in handling large datasets, preferably in areas such as Next Generation Sequencing, Proteomics, or Protein Structures
  • Programming skills in Python and in tools designed for Big Data processing (terabytes of data), such as Spark and Apache Airflow
  • Experience in designing cost-efficient data pipelines using AWS tools like AWS EMR, AWS Glue, Step Functions, etc.
  • Knowledge of IaaC tools like Terraform or CloudFormation
  • A high level of self-discipline - as a Data Engineer you will be responsible for making meaningful decisions about the project’s course based on your insights
  • Full proficiency in English (mandatory)

Nice to have:

  • A Master's-level degree in computer science, statistics, data science, bioinformatics, or similar. A PhD would be a strong plus.
  • Prior work in Immunoinformatics is a strong plus.
  • Hands-on expertise in applied statistical methods – knowledge of machine learning is a plus.


 We offer

  • 18 000 - 22 000 PLN net B2B
  • Work on an innovative project with a direct impact on its development
  • Access to unique knowledge within the organization and cooperation with outstanding experts and business partners from around the world
  • 100% financing for participation in industry conferences throughout Europe
  • Budget for books, training, and other materials
  • Private medical care package (Medicover), Multisport card, group insurance
  • Integration events in the spirit of #redskyteamspirit


 How do we work?

Methodology:

  • Kanban

Technical stack:

  • Python
  • PySpark
  • AWS EMR, AWS Glue, S3, ECS
  • DocumentDB (MongoDB)
  • Terraform


Collaborate with the fastest-paced #startup_studio in Poland!

Join NaturalAntibody and have an impact on the future!