Ibai Irastorza Azcarate, Ph.D.

  • About me
  • Experience
  • Education
  • Projects
  • Technical Skills
  • Publications
  • Courses
  • Other Experience
  • Miscelanea

Contact Info:
ibai.irastorza@gmail.com

View My CV

Social Links

About me


Data Scientist | Machine Learning Engineer | PhD in Bioinformatics


13+ years of experience in research and industry. My academic background in computer science and statistics provides me with a solid foundation to transform complex data into clear, compelling narratives. I am highly proficient in Python, data visualization, and machine learning, with extensive experience in integrating and analyzing large-scale datasets.

Data Science

Data Science

Transformed complex datasets into actionable insights using statistical analysis and visualization techniques. Experience with large-scale data processing and analysis in both research and industry settings.

Software Development

Software Development

Built robust, scalable applications using modern software engineering practices. Proficient in multiple programming languages and frameworks.

ML Engineering

Machine Learning

Designed and implemented end-to-end ML solutions for various domains, from genomics to climate prediction. Experience in PyTorch, TensorFlow, and modern MLOps practices for scalable deployments.

Project Management

Project Management

Led cross-functional teams and managed complex projects from conception to delivery in international environments.

Research Background

Research

Led groundbreaking research in genomics, proteomics and 3D modeling of macromolecules, resulting in high-impact publications. Developed novel algorithms for spatial data analysis and contributed to international collaborations.

Teaching & Mentoring

Teaching

Passionate about knowledge sharing and mentoring. Experience in teaching advanced concepts in bioinformatics, machine learning, and programming to diverse audiences.

Key Achievements:

  • Led end-to-end projects in predictive modeling for spatial and genomic data analysis, securing €100,000 in funding and resulting in 2 first-author and 6 co-authored publications (200+ citations).
  • Developed a novel open source 3D modeling algorithm adopted by 10 international projects and resulting in 3 first-author publications in Cell, Nature Genetics, PLOS Comp Bio, with combined impact factor >90.
  • Active contributor to open-source projects, developing innovative solutions across different ML domains: https://github.com/batxes
  • Supervised junior data scientists to transform their analytical skills through mentorship.
  • Designed and implemented systems for climate prediction models and satellite image classification.
  • Experience in developing and deploying scalable solutions that transform large-scale multi-omics data.

My passion lies in solving complex challenges through technology and innovation. I'm always eager to learn and apply new approaches to meaningful problems. I'm particularly interested in projects that can make a positive impact, especially in sustainability. I thrive in collaborative environments where continuous learning and open feedback are part of the culture.

Core Technologies:

Python
Pandas
NumPy
Matplotlib
Bash
Git
Docker

ML/AI:

PyTorch
TensorFlow
Scikit-learn
Kubernetes

Cloud:

AWS
GCP

MLOps:

MLflow
Prefect
Prometheus
Grafana

Data Eng.:

Terraform
BigQuery
Apache Airflow
DBT

Experience

Education

Projects

Skills

First Author Publications

Courses

Other Experience

Miscelanea