About Me

Hi, I’m Tomasz Lasota—a scientist, data geek, and AI enthusiast with a unique blend of expertise in Molecular Biology and Computer Science. With a PhD in Molecular Biology and an MSc in Computer Science specializing in Data Analytics, I focus on bridging the gap between wet-lab experimentation and computational insight.

My career began in the lab developing PCR-based diagnostics and nucleic acid extraction technologies, but I quickly became fascinated by the stories hidden in biological data. That curiosity drew me into the world of Python, machine learning, and statistical modeling. Whether it’s building classification models for PCR amplification curves, detecting anomalies in microfluidic pressure profiles, or interpreting metabolomics datasets using dimensionality reduction and clustering, I enjoy applying ML to solve meaningful biological problems.

My MSc research used artificial neural networks to model human biomechanics, but these days I'm more interested in making sense of high-dimensional biological data—especially in diagnostics and omics applications. I’m passionate about turning noisy lab readouts into actionable insights through clean, reproducible analysis pipelines.

Outside of science, I enjoy tinkering with 3D printing, automation projects, and cloud-native tools to accelerate discovery. And none of this would be possible without my dog, Lilly, who ensures I stay balanced with regular walks and reminders to unplug.

Professional Experience

Senior Integration Scientist | LEX Diagnostics (Dec 2023 – Present)

  • Developed machine learning pipelines to classify real-time PCR amplification curves, enabling early detection of weak, ambiguous, or failed amplification events to support QC and diagnostic decisions.
  • Applied time-series modeling and anomaly detection techniques to monitor pressure profiles in microfluidic systems, improving failure detection and fluidic stability during assay operation.
  • Designed and validated supervised models (e.g., logistic regression, random forests) to predict performance anomalies during thermal cycling phases, integrating insights into instrument firmware and control logic.
  • Created reproducible data analysis workflows in Python (pandas, NumPy, matplotlib, scikit-learn) using Jupyter Notebooks to support R&D and verification teams.
  • Performed statistical analyses (t-tests, ANOVA, logistic regression) on assay outputs to evaluate signal fidelity, probe stability, and channel consistency across pilot builds and validation studies.
  • Built interactive dashboards and data visualizations to accelerate troubleshooting and cross-functional decision-making in assay and instrument development.

Senior Scientist II | Erba Molecular (Jan 2020 - Dec 2023)

  • Led development of multiplex PCR diagnostic assays and used machine learning to model signal dynamics and improve call accuracy in edge cases.
  • Built internal Python tools to process high-dimensional assay data, perform feature engineering, and conduct performance comparison across reagent lots and devices.
  • Used unsupervised learning (e.g., k-means, PCA) to cluster test results by amplification profile or cartridge behavior, identifying subtle failure modes.
  • Integrated cloud-based analytics pipelines to automate batch QC analysis and alert users of potential data quality issues.

Senior Scientist I | Erba Molecular (Aug 2018 - Jan 2020)

  • Managed and trained junior staff in diagnostic assay development.
  • Led the development and validation of novel RNA isolation technologies.
  • Integrated RNA extraction platforms onto third-party and proprietary systems.

Scientist | Erba Molecular (Oct 2017 - Aug 2018)

  • Worked on early-stage assay development and validation.
  • Assisted in nucleic acid purification research and integration.

Machine Learning Projects

Exercise Classification Using Smartphone Sensor Data

Collected time-series motion data from smartphone IMUs during resistance training. Built a 1D Convolutional Neural Network (1D-CNN) to automatically classify types of weight training exercises based on movement patterns.

Depression Prediction Using Lifestyle and Work Factors

Applied logistic regression, random forests, and support vector machines to predict depression risk based on lifestyle, sleep, food habits, and occupational stress indicators. Tuned models using cross-validation and feature selection techniques.

Cancer Survival Modeling with Kaplan–Meier and Machine Learning

Used Kaplan–Meier curves and Cox Proportional Hazards models to explore survival trends. Trained ML models (e.g., XGBoost, LightGBM) to predict survival probability from high-dimensional clinical and demographic data with censored outcomes.

PCR Amplification Curve Classification

Developed supervised ML classifiers to categorize real-time PCR amplification curves into strong, weak, or failed signals, enhancing diagnostic accuracy and supporting QC decisions.

Microfluidic Pressure Anomaly Detection

Used time series anomaly detection (e.g., moving z-scores, isolation forests) to detect pressure anomalies in microfluidic cartridges, identifying leaks or obstructions in early-stage prototypes.

Metabolomics Feature Analysis

Applied dimensionality reduction (PCA, t-SNE), clustering (k-means, DBSCAN), and Elastic Net regularization to identify key features in high-dimensional metabolomics datasets. Used statistical testing and visualization to distinguish metabolic profiles across experimental groups.

Skills

  • Programming & Data Analysis: Python (pandas, NumPy, matplotlib, seaborn, scikit-learn), SQL, Jupyter
  • Machine Learning: Classification (PCR signal patterns), anomaly detection, regression, clustering (PCA, k-means), feature engineering
  • Diagnostics Data Expertise: Real-time PCR data, amplification curves, time series from thermal/pressure sensors, microfluidic cartridge metrics
  • Cloud & Tools: AWS (S3, Lambda), Streamlit, MongoDB, Git, GitHub, Bitbucket
  • Statistical Techniques: t-tests, ANOVA, logistic regression, ROC/AUC, confidence intervals, experimental design
  • Workflow & Collaboration: Jupyter Notebooks, Jira, data dashboards, integration with LIMS/ELNs
  • Compliance & Development: IVDR, V&V documentation, design transfer, cross-functional team leadership

Education

MSc in Computer Science (Data Analytics) | University of York (2020 - 2023)

PhD in Molecular Biology | Cardiff University (2013 - 2017)

MSc in Forensic Science & Technology | University of Central Lancashire (2012 - 2013)