Anton Morgunov

Ph.D. Candidate in Theoretical & Computational Chemistry at Yale University. Building generative models that master the grammar of reactivity, and the infrastructure required to validate them.

Education

Ph.D. Candidate in Theoretical & Computational Chemistry

Aug 2023 - May 2026 (Expected)

Yale University

Advisor: Prof. Victor S. Batista

M.Sc. in Chemistry

Aug 2023 - Dec 2024

Yale University

Grade Average: Honors

S.B. in Chemistry and Biology, Minor in Computer Science

Sep 2018 - May 2023

Massachusetts Institute of Technology

GPA: 5.0/5.0. Phi Beta Kappa.

Research Experience

Graduate Student

Nov 2023 - Present

Prof. Victor S. Batista Lab, Yale University

  • Developed DirectMultiStep, a Transformer-based retrosynthesis model (Mixture-of-Experts) that eliminates search-space explosion, achieving 3.1x higher accuracy than graph-search methods.
  • Secured $100k prize (2nd Place) in the Standard Industries Chemical Innovation Challenge by deploying these generative models to solve industrial synthesis targets.
  • Engineered ChemSpaceAL, an active learning framework using latent-space upsampling that reduced the computational cost of alignment by 90% (10x efficiency gain).

Undergraduate Researcher

Feb 2022 - May 2023

Prof. Troy Van Voorhis Lab, MIT

  • Developed a composite electronic structure method recovering Coupled-Cluster (CCSD) accuracy within 0.02 eV at a fraction of the computational cost (MP2 scaling).
  • Automated large-scale benchmarking pipelines (2k+ lines Python/Shell), streamlining the calculation → analysis → visualization workflow for high-throughput spectroscopy prediction.

Undergraduate Researcher

Sep 2018 - Dec 2021

Prof. Ronald T. Raines Lab, MIT

  • Executed multi-step organic synthesis of novel diazo compounds for protein bioconjugation, characterizing products via NMR and MS.
  • Bridged experimental results with theory by modeling transition states using DFT to elucidate reaction mechanisms.

Work Experience

ML Research Intern

Jun 2025 - Aug 2025

Stealth Startup

  • Architected end-to-end reaction plausibility engine and negative data generation pipeline for proprietary workflows.
  • Refactored GNN codebase into production-ready PyTorch, resolving bottlenecks to achieve 2x inference speedup.
  • Engineered automated benchmarking suite to validate model performance against internal experimental datasets.

Graduate Research Assistant

Jul 2024 - Feb 2025

Los Alamos National Laboratory

  • Modernized legacy infrastructure by porting MATLAB codebases to high-performance Python/VASP pipelines.
  • Identified and fixed critical theoretical discrepancies in published methodology, restoring reproducibility to a stalled research project.

Engineering & Open Source

RetroCast & SynthArena

2025 - Present

Universal Benchmarking Infrastructure

  • Built the industry-standard evaluation harness (RetroCast) and visualization engine (SynthArena), solving the field's data heterogeneity problem to enable the first rigorous comparison of generative vs. search-based planners.
  • Sole architect of full-stack system (Next.js, Python, SQLite); engineered cryptographic provenance tracking and deployed via self-managed CI/CD pipelines.

Production Inference Platform

2024 - Present

models.batistalab.com

  • Engineered and maintain a full-stack MLOps platform serving models from 5 publications; currently processing live prediction traffic for 130+ external researchers.
  • Managed bare-metal GPU infrastructure (VPS/Docker/NGINX) to achieve zero vendor lock-in and data sovereignty.

Leadership Experience

Co-Founder & Board Member

Apr 2020 - Present

Beyond Curriculum Public Foundation

  • Built and managed a remote organization of 70+ volunteers; scaled platform to 2M+ monthly pageviews serving 190k+ unique students.
  • Secured over $40,000 in grants and corporate sponsorship to democratize STEM education access in rural regions.

Chairman & Head Mentor

Nov 2021 - Mar 2023

Kazakhstan Chemistry Olympiads Association (QazChO)

  • Implemented data-driven selection pipelines that resulted in 5 consecutive years of IChO gold medals (ending a 5-year drought) and represented national delegation as arbitrator at IChO 2019-2022.

Technical Skills

Languages

Python (Pydantic, Ruff, uv), TypeScript, Shell/Bash

ML & Data Science

PyTorch, PyTorch Geometric, DGL, scikit-learn, NumPy, Pandas

MLOps & Full-Stack

Docker, Next.js, React, PostgreSQL, Prisma, NGINX, Flask, AWS

Software Engineering

API Design, Unit & Integration Testing (Pytest), Git/GitHub

Scientific Computing

RDKit, ORCA, Q-Chem, PySCF, SLURM

Awards & Recognition

  • Phi Beta Kappa

    Xi Chapter of Massachusetts, MIT, 2023

  • Academic Achievement Award

    MIT Department of Chemistry, 2023

  • El Maqtanyshy (Pride of the Nation)

    Nursultan Nazarbayev Foundation, 2020 & 2019

  • Olympiad Coaching Award

    Ministry of Education and Science of Kazakhstan, 2019

  • Gold Medal (Ranked 10th)

    International Chemistry Olympiad (IChO), 2017

  • Gold Medal (Ranked 8th)

    International Mendeleev Chemistry Olympiad, 2017

In the News