Anton Morgunov
Ph.D. Candidate in Theoretical & Computational Chemistry at Yale University. Building generative models that master the grammar of reactivity, and the infrastructure required to validate them.
Education
Ph.D. Candidate in Theoretical & Computational Chemistry
Aug 2023 - May 2026 (Expected)
Yale University
Advisor: Prof. Victor S. Batista
M.Sc. in Chemistry
Aug 2023 - Dec 2024
Yale University
Grade Average: Honors
S.B. in Chemistry and Biology, Minor in Computer Science
Sep 2018 - May 2023
Massachusetts Institute of Technology
GPA: 5.0/5.0. Phi Beta Kappa.
Research Experience
Graduate Student
Nov 2023 - Present
Prof. Victor S. Batista Lab, Yale University
- Developed DirectMultiStep, a Transformer-based retrosynthesis model (Mixture-of-Experts) that eliminates search-space explosion, achieving 3.1x higher accuracy than graph-search methods.
- Secured $100k prize (2nd Place) in the Standard Industries Chemical Innovation Challenge by deploying these generative models to solve industrial synthesis targets.
- Engineered ChemSpaceAL, an active learning framework using latent-space upsampling that reduced the computational cost of alignment by 90% (10x efficiency gain).
Undergraduate Researcher
Feb 2022 - May 2023
Prof. Troy Van Voorhis Lab, MIT
- Developed a composite electronic structure method recovering Coupled-Cluster (CCSD) accuracy within 0.02 eV at a fraction of the computational cost (MP2 scaling).
- Automated large-scale benchmarking pipelines (2k+ lines Python/Shell), streamlining the calculation → analysis → visualization workflow for high-throughput spectroscopy prediction.
Undergraduate Researcher
Sep 2018 - Dec 2021
Prof. Ronald T. Raines Lab, MIT
- Executed multi-step organic synthesis of novel diazo compounds for protein bioconjugation, characterizing products via NMR and MS.
- Bridged experimental results with theory by modeling transition states using DFT to elucidate reaction mechanisms.
Work Experience
ML Research Intern
Jun 2025 - Aug 2025
Stealth Startup
- Architected end-to-end reaction plausibility engine and negative data generation pipeline for proprietary workflows.
- Refactored GNN codebase into production-ready PyTorch, resolving bottlenecks to achieve 2x inference speedup.
- Engineered automated benchmarking suite to validate model performance against internal experimental datasets.
Graduate Research Assistant
Jul 2024 - Feb 2025
Los Alamos National Laboratory
- Modernized legacy infrastructure by porting MATLAB codebases to high-performance Python/VASP pipelines.
- Identified and fixed critical theoretical discrepancies in published methodology, restoring reproducibility to a stalled research project.
Engineering & Open Source
RetroCast & SynthArena
2025 - Present
Universal Benchmarking Infrastructure
- Built the industry-standard evaluation harness (RetroCast) and visualization engine (SynthArena), solving the field's data heterogeneity problem to enable the first rigorous comparison of generative vs. search-based planners.
- Sole architect of full-stack system (Next.js, Python, SQLite); engineered cryptographic provenance tracking and deployed via self-managed CI/CD pipelines.
Production Inference Platform
2024 - Present
- Engineered and maintain a full-stack MLOps platform serving models from 5 publications; currently processing live prediction traffic for 130+ external researchers.
- Managed bare-metal GPU infrastructure (VPS/Docker/NGINX) to achieve zero vendor lock-in and data sovereignty.
Leadership Experience
Co-Founder & Board Member
Apr 2020 - Present
Beyond Curriculum Public Foundation
- Built and managed a remote organization of 70+ volunteers; scaled platform to 2M+ monthly pageviews serving 190k+ unique students.
- Secured over $40,000 in grants and corporate sponsorship to democratize STEM education access in rural regions.
Chairman & Head Mentor
Nov 2021 - Mar 2023
Kazakhstan Chemistry Olympiads Association (QazChO)
- Implemented data-driven selection pipelines that resulted in 5 consecutive years of IChO gold medals (ending a 5-year drought) and represented national delegation as arbitrator at IChO 2019-2022.
Technical Skills
Languages
Python (Pydantic, Ruff, uv), TypeScript, Shell/Bash
ML & Data Science
PyTorch, PyTorch Geometric, DGL, scikit-learn, NumPy, Pandas
MLOps & Full-Stack
Docker, Next.js, React, PostgreSQL, Prisma, NGINX, Flask, AWS
Software Engineering
API Design, Unit & Integration Testing (Pytest), Git/GitHub
Scientific Computing
RDKit, ORCA, Q-Chem, PySCF, SLURM
Awards & Recognition
Phi Beta Kappa
Xi Chapter of Massachusetts, MIT, 2023
Academic Achievement Award
MIT Department of Chemistry, 2023
El Maqtanyshy (Pride of the Nation)
Nursultan Nazarbayev Foundation, 2020 & 2019
Olympiad Coaching Award
Ministry of Education and Science of Kazakhstan, 2019
Gold Medal (Ranked 10th)
International Chemistry Olympiad (IChO), 2017
Gold Medal (Ranked 8th)
International Mendeleev Chemistry Olympiad, 2017