Bio

Mitch Naylor a machine learning and AI scientist with extensive experience in natural language processing (NLP), statistical modeling, and deep learning. Mitch currently works as a senior applied researcher at GitHub, where he leads a team of machine learning scientists to develop new AI-powered products and improve existing features within GitHub Copilot. Mitch is also an author of Applied Causal Inference, which released in August 2023.

Mitch lives in Chattanooga, TN. Outside of work, he enjoys rock climbing, hiking, music, and traveling.

Tools

  • Technology: Python, Docker, Git, cloud (GCP, Azure, and AWS), Dash, SQL, Kusto, Go, TypeScript, R
  • Methods: NLP, large language models (LLMs), language modeling, machine learning, model evaluation, deep learning, Bayesian statistics, causal inference, data visualization, experimental design, data manipulation
  • Libraries: PyTorch, Hugging Face, pandas, NumPy, scikit-learn, XGBoost, spaCy, matplotlib, plotly, ggplot2, tidyverse

Experience

GitHub | Senior Applied Researcher | April 2024 - Present
  • Lead an embedded team of 6 applied researchers, serving as the primary tech lead from Applied Science driving cross-functional collaboration with product, engineering, and design teams
  • Develop new AI-powered experiences within GitHub Copilot, leveraging cutting-edge LLMs to deliver new features for millions of developers
  • Design agentic systems to accomplish complex software engineering tasks, such as reviewing pull requests (see Copilot Code Review)
  • Fine-tune LLMs using LoRA to improve code generation quality
  • Mentor junior team members through pairing and sharing knowledge of ML best practices
  • Conduct A/B tests to quantify model improvements, delivering statistically significant performance gains in production
  • Act as a key member of the hiring team for applied researchers, including designing technical interview material and assessing prospective team members
Azra AI | Lead Data Scientist | October 2019 - March 2024
  • Led research on efficient architectures for long-context NLP, implementing and adapting emerging methods (including MEGA architecture contribution to Hugging Face Transformers) for clinical NLP
  • Designed pretraining and domain-adaptation pipelines, producing language models optimized for deployment on resource-constrained hardware
  • Built and deployed clinical NLP models (NER, classification) identifying high-risk findings for 200k+ patients annually, enabling early detection and faster treatment
  • Drove technical strategy for ML product development, evaluating cutting-edge research and rapidly prototyping novel architectures for production deployment
  • Collaborated with clinical stakeholders to ensure model accuracy and reliability met the high standards required for cancer detection workflows
  • Note: Azra AI spun out as a standalone company in January 2022, previously part of Digital Reasoning
Asurion | Data Scientist | March 2018 - October 2019
  • Applied methods from NLP, statistical inference, and operations research to solve a diverse set of business problems
  • Communicated findings to stakeholders ranging from highly technical to C-level
GEICO | Product Modeling Analyst III | June 2016 - March 2018
  • Built predictive models for customer behavior under various pricing scenarios
  • Conducted robust statistical analyses in support of pricing department

Education

Georgia Institute of Technology | M.S. Analytics | Completed December 2020

University of Tennessee | B.S. Business Analytics | Completed May 2016

Open Source + Publications

Author of Applied Causal Inference, released August 2023

Implemented MEGA into Hugging Face’s transformers library

Using Machine Learning to Accelerate Identification of Pancreatic Incidentalomas | Journal of Oncology Navigation and Survivorship, 2023 | Paper link

Quantifying Explainability in NLP and Analyzing Algorithms for Performance-Explainability Tradeoff | Interpretable Healthcare in Machine Learning (IMLH) at ICML 2021 | Paper link

PsychBERT: A Mental Health Language Model for Social Media Mental Health Behavioral Analysis | IEEE International Conference on Bioinformatics and Biomedicine (BIBM) 2021 | Paper link + link to pretrained model

Non-Author Textbook Contributions

Code contributions for Transformers for Machine Learning: A Deep Dive (Kamath, Graham & Emara; Chapman & Hall, 2022); Machine Translation - Transformer vs. LSTM

Code contributions for Explainable Artificial Intelligence: An Introduction to Interpretable Machine Learning (Kamath & Liu; Springer 2021); Demo of Explainable Models

Mitchell Naylor


Bio

Mitch Naylor a machine learning and AI scientist with extensive experience in natural language processing (NLP), statistical modeling, and deep learning. Mitch currently works as a senior applied researcher at GitHub, where he leads a team of machine learning scientists to develop new AI-powered products and improve existing features within GitHub Copilot. Mitch is also an author of Applied Causal Inference, which released in August 2023.

Mitch lives in Chattanooga, TN. Outside of work, he enjoys rock climbing, hiking, music, and traveling.

Tools

  • Technology: Python, Docker, Git, cloud (GCP, Azure, and AWS), Dash, SQL, Kusto, Go, TypeScript, R
  • Methods: NLP, large language models (LLMs), language modeling, machine learning, model evaluation, deep learning, Bayesian statistics, causal inference, data visualization, experimental design, data manipulation
  • Libraries: PyTorch, Hugging Face, pandas, NumPy, scikit-learn, XGBoost, spaCy, matplotlib, plotly, ggplot2, tidyverse

Experience

GitHub | Senior Applied Researcher | April 2024 - Present
  • Lead an embedded team of 6 applied researchers, serving as the primary tech lead from Applied Science driving cross-functional collaboration with product, engineering, and design teams
  • Develop new AI-powered experiences within GitHub Copilot, leveraging cutting-edge LLMs to deliver new features for millions of developers
  • Design agentic systems to accomplish complex software engineering tasks, such as reviewing pull requests (see Copilot Code Review)
  • Fine-tune LLMs using LoRA to improve code generation quality
  • Mentor junior team members through pairing and sharing knowledge of ML best practices
  • Conduct A/B tests to quantify model improvements, delivering statistically significant performance gains in production
  • Act as a key member of the hiring team for applied researchers, including designing technical interview material and assessing prospective team members
Azra AI | Lead Data Scientist | October 2019 - March 2024
  • Led research on efficient architectures for long-context NLP, implementing and adapting emerging methods (including MEGA architecture contribution to Hugging Face Transformers) for clinical NLP
  • Designed pretraining and domain-adaptation pipelines, producing language models optimized for deployment on resource-constrained hardware
  • Built and deployed clinical NLP models (NER, classification) identifying high-risk findings for 200k+ patients annually, enabling early detection and faster treatment
  • Drove technical strategy for ML product development, evaluating cutting-edge research and rapidly prototyping novel architectures for production deployment
  • Collaborated with clinical stakeholders to ensure model accuracy and reliability met the high standards required for cancer detection workflows
  • Note: Azra AI spun out as a standalone company in January 2022, previously part of Digital Reasoning
Asurion | Data Scientist | March 2018 - October 2019
  • Applied methods from NLP, statistical inference, and operations research to solve a diverse set of business problems
  • Communicated findings to stakeholders ranging from highly technical to C-level
GEICO | Product Modeling Analyst III | June 2016 - March 2018
  • Built predictive models for customer behavior under various pricing scenarios
  • Conducted robust statistical analyses in support of pricing department

Education

Georgia Institute of Technology | M.S. Analytics | Completed December 2020

University of Tennessee | B.S. Business Analytics | Completed May 2016

Open Source + Publications

Author of Applied Causal Inference, released August 2023

Implemented MEGA into Hugging Face’s transformers library

Using Machine Learning to Accelerate Identification of Pancreatic Incidentalomas | Journal of Oncology Navigation and Survivorship, 2023 | Paper link

Quantifying Explainability in NLP and Analyzing Algorithms for Performance-Explainability Tradeoff | Interpretable Healthcare in Machine Learning (IMLH) at ICML 2021 | Paper link

PsychBERT: A Mental Health Language Model for Social Media Mental Health Behavioral Analysis | IEEE International Conference on Bioinformatics and Biomedicine (BIBM) 2021 | Paper link + link to pretrained model

Non-Author Textbook Contributions

Code contributions for Transformers for Machine Learning: A Deep Dive (Kamath, Graham & Emara; Chapman & Hall, 2022); Machine Translation - Transformer vs. LSTM

Code contributions for Explainable Artificial Intelligence: An Introduction to Interpretable Machine Learning (Kamath & Liu; Springer 2021); Demo of Explainable Models