Machine Learning Scientist/Sr Scientist, Federated Benchmarking & Validation Engineering
Company: Eli Lilly and Company
Location: Indianapolis
Posted on: January 4, 2026
|
|
|
Job Description:
At Lilly, we unite caring with discovery to make life better for
people around the world. We are a global healthcare leader
headquartered in Indianapolis, Indiana. Our employees around the
world work to discover and bring life-changing medicines to those
who need them, improve the understanding and management of disease,
and give back to our communities through philanthropy and
volunteerism. We give our best effort to our work, and we put
people first. We’re looking for people who are determined to make
life better for people around the world. Purpose Lilly TuneLab is
an AI-powered drug discovery platform that provides biotech
companies with access to machine learning models trained on Lilly's
extensive proprietary pharmaceutical research data. Through
federated learning, the platform enables Lilly to build models on
broad, diverse datasets from across the biotech ecosystem while
preserving partner data privacy and competitive advantages. This
collaborative approach accelerates drug discovery by creating
continuously improving AI models that benefit both Lilly and our
biotech partners. The Machine Learning Scientist/Sr Scientist,
Federated Benchmarking & Validation Engineering plays an essential
role within the TuneLab platform, responsible for identifying,
assessing, and implementing cutting-edge algorithmic solutions that
leverage diverse datasets while ensuring data privacy and security
for our biotech partners. This position requires comprehensive
knowledge in small molecule drug development, ADME/Tox, antibody
engineering, and/or genetic medicine, combined with expertise in
data science and statistical analysis to develop sophisticated
models utilizing federated learning. This position will be
instrumental in advancing both Lilly's pipeline and our partners'
drug discovery efforts by designing critical algorithms and
workflows that expedite the creation of transformative therapies.
This role centers on constructing robust validation frameworks for
federated models, creating privacy-preserving test sets across
partner datasets, establishing standardized benchmarks against
public datasets, and ensuring model reproducibility and
generalization in diverse deployment scenarios. Key
Responsibilities Federated Test Set Design : Architect and
implement privacy-preserving protocols for constructing
representative test sets across distributed partner datasets,
ensuring statistical validity while maintaining data isolation.
Benchmark Suite Development : Create comprehensive benchmark suites
covering small molecules (ADMET, solubility, permeability),
antibodies (affinity, stability, immunogenicity), and RNA
therapeutics (stability, delivery, off-target effects).
Cross-Domain Validation : Develop validation strategies that assess
model generalization across different experimental protocols, cell
lines, species, and therapeutic indications while respecting
partner data boundaries. Public Dataset Integration :
Systematically benchmark federated models against public datasets
(ChEMBL, PubChem, PDB, Therapeutic Antibody Database) to establish
performance baselines and identify gaps. Validation Frameworks :
Implement time-split or proper scaffold-split validation protocols
that assess model performance on prospective data, simulating
real-world deployment scenarios and detecting concept drift.
Reproducibility Infrastructure : Build robust MLOps pipelines
ensuring complete reproducibility of federated experiments,
including versioning of data snapshots, model checkpoints, and
hyperparameter configurations. Statistical Rigor : Design
statistically powered validation studies accounting for multiple
testing, hierarchical data structures, and non-independent
observations common in drug discovery datasets. Performance
Profiling : Develop comprehensive performance profiling across
diverse molecular scaffolds, target classes, and property ranges,
identifying systematic biases and failure modes. Platform
Integration : Collaborate with engineering teams to integrate
validation frameworks with the TuneLab federated learning platform
built on NVIDIA FLARE, ensuring scalable and automated testing
across partner networks. Basic Qualifications PhD in Computational
Biology, Bioinformatics, Cheminformatics, Computer Science,
Statistics, or related field from an accredited college or
university Minimum of 2 years of experience in the
biopharmaceutical industry or related fields, with demonstrated
expertise in drug discovery and early development Strong foundation
in experimental design, statistical validation, and hypothesis
testing Experience with ML model validation, cross-validation
strategies, and performance metrics Proficiency in data
engineering, pipeline development, and automation Additional
Preferences Experience with federated learning platforms and
distributed computing Knowledge of regulatory requirements for
AI/ML in pharmaceutical development Expertise in ADMET assay
development and validation Understanding of antibody engineering
and characterization methods Familiarity with RNA therapeutic
design and delivery systems Experience with clinical biomarker
validation and translational research Proficiency in workflow
orchestration tools (Airflow, Kubeflow, Prefect) Strong knowledge
of containerization and cloud computing (Docker, Kubernetes)
Publications on model validation, benchmarking, or reproducibility
Experience with GxP compliance and quality management systems
Exceptional attention to detail and commitment to scientific rigor
Strong technical writing skills for regulatory documentation
Portfolio mindset balancing rigorous validation with rapid
deployment for partner value This role is based at a Lilly site in
Indianapolis, South San Francisco, or Boston with up to 10% travel
(attendance expected at key industry conferences). Relocation is
provided. Lilly is dedicated to helping individuals with
disabilities to actively engage in the workforce, ensuring equal
opportunities when vying for positions. If you require
accommodation to submit a resume for a position at Lilly, please
complete the accommodation request form (
https://careers.lilly.com/us/en/workplace-accommodation ) for
further assistance. Please note this is for individuals to request
an accommodation as part of the application process and any other
correspondence will not receive a response. Lilly is proud to be an
EEO Employer and does not discriminate on the basis of age, race,
color, religion, gender identity, sex, gender expression, sexual
orientation, genetic information, ancestry, national origin,
protected veteran status, disability, or any other legally
protected status. Our employee resource groups (ERGs) offer strong
support networks for their members and are open to all employees.
Our current groups include: Africa, Middle East, Central Asia
Network, Black Employees at Lilly, Chinese Culture Network,
Japanese International Leadership Network (JILN), Lilly India
Network, Organization of Latinx at Lilly (OLA), PRIDE (LGBTQ
Allies), Veterans Leadership Network (VLN), Women’s Initiative for
Leading at Lilly (WILL), enAble (for people with disabilities).
Learn more about all of our groups. Actual compensation will depend
on a candidate’s education, experience, skills, and geographic
location. The anticipated wage for this position is $151,500 -
$244,200 Full-time equivalent employees also will be eligible for a
company bonus (depending, in part, on company and individual
performance). In addition, Lilly offers a comprehensive benefit
program to eligible employees, including eligibility to participate
in a company-sponsored 401(k); pension; vacation benefits;
eligibility for medical, dental, vision and prescription drug
benefits; flexible benefits (e.g., healthcare and/or dependent day
care flexible spending accounts); life insurance and death
benefits; certain time off and leave of absence benefits; and
well-being benefits (e.g., employee assistance program, fitness
benefits, and employee clubs and activities).Lilly reserves the
right to amend, modify, or terminate its compensation and benefit
programs in its sole discretion and Lilly’s compensation practices
and guidelines will apply regarding the details of any promotion or
transfer of Lilly employees. WeAreLilly
Keywords: Eli Lilly and Company, Covington , Machine Learning Scientist/Sr Scientist, Federated Benchmarking & Validation Engineering, Science, Research & Development , Indianapolis, Kentucky