Design of Biopharmaceutical Formulations Accelerated by Machine Learning

In addition to activity, successful biological drugs must exhibit a series of suitable developability properties, which depend on both protein sequence and buffer composition. In the context of this high-dimensional optimization problem, advanced algorithms from the domain of machine learning are highly beneficial in complementing analytical screening and rational design. Here, we propose a Bayesian optimization algorithm to accelerate the design of biopharmaceutical formulations. We demonstrate the power of this approach by identifying the formulation that optimizes the thermal stability of three tandem single-chain Fv variants within 25 experiments, a number which is less than one-third of the experiments that would be required by a classical DoE method and several orders of magnitude smaller compared to detailed experimental analysis of full combinatorial space.

We further show the advantage of this method over conventional approaches to efficiently transfer historical information as prior knowledge for the development of new biologics or when new buffer agents are available. Moreover, we highlight the benefit of our technique in engineering multiple biophysical properties by simultaneously optimizing both thermal and interface stabilities. This optimization minimizes the amount of surfactant in the formulation, which is important to decrease the risks associated with corresponding degradation processes.

Overall, this method can provide high speed of converging to optimal conditions, the ability to transfer prior knowledge, and the identification of new nonlinear combinations of excipients. We envision that these features can lead to a considerable acceleration in formulation design and to parallelization of operations during drug development.

Download the full research paper as PDF here: Design of Biopharmaceutical Formulations Accelerated by Machine Learning

or read the article here


Frequency of excipient appearance in the antibody formulations marketed since 2015
Frequency of excipient appearance in the antibody formulations marketed since 2015

The figure above summarizes the excipients that are most commonly used in marketed antibody formulations based on the data available in the PharmaCircle database. In the last 5 years, histidine has been the most common buffering agent in the marketed antibody formulations followed by phosphates, sodium hydroxide, citrate (citric acid), and acetate (acetic acid). Polysorbates (Polysorbate 80 or 20) are the most common surfactants, with Polysorbate 80 appearing more often than Polysorbate 20 ). Among other excipients, sucrose and sodium chloride (NaCl) are the most predominant, followed by trehalose, mannitol, sorbitol, and amino acids (AAs).


Design of Biopharmaceutical Formulations Accelerated by Machine Learning
Harini Narayanan, Fabian Dingfelder, Itzel Condado Morales, Bhargav Patel, Kristine Enemærke Heding, Jais Rose Bjelke, Thomas Egebjerg, Alessandro Butté, Michael Sokolov, Nikolai Lorenzen, and Paolo Arosio

Molecular Pharmaceutics 2021 18 (10), 3843-3853

DOI: 10.1021/acs.molpharmaceut.1c00469

You might also like