PH4S019 - Computational and Data-Driven Techniques in the Pharmaceutical Industry 23 Feb 2025 - 31 Aug 2027 | Version 2

Associated Module Information

Module Code: PH4S019
Module Title: Computational and Data-Driven Techniques in the Pharmaceutical Industry
Faculty: Faculty of Computing, Engineering and Science
Faculty Group: Applied Sciences
Faculty Sub Group: Chemistry and Pharmaceutical Science
Module Leader: Samuel Jobbins
Module Team: Suzanna Kean
First Intended Intake: SEP 2024 Final Year of Intake: 2024
Date Closed:
Credit Value: 20 Credit Level: 7
Language: English
Percentage of Module Taught in Welsh: 0
Equivalent Module:
HECOS codes: 100754 - databases 100992 - machine learning 101029 - computational mathematics
HECOS Code Weighting: 34 33 33

Document Version Information

Version 2
Valid From 23 Feb 2025
Valid To 31 Aug 2027

Module Aims

• To develop an awareness of the roles that advanced computing and applied mathematics play in the chemical and pharmaceutical industries;
• To understand core techniques from the fields of data science and machine learning, with an emphasis on those which are applicable to experimental design and analysis in the pharmaceutical sciences;
• To have an awareness of future trends in the field, such as the widespread use of artificial intelligence in drug discovery, molecular modelling and protein structure prediction;
• To provide the practical skills required to confidently interact with such systems.

Content Summary

1) Computer Aided Drug Design (Computational Chemistry)
Following a revision of atomic / molecular orbitals, this topic will:
Introduce the idea of modelling / simulating molecular systems on computers
Introduce different simulation techniques used in computational chemistry (forcefields / semi-empirical descriptions, leading up to density functional theory and QM methods)
Introduce computational chemistry concepts useful in computer aided drug design (including but not necessarily limited to molecular docking, molecular dynamics, structure prediction, screening, enhanced sampling methods etc.)

2) Chemometrics
Following a revision of the underlying mathematical principles this topic will focus on data science and machine learning techniques applicable for use in experimental design and analysis in the pharmaceutical sciences, including but not limited to:
• Data Preprocessing (feature scaling, data transformations, feature encoding etc, with specific emphasis on techniques relevant to the chemical and pharmaceutical industries e.g. Savitzky-Golay filtering)
• Supervised Learning – regression, classification
• Unsupervised Learning – clustering, dimensionality reduction
• Model validation and tuning
• Introduction to Deep Learning – ANNs, CNNs, RNNs, autoencoders (brief, but important for wider context, especially in terms of advanced applications)

3) Artificial Intelligence Applications
This final topic will look at the cutting edge applications of artificial intelligence in the pharmaceutical industry – such topics are expected to include: AI in drug discovery, AI in protein structure prediction (e.g. AlphaFold), AI in molecular simulation, AI in spectra analysis / prediction.

Learning and Teaching Methods

Activity Type Hours
Lecture 20
Practical classes and workshops 10
Independent Study 80
Directed Study 80
Problem / challenge based learning 10
Total Hours Selected 200

Learning Outcomes

# Learning Outcome
LO1 Demonstrate a knowledge of industry-relevant computational and mathematical tools and their applications.
LO2 To demonstrate an ability to select and apply appropriate computational and mathematical tools to solve specific problems/address specific challenges in the workplace.

Module Requisites

N/A

Assessment Criteria

Assessment Category Assessment Type Description Duration Word Count Weight (%) Best of? Pass Mark
Asynchronous Assessment Practical Written Work 2 Completion of chemometrics task, utilising data science and ML tools 0 3000 60 No 40
Asynchronous Assessment Practical Written Work 1 Completion of tasks on WebMO and associated write-up 0 2000 40 No 40

Assessment Matrix

Assessment Type Learning Outcomes
LO1 LO2
Practical Written Work 2
Practical Written Work 1

Reading List

Artificial intelligence in drug discovery and development.
Paul D, Sanap G, Shenoy S, Kalyane D, Kalia K, Tekade RK. Drug Discov Today. 2021 Jan;26(1):80-93. doi: 10.1016/j.drudis.2020.10.010. Epub 2020 Oct 21. PMID: 33099022; PMCID: PMC7577280.

Combining Machine Learning and Computational Chemistry for Predictive Insights Into Chemical Systems
John A. Keith, Valentin Vassilev-Galindo, Bingqing Cheng, Stefan Chmiela, Michael Gastegger, Klaus-Robert Müller, and Alexandre Tkatchenko
Chemical Reviews 2021 121 (16), 9816-9872
DOI: 10.1021/acs.chemrev.1c00107

Machine Learning in Chemistry: The Impact of Artificial Intelligence
Hugh M Cartwright (Ed)
RSC Publishing, 2020 https://doi.org/10.1039/9781839160233

Highly accurate protein structure prediction with AlphaFold
Jumper, J., Evans, R., Pritzel, A. et al. . Nature 596, 583–589 (2021). https://doi.org/10.1038/s41586-021-03819-2