Bias Propagation in Large Scale Machine Learning Pipelines in the Pharmaceutical Sector
Keywords:
Bias propagation, machine learning pipelines, pharmaceutical analytics, clinical decision support systems, artificial intelligence, data governance, explainable AIAbstract
Machine learning pipelines in the pharmaceutical sector increasingly influence discovery, clinical decision support, safety monitoring, and operational planning. While these systems promise efficiency and scale, they also introduce complex mechanisms through which bias is accumulated, amplified, and propagated across interconnected data and model layers. Unlike isolated model bias, pipeline level bias emerges from interactions between data acquisition, preprocessing, feature engineering, learning architectures, and deployment feedback loops. This work presents a systematic investigation of bias propagation in large scale pharmaceutical machine learning pipelines. We propose a formal pipeline bias decomposition framework, introduce quantitative propagation metrics, and demonstrate how bias evolves across discovery, development, and post market surveillance workflows. Experimental results highlight measurable distortions in risk prediction, patient stratification, and adverse event detection. The study emphasizes the need for architecture aware mitigation strategies that extend beyond single model interventions
Downloads
Published
Issue
Section
License
Copyright (c) 2022 The Artificial Intelligence Journal

This work is licensed under a Creative Commons Attribution 4.0 International License.