The Paper Feed
A feed of Bayesian network related papers, articles, books and research that we happen across and find of interest
A Bayesian network based learning system for modelling faults in large-scale manufacturing
Manufacturing companies can benefit from the early prediction and detection of failures to improve their product yield and reduce system faults through advanced data analytics. Whilst an abundance of data on their processing systems exist, they face difficulties in using it to gain insights to improve their systems. Bayesian networks (BNs) are considered here for diagnosing and predicting faults in a large manufacturing dataset from Bosch. Whilst BN structure learning has been performed traditionally on smaller sized data, this work demonstrates the ability to learn an appropriate BN structure for a large dataset with little information on the variables, for the first time. This paper also demonstrates a new framework for creating an appropriate probabilistic model for the Bosch dataset through the selection of statistically important variables on the response; this is then used to create a BN network which can be used to answer probabilistic queries and classify products based on changes in the sensor values in the production process.
Partial Least Squares Discriminant Analysis and Bayesian Networks for Metabolomic Prediction of Childhood Asthma
To explore novel methods for the analysis of metabolomics data, we compared the ability of Partial Least Squares Discriminant Analysis (PLS-DA) and Bayesian networks (BN) to build predictive plasma metabolite models of age three asthma status in 411 three year olds (n = 59 cases and 352 controls) from the Vitamin D Antenatal Asthma Reduction Trial (VDAART) study. The standard PLS-DA approach had impressive accuracy for the prediction of age three asthma with an Area Under the Curve Convex Hull (AUCCH) of 81%. However, a permutation test indicated the possibility of overfitting. In contrast, a predictive Bayesian network including 42 metabolites had a significantly higher AUCCH of 92.1% (p for difference < 0.001), with no evidence that this accuracy was due to overfitting. Both models provided biologically informative insights into asthma; in particular, a role for dysregulated arginine metabolism and several exogenous metabolites that deserve further investigation as potential causative agents. As the BN model outperformed the PLS-DA model in both accuracy and decreased risk of overfitting, it may therefore represent a viable alternative to typical analytical approaches for the investigation of metabolomics data.
Risk Assessment of Underground Subway Stations to Fire Disasters Using Bayesian Network
Subway station fires often have serious consequences because of the high density of people and limited number of exits in a relatively enclosed space. In this study, a comprehensive model based on Bayesian network (BN) and the Delphi method is established for the rapid and dynamic assessment of the fire evolution process, and consequences, in underground subway stations. Based on the case studies of typical subway station fire accidents, 28 BN nodes are proposed to represent the evolution process of subway station fires, from causes to consequences. Based on expert knowledge and consistency processing by the Delphi method, the conditional probabilities of child BN nodes are determined. The BN model can quantitatively evaluate the factors influencing fire causes, fire proof/intervention measures, and fire consequences. The results show that the framework, combined with Bayesian network and the Delphi method, is a reliable tool for dynamic assessment of subway station fires. This study could offer insights to a more realistic analysis for emergency decision-making on fire disaster reduction, since the proposed approach could take into account the conditional dependency in the fire propagation process and incorporate fire proof/intervention measures, which is helpful for resilience and sustainability promotion of underground facilities.
Modeling interrelationships between health behaviors in overweight breast cancer survivors: Applying Bayesian networks
Obesity and its impact on health is a multifaceted phenomenon encompassing many factors, including demographics, environment, lifestyle, and psychosocial functioning. A systems science approach, investigating these many influences, is needed to capture the complexity and multidimensionality of obesity prevention to improve health. Leveraging baseline data from a unique clinical cohort comprising 333 postmenopausal overweight or obese breast cancer survivors participating in a weight-loss trial, we applied Bayesian networks, a machine learning approach, to infer interrelationships between lifestyle factors (e.g., sleep, physical activity), body mass index (BMI), and health outcomes (biomarkers and self-reported quality of life metrics). We used bootstrap resampling to assess network stability and accuracy, and Bayesian information criteria (BIC) to compare networks. Our results identified important behavioral subnetworks. BMI was the primary pathway linking behavioral factors to glucose regulation and inflammatory markers; the BMI-biomarker link was reproduced in 100% of resampled networks. Sleep quality was a hub impacting mental quality of life and physical health with > 95% resampling reproducibility. Omission of the BMI or sleep links significantly degraded the fit of the networks. Our findings suggest potential mechanistic pathways and useful intervention targets for future trials. Using our models, we can make quantitative predictions about health impacts that would result from targeted, weight loss and/or sleep improvement interventions. Importantly, this work highlights the utility of Bayesian networks in health behaviors research.
Impact of drivers of change, including climatic factors, on the occurrence of chemical food safety hazards in fruits and vegetables: a Bayesian Network approach
The presence and development of many food safety risks are driven by factors within and outside the food supply chain, such as climate, economy and human behaviour. The interactions between these factors and the supply chain are complex and a system or holistic approach is needed to reveal cause-effect relationships and to be able to perform effective mitigation actions to minimise food safety risks. In this study, we demonstrate the potential of the Bayesian Network (BN) approach to identify and quantify the strength of relationships and interactions between the presence of food safety hazards as reported in Rapid Alert System for Food and Feed (RASFF) for fruits and vegetables on one hand, and climatic factors, economic and agronomic data on the other. To this end, all food safety notifications in RASFF (i.e. 3,781 notifications) on fruits and vegetables originating from India, Turkey and the Netherlands were collected for the period 2005-2015. In addition, climatic factors (e.g. temperature, precipitation), agricultural factors (e.g. pesticide use, fertilizer use) and economic factors (e.g. price, production volumes) were collected for the countries of origin of the product concurrent with the period of food safety notification in RASFF. A BN was constructed with 80% of the collected data using a machine-learning algorithm and optimised for each specific hazard category. The performance of the developed BN was determined in terms of accuracy of prediction of the hazard category in the evaluation set comprising 20% of the total data. The accuracy was high (95%) and the following factors contributed most: product category, notifying country, yearly production, number of notification, maximal residue level (MRL) ratio, country of origin, and the annual agricultural budget of a country. The assessment of the impact of interactions within the BN showed a significant interaction between the presence and level of a hazard as reported in RASFF and several drivers of change but at present, no definite conclusions can be drawn regarding the climatic factors and food safety hazards.
Modelling Electronic Trust Using Bayesian Networks
This paper discusses importance of trust in the context of digital economy. Even though electronic commerce continues to grow worldwide due to many of its advantages, it has not been fully adopted yet. The reason for some barriers in adopting e-commerce lies in potential customers who still perceive online setting as quite risky. Customers who have concerns related to sellers’ IT infrastructure resilience, and secured and safe personal data, will hardly ever engage in e-transactions. The nature of trust is very subjective, complex and multi-faceted. Trust issues are not present only between buyers and sellers, but also between suppliers and sellers, trust in recommendations and references on certain products, etc. In this paper authors propose modelling trust using Bayesian networks and provide an illustrative example which is typical in online transactions.
Probabilistic Age Classification with Bayesian Networks
In the past few decades, the rise of criminal, civil and asylum cases involving young people lacking valid identification documents has generated an increase in the demand of age estimation. The chronological age or the probability that an individual is older or younger than a given age threshold are generally estimated by means of some statistical methods based on observations performed on specific physical attributes. Among these statistical methods, those developed in the Bayesian framework allow the user to provide coherent and transparent assignments which fulfill forensic and medico-legal purposes. The application of the Bayesian approach is facilitated by using probabilistic graphical tools, such as Bayesian networks. The aim of this work is to test the performances of the Bayesian network for age estimation recently presented in scientific literature in classifying individuals as older or younger than 18 years of age. For these exploratory analyses, a sample related to the ossification status of the medial clavicular epiphysis available in scientific literature was used. Results obtained in the classification are extremely promising: in the criminal context, the Bayesian network achieved, on the average, a rate of correct classifications of approximatively 97%, whilst in the civil context, the rate is, on the average, close to the 88%. These results encourage the continuation of the development and the testing of the method in order to support its practical application in casework.
Reducing COPD Readmissions: A Causal Bayesian Network Model - IEEE Journals
This paper introduces a causal Bayesian network model to study readmissions reduction for chronic obstructive pulmonary disease (COPD) patients. The model employs a Bayesian network learning method and adopts domain knowledge. Using this model, we analyze the impacts of critical variables on a patient's readmission risk by manipulation of such variables. Through this analysis, effective intervention options to reduce readmission can be identified, which can provide a quantitative tool for designing personalized interventions to reduce COPD readmissions.