Causality on cross-sectional data: Stable specification search in constrained structural equation modeling

Rahmadi, R., Groot, P., Heins, M., Knoop, H., Heskes, T. and OPTIMISM Consortium, (2017) Causality on cross-sectional data: Stable specification search in constrained structural equation modeling. Applied Soft Computing, 52, pp. 687-698. (doi: 10.1016/j.asoc.2016.10.003)

Full text not currently available from Enlighten.

Abstract

Causal modeling has long been an attractive topic for many researchers and in recent decades there has seen a surge in theoretical development and discovery algorithms. Generally discovery algorithms can be divided into two approaches: constraint-based and score-based. The constraint-based approach is able to detect common causes of the observed variables but the use of independence tests makes it less reliable. The score-based approach produces a result that is easier to interpret as it also measures the reliability of the inferred causal relationships, but it is unable to detect common confounders of the observed variables. A drawback of both score-based and constrained-based approaches is the inherent instability in structure estimation. With finite samples small changes in the data can lead to completely different optimal structures. The present work introduces a new hypothesis-free score-based causal discovery algorithm, called stable specification search, that is robust for finite samples based on recent advances in stability selection using subsampling and selection algorithms. Structure search is performed over structural equation models. Our approach uses exploratory search but allows incorporation of prior background knowledge. We validated our approach on one simulated data set, which we compare to the known ground truth, and two real-world data sets for chronic fatigue syndrome and attention deficit hyperactivity disorder, which we compare to earlier medical studies. The results on the simulated data set show significant improvement over alternative approaches and the results on the real-word data sets show consistency with the hypothesis driven models constructed by medical experts.

Item Type:Articles
Additional Information:The research leading to these results has received funding from the DGHE of Indonesia and the European Community’s Seventh Framework Programme (FP7/2007-2013) under grant agreement no. 305697.
Status:Published
Refereed:Yes
Glasgow Author(s) Enlighten ID:Monckton, Professor Darren
Authors: Rahmadi, R., Groot, P., Heins, M., Knoop, H., Heskes, T., and OPTIMISM Consortium,
College/School:College of Medical Veterinary and Life Sciences > School of Molecular Biosciences
Journal Name:Applied Soft Computing
Publisher:Elsevier
ISSN:1568-4946
ISSN (Online):1872-9681
Published Online:11 October 2016

University Staff: Request a correction | Enlighten Editors: Update this record