{"id":9576,"date":"2023-01-03T13:54:55","date_gmt":"2023-01-03T12:54:55","guid":{"rendered":"https:\/\/www.iese.fraunhofer.de\/blog\/?p=9576"},"modified":"2024-02-26T10:56:55","modified_gmt":"2024-02-26T09:56:55","slug":"causal-inference","status":"publish","type":"post","link":"https:\/\/www.iese.fraunhofer.de\/blog\/causal-inference\/","title":{"rendered":"Causal inference: An introduction on how to separate causal effects from spurious correlations in data"},"content":{"rendered":"<p class=\"lead\">What is causal inference in statistics data science? While &#8222;correlation does not imply causation&#8220;, it is possible to identify causal effects even in data that does not come from randomized controlled trials. Our AI expert, Dr. Julien Siebert, just published a paper on the <a href=\"https:\/\/arxiv.org\/abs\/2211.11482\">applications of statistical causal inference in software engineering<\/a>. In this Fraunhofer IESE blog post, he gives an introduction to the topic of causal inference, its benefits, its limitations, and other pointers to deepen our readers&#8216; knowledge of this topic.<\/p>\n<p>In general, the best way to measure the causal effect of an action (&#8222;a treatment&#8220;) on a system of interest (&#8222;the outcome&#8220;) is to perform a randomized controlled trial.<\/p>\n<p>However, in many cases, performing such a controlled experiment is not possible (for practical or ethical reasons). In such cases, analysts are then left with so-called &#8222;observational data&#8220;; i.e., data that has been gathered in an uncontrolled setting.<\/p>\n<p>The problem with such data is that the observed effect might be due to different factors: some truly causal ones, and some others due to unrelated correlation (a.k.a. spurious correlation).<\/p>\n<figure id=\"attachment_9636\" aria-describedby=\"caption-attachment-9636\" style=\"width: 698px\" class=\"wp-caption aligncenter\"><img loading=\"lazy\" decoding=\"async\" class=\"wp-image-9636 size-large\" src=\"https:\/\/www.iese.fraunhofer.de\/blog\/wp-content\/uploads\/2022\/09\/chart1-698x275.jpeg\" alt=\"An example of spurious correlation\" width=\"698\" height=\"275\" srcset=\"https:\/\/www.iese.fraunhofer.de\/blog\/wp-content\/uploads\/2022\/09\/chart1-698x275.jpeg 698w, https:\/\/www.iese.fraunhofer.de\/blog\/wp-content\/uploads\/2022\/09\/chart1-400x158.jpeg 400w, https:\/\/www.iese.fraunhofer.de\/blog\/wp-content\/uploads\/2022\/09\/chart1-768x303.jpeg 768w, https:\/\/www.iese.fraunhofer.de\/blog\/wp-content\/uploads\/2022\/09\/chart1-1536x606.jpeg 1536w, https:\/\/www.iese.fraunhofer.de\/blog\/wp-content\/uploads\/2022\/09\/chart1-2048x807.jpeg 2048w, https:\/\/www.iese.fraunhofer.de\/blog\/wp-content\/uploads\/2022\/09\/chart1-1138x450.jpeg 1138w, https:\/\/www.iese.fraunhofer.de\/blog\/wp-content\/uploads\/2022\/09\/chart1-1320x520.jpeg 1320w\" sizes=\"auto, (max-width: 698px) 100vw, 698px\" \/><figcaption id=\"caption-attachment-9636\" class=\"wp-caption-text\">An example of (easily detectable) spurious correlation. Source <a href=\"https:\/\/www.tylervigen.com\/spurious-correlations\">https:\/\/www.tylervigen.com\/spurious-correlations<\/a><\/figcaption><\/figure>\n<p>Finding methods for separating the wheat (causal effects) from the chaff (spurious correlations), that is, identifying causal effects from the nets of spurious correlations, has been the focus of researchers in the field of causal inference such as Judea Pearl and colleagues (Pearl &amp; Mackenzie 2018; Pearl, Glymour &amp; Jewell 2016).<\/p>\n<p>In this blog post, I will present a short introduction to the topic of causal inference.<\/p>\n<h2>Example: Simpson&#8217;s paradox<\/h2>\n<p>One important thing to realize is that, contrary to widespread belief in data science, data cannot speak for itself. Or at least, in some cases known as statistical paradoxes, data can be interpreted in completely opposite directions, especially if the underlying assumptions about how the data was generated are not explicit.<\/p>\n<p>The typical example, often used in many textbooks on causal inference, is Simpson&#8217;s paradox (Pearl 2013), illustrated in the figure below.<\/p>\n<figure id=\"attachment_9677\" aria-describedby=\"caption-attachment-9677\" style=\"width: 700px\" class=\"wp-caption aligncenter\"><a href=\"https:\/\/upload.wikimedia.org\/wikipedia\/commons\/f\/fb\/Simpsons_paradox_-_animation.gif\"><img loading=\"lazy\" decoding=\"async\" class=\"wp-image-9677 size-full\" src=\"https:\/\/www.iese.fraunhofer.de\/blog\/wp-content\/uploads\/2022\/09\/Simpsons_paradox_-_animation.gif\" alt=\"Causal inference: A nice illustration of Simpson Paradox from Wikipedia\" width=\"700\" height=\"500\" \/><\/a><figcaption id=\"caption-attachment-9677\" class=\"wp-caption-text\">An illustration of Simpson&#8217;s paradox (source <a href=\"https:\/\/en.wikipedia.org\/wiki\/Simpson%27s_paradox#\/media\/File:Simpsons_paradox_-_animation.gif\">Wikipedia<\/a>). Analyzing the whole population (black) or the different groups (colors) separately leads to opposite interpretations. The whole population shows a negative effect of X on Y, whereas for each group, the effect of X on Y is positive.<\/figcaption><\/figure>\n<h2>Modeling causal assumptions<\/h2>\n<p>In order to solve the paradox, it is necessary to make some assumptions explicit. In our example, if gender influences treatment uptake (e.g., if men are more likely to take the drug) and if gender also influences recovery (e.g., if women have lower blood pressure), then gender is a so-called confounder. In this case, it is necessary to analyze the data separately for the different genders.<\/p>\n<p>Modeling these causal hypotheses is the first step in causal inference. Typically, these assumptions take the form of a causal graph, where nodes represent variables, links represent potential direct causal effects, and no link represents the strong hypothesis that there is no direct effect between the variables.<\/p>\n<figure id=\"attachment_9661\" aria-describedby=\"caption-attachment-9661\" style=\"width: 339px\" class=\"wp-caption aligncenter\"><img loading=\"lazy\" decoding=\"async\" class=\"wp-image-9661 size-full\" src=\"https:\/\/www.iese.fraunhofer.de\/blog\/wp-content\/uploads\/2022\/09\/small-graph-confounder.png\" alt=\"A small causal graph\" width=\"339\" height=\"210\" \/><figcaption id=\"caption-attachment-9661\" class=\"wp-caption-text\">An example of a causal graph made up of three nodes: gender, treatment (drug), outcome (healthy). The gender node (red) has an influence on both the treatment and the outcome. It is a confounder. Graph and colors from <a href=\"http:\/\/www.dagitty.net\/\">DAGitty<\/a>.<\/figcaption><\/figure>\n<p>One of the good things about having such a representation (apart from making causal hypotheses explicit) is that it is technically possible to try to falsify it by assessing which variables have to be independent in the data.<\/p>\n<figure id=\"attachment_9674\" aria-describedby=\"caption-attachment-9674\" style=\"width: 376px\" class=\"wp-caption aligncenter\"><img loading=\"lazy\" decoding=\"async\" class=\"wp-image-9674 size-full\" src=\"https:\/\/www.iese.fraunhofer.de\/blog\/wp-content\/uploads\/2022\/09\/graph-conditional-independences-1.png\" alt=\"a causal graph illustrating conditional independence\" width=\"376\" height=\"351\" \/><figcaption id=\"caption-attachment-9674\" class=\"wp-caption-text\">In this graph, blood pressure and gender are statistically independent given a specific treatment. This can be directly measured in the data in order to check whether the model is valid. Graph and colors from <a href=\"http:\/\/www.dagitty.net\/\">DAGitty<\/a>. Note that the model also implies a second conditional independence: Treatment and outcome are statistically independent given a specific pair of blood pressure and gender.<\/figcaption><\/figure>\n<div><\/div>\n<h2>\u00a0Identifying whether causal effect can be computed from data<\/h2>\n<p>The graphical structure helps to disentangle correlations from causality. Through the application of a causal identification method called the do-calculus (Pearl, 2012), it is possible to decide whether we can measure causal effect from observational data and how the data needs to be analyzed. Going back to Simpson&#8217;s paradox, the application of the do-calculus can tell us whether we should analyze the data separately for each gender or whether we should analyze it for the whole population. Technically, the do-calculus involves the concepts of d-separation as well as backdoor and frontdoor criteria, which are beyond the scope of this blog post (the interested reader can jump to the pointers section below), but luckily for us, the do-calculus has been implemented in libraries such as <a href=\"https:\/\/py-why.github.io\/dowhy\/\">DoWhy<\/a> or <a href=\"http:\/\/www.dagitty.net\/\">DAGitty<\/a>, and identification can be easily automated.<\/p>\n<figure id=\"attachment_9661\" aria-describedby=\"caption-attachment-9661\" style=\"width: 339px\" class=\"wp-caption aligncenter\"><img loading=\"lazy\" decoding=\"async\" class=\"wp-image-9661 size-full\" src=\"https:\/\/www.iese.fraunhofer.de\/blog\/wp-content\/uploads\/2022\/09\/small-graph-confounder.png\" alt=\"A small causal graph\" width=\"339\" height=\"210\" \/><figcaption id=\"caption-attachment-9661\" class=\"wp-caption-text\">In this graph, in order to measure the causal effect of the treatment on the outcome, it is necessary to &#8222;adjust for&#8220; gender, i.e., to analyze the data separately for each gender. The <a href=\"http:\/\/www.dagitty.net\/\">DAGitty<\/a> library directly provides a list of variables that need to be &#8222;adjusted for&#8220; and colors the spurious correlation paths in red.<\/figcaption><\/figure>\n<h2>Estimating causal effects from data<\/h2>\n<p>Once a causal effect has been identified, the next step is to estimate that effect using the available data. This is where Machine Learning comes into play. Note that we are not talking about classical Machine Learning algorithms, but about algorithms dedicated to the estimation of causal effects, such as TARNet, T-learner, or X-learner (the adventurous reader can refer to chapter 7 of Brady Neal&#8217;s lecture <a href=\"https:\/\/www.bradyneal.com\/causal-inference-course\">I<span dir=\"ltr\" role=\"presentation\">ntroduction to Causal Inference <\/span><span class=\"\" dir=\"ltr\" role=\"presentation\">from a Machine Learning Perspective<\/span><\/a>). Again, several implementations are available: Besides DoWhy, which was already mentioned above, <a href=\"https:\/\/causalml.readthedocs.io\/en\/latest\/index.html\">CausalML<\/a> is also worth a look.<\/p>\n<h2>Refutation: challenging the effect found<\/h2>\n<p>Now that a causal effect has been both identified and estimated, how can we trust our results? In science, ultimately, we can never prove that a causal effect is true. We can only try to falsify it. Here, it is interesting to take a look at the <a href=\"https:\/\/py-why.github.io\/dowhy\/\">DoWhy<\/a> library and the proposed refutation <a href=\"https:\/\/github.com\/py-why\/dowhy#iv-refute-the-obtained-estimate\">methods<\/a>. Here are some methods for illustration:<\/p>\n<ul>\n<li><em><strong>Add Random Common Cause<\/strong>: Does the estimation method change its estimate after we add an independent random variable as a common cause to the dataset? (Hint: It should not.)<\/em><\/li>\n<li><em><strong>Placebo Treatment<\/strong>: What happens to the estimated causal effect when we replace the true treatment variable with an independent random variable? (Hint: The effect should go to zero.)<\/em><\/li>\n<li><em><strong>Dummy Outcome<\/strong>: What happens to the estimated causal effect when we replace the true outcome variable with an independent random variable? (Hint: The effect should go to zero.)<\/em><\/li>\n<\/ul>\n<h2>Conclusion<\/h2>\n<p>Doing randomized control trials is the gold standard, but when these are not possible, the whole field of causal inference helps in separating causal effects from spurious correlations.<\/p>\n<div class=\"info-box\">\n<p><strong>Interested in this topic?<\/strong><\/p>\n<p>&nbsp;<\/p>\n<p>Feel free to <a href=\"https:\/\/arxiv.org\/abs\/2211.11482\">read the paper<\/a> and\/or to contact our expert Dr. Julien Siebert <a href=\"mailto: Julien.Siebert@iese.fraunhofer.de\">via Mail<\/a> or <a href=\"https:\/\/www.linkedin.com\/in\/dr-siebert-julien\/\">LinkedIn<\/a>.<\/p>\n<\/div>\n<h2>Pointers:<\/h2>\n<p>Here are some pointers for the interested reader.<\/p>\n<h4>Introduction to the topic:<\/h4>\n<ul>\n<li>Was, wie, warum? &#8211; Einf\u00fchrungskurs Kausale Inferenz (German, accessible): <a href=\"https:\/\/ki-campus.org\/courses\/wwweki\">https:\/\/ki-campus.org\/courses\/wwweki<\/a><\/li>\n<li>Introduction to Causal Inference from a machine learning perspective (English, more technical): <a href=\"https:\/\/www.bradyneal.com\/causal-inference-course\">https:\/\/www.bradyneal.com\/causal-inference-course<\/a><\/li>\n<\/ul>\n<h4>Books to read:<\/h4>\n<ul>\n<li>The Book of Why: The New Science of Cause and Effect. Judea Pearl and Dana Mackenzie. 2018. Penguin (UK). <a href=\"http:\/\/bayes.cs.ucla.edu\/WHY\/\">http:\/\/bayes.cs.ucla.edu\/WHY\/<\/a>. ISBN 9780141982410 (English, accessible).<\/li>\n<li>The Effect: An Introduction to Research Design and Causality. Nick Huntington-Klein. 2022. CRC Press. <a href=\"https:\/\/theeffectbook.net\/\">https:\/\/theeffectbook.net\/<\/a>. ISBN 1032125780 (English, accessible).<\/li>\n<li>Causal inference in statistics, A Primer. Judea Pearl, Madelyn Glymour, and Nicholas P. Jewell. 2016. Wiley. <a href=\"http:\/\/bayes.cs.ucla.edu\/PRIMER\/\">http:\/\/bayes.cs.ucla.edu\/PRIMER\/<\/a>. ISBN: 1119186846 (English, more technical).<\/li>\n<\/ul>\n<h4>Tools:<\/h4>\n<ul>\n<li>DAGitty | Draw and analyze causal diagrams: <a href=\"http:\/\/www.dagitty.net\/\">http:\/\/www.dagitty.net\/<\/a><\/li>\n<li>DoWhy | An end-to-end library for causal inference: <a href=\"https:\/\/py-why.github.io\/dowhy\/\">https:\/\/py-why.github.io\/dowhy\/<\/a><\/li>\n<\/ul>\n<h2>References:<\/h2>\n<p>Judea Pearl. 2012. <span dir=\"ltr\" role=\"presentation\">The<\/span> <span dir=\"ltr\" role=\"presentation\">Do<\/span><span dir=\"ltr\" role=\"presentation\">-Calculus Revisited. <\/span><span dir=\"ltr\" role=\"presentation\">Keynote Lecture, August 17, 2012. <\/span><span dir=\"ltr\" role=\"presentation\">UAI-2012 Conference, Catalina, CA. <span class=\"markedContent\">TECHNICAL REPORT R-402. <a href=\"https:\/\/ftp.cs.ucla.edu\/pub\/stat_ser\/r402.pdf\">https:\/\/ftp.cs.ucla.edu\/pub\/stat_ser\/r402.pdf<\/a><br \/>\n<\/span><\/span>Judea Pearl. 2013. Understanding Simpson\u2019s Paradox. TECHNICAL REPORT R-414. <a href=\"https:\/\/ftp.cs.ucla.edu\/pub\/stat_ser\/r414.pdf\">https:\/\/ftp.cs.ucla.edu\/pub\/stat_ser\/r414.pdf<\/a><br \/>\nJudea Pearl and Dana Mackenzie. 2018. The Book of Why. The New Science of Cause and Effect. Penguin (UK). ISBN: 9780141982410<br \/>\nJudea Pearl, Madelyn Glymour, and Nicholas P. Jewell. 2016. Causal Inference in Statistics. A Primer. Wiley. ISBN: 1119186846<\/p>\n","protected":false},"excerpt":{"rendered":"<p>What is causal inference in statistics data science? While &#8222;correlation does not imply causation&#8220;, it is possible to identify causal effects even in data that does not come from randomized controlled trials. Our AI expert, Dr. Julien Siebert, just published a paper on the applications of statistical causal inference in software engineering. In this Fraunhofer [&hellip;]<\/p>\n","protected":false},"author":66,"featured_media":9629,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"featured_image_credits_title":"\u00a9iStock.com | Greyfebruary","featured_image_credits_url":"","featured_image_credits_position":"bottom-right","featured_image_credits_shadow":"-1px -1px 0 rgba(0,0,0,0.25),0 -1px 0 rgba(0,0,0,0.25),1px -1px 0 rgba(0,0,0,0.25),1px 0 0 rgba(0,0,0,0.25),1px 1px 0 rgba(0,0,0,0.25),0 1px 0 rgba(0,0,0,0.25),-1px 1px 0 rgba(0,0,0,0.25),-1px 0 0 rgba(0,0,0,0.25)","_jetpack_memberships_contains_paid_content":false,"footnotes":""},"categories":[177],"tags":[104,170,415,198,296],"coauthors":[214],"class_list":["post-9576","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-kuenstliche-intelligenz","tag-data-analytics","tag-data-science","tag-dependable-ai-verlaessliche-ki","tag-english","tag-machine-learning"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v28.1 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>Causal inference: An introduction on how to separate causal effects from spurious correlations in data - Blog des Fraunhofer IESE<\/title>\n<meta name=\"description\" content=\"In this blog post, we give an introduction on causal inference methods for separating causal effects from spurious correlations in data.\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/www.iese.fraunhofer.de\/blog\/causal-inference\/\" \/>\n<meta property=\"og:locale\" content=\"de_DE\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Causal inference: An introduction on how to separate causal effects from spurious correlations in data - Blog des Fraunhofer IESE\" \/>\n<meta property=\"og:description\" content=\"In this blog post, we give an introduction on causal inference methods for separating causal effects from spurious correlations in data.\" \/>\n<meta property=\"og:url\" content=\"https:\/\/www.iese.fraunhofer.de\/blog\/causal-inference\/\" \/>\n<meta property=\"og:site_name\" content=\"Fraunhofer IESE\" \/>\n<meta property=\"article:publisher\" content=\"https:\/\/www.facebook.com\/FraunhoferIESE\/\" \/>\n<meta property=\"article:published_time\" content=\"2023-01-03T12:54:55+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2024-02-26T09:56:55+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/www.iese.fraunhofer.de\/blog\/wp-content\/uploads\/2022\/09\/iStock-483804887.jpg\" \/>\n\t<meta property=\"og:image:width\" content=\"800\" \/>\n\t<meta property=\"og:image:height\" content=\"600\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/jpeg\" \/>\n<meta name=\"author\" content=\"Dr. Julien Siebert\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:creator\" content=\"@FraunhoferIESE\" \/>\n<meta name=\"twitter:site\" content=\"@FraunhoferIESE\" \/>\n<meta name=\"twitter:label1\" content=\"Verfasst von\" \/>\n\t<meta name=\"twitter:data1\" content=\"Dr. Julien Siebert\" \/>\n\t<meta name=\"twitter:label2\" content=\"Gesch\u00e4tzte Lesezeit\" \/>\n\t<meta name=\"twitter:data2\" content=\"7\u00a0Minuten\" \/>\n\t<meta name=\"twitter:label3\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data3\" content=\"Dr. Julien Siebert\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\\\/\\\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\\\/\\\/www.iese.fraunhofer.de\\\/blog\\\/causal-inference\\\/#article\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/www.iese.fraunhofer.de\\\/blog\\\/causal-inference\\\/\"},\"author\":{\"name\":\"Dr. Julien Siebert\",\"@id\":\"https:\\\/\\\/www.iese.fraunhofer.de\\\/blog\\\/#\\\/schema\\\/person\\\/96d3d2ce62a13a8ee470ea948601ff28\"},\"headline\":\"Causal inference: An introduction on how to separate causal effects from spurious correlations in data\",\"datePublished\":\"2023-01-03T12:54:55+00:00\",\"dateModified\":\"2024-02-26T09:56:55+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\\\/\\\/www.iese.fraunhofer.de\\\/blog\\\/causal-inference\\\/\"},\"wordCount\":1411,\"publisher\":{\"@id\":\"https:\\\/\\\/www.iese.fraunhofer.de\\\/blog\\\/#organization\"},\"image\":{\"@id\":\"https:\\\/\\\/www.iese.fraunhofer.de\\\/blog\\\/causal-inference\\\/#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/www.iese.fraunhofer.de\\\/blog\\\/wp-content\\\/uploads\\\/2022\\\/09\\\/iStock-483804887.jpg\",\"keywords\":[\"Data Analytics\",\"Data Science\",\"Dependable AI \\\/ Verl\u00e4ssliche KI\",\"English\",\"Machine Learning\"],\"articleSection\":[\"K\u00fcnstliche Intelligenz\"],\"inLanguage\":\"de\"},{\"@type\":\"WebPage\",\"@id\":\"https:\\\/\\\/www.iese.fraunhofer.de\\\/blog\\\/causal-inference\\\/\",\"url\":\"https:\\\/\\\/www.iese.fraunhofer.de\\\/blog\\\/causal-inference\\\/\",\"name\":\"Causal inference: An introduction on how to separate causal effects from spurious correlations in data - Blog des Fraunhofer IESE\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/www.iese.fraunhofer.de\\\/blog\\\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\\\/\\\/www.iese.fraunhofer.de\\\/blog\\\/causal-inference\\\/#primaryimage\"},\"image\":{\"@id\":\"https:\\\/\\\/www.iese.fraunhofer.de\\\/blog\\\/causal-inference\\\/#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/www.iese.fraunhofer.de\\\/blog\\\/wp-content\\\/uploads\\\/2022\\\/09\\\/iStock-483804887.jpg\",\"datePublished\":\"2023-01-03T12:54:55+00:00\",\"dateModified\":\"2024-02-26T09:56:55+00:00\",\"description\":\"In this blog post, we give an introduction on causal inference methods for separating causal effects from spurious correlations in data.\",\"breadcrumb\":{\"@id\":\"https:\\\/\\\/www.iese.fraunhofer.de\\\/blog\\\/causal-inference\\\/#breadcrumb\"},\"inLanguage\":\"de\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\\\/\\\/www.iese.fraunhofer.de\\\/blog\\\/causal-inference\\\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"de\",\"@id\":\"https:\\\/\\\/www.iese.fraunhofer.de\\\/blog\\\/causal-inference\\\/#primaryimage\",\"url\":\"https:\\\/\\\/www.iese.fraunhofer.de\\\/blog\\\/wp-content\\\/uploads\\\/2022\\\/09\\\/iStock-483804887.jpg\",\"contentUrl\":\"https:\\\/\\\/www.iese.fraunhofer.de\\\/blog\\\/wp-content\\\/uploads\\\/2022\\\/09\\\/iStock-483804887.jpg\",\"width\":800,\"height\":600,\"caption\":\"3D illustration.\"},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\\\/\\\/www.iese.fraunhofer.de\\\/blog\\\/causal-inference\\\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Startseite\",\"item\":\"https:\\\/\\\/www.iese.fraunhofer.de\\\/blog\\\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Causal inference: An introduction on how to separate causal effects from spurious correlations in data\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\\\/\\\/www.iese.fraunhofer.de\\\/blog\\\/#website\",\"url\":\"https:\\\/\\\/www.iese.fraunhofer.de\\\/blog\\\/\",\"name\":\"Fraunhofer IESE\",\"description\":\"Blog des Fraunhofer-Institut f\u00fcr Experimentelles Software Engineering\",\"publisher\":{\"@id\":\"https:\\\/\\\/www.iese.fraunhofer.de\\\/blog\\\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\\\/\\\/www.iese.fraunhofer.de\\\/blog\\\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"de\"},{\"@type\":\"Organization\",\"@id\":\"https:\\\/\\\/www.iese.fraunhofer.de\\\/blog\\\/#organization\",\"name\":\"Fraunhofer IESE\",\"url\":\"https:\\\/\\\/www.iese.fraunhofer.de\\\/blog\\\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"de\",\"@id\":\"https:\\\/\\\/www.iese.fraunhofer.de\\\/blog\\\/#\\\/schema\\\/logo\\\/image\\\/\",\"url\":\"https:\\\/\\\/www.iese.fraunhofer.de\\\/blog\\\/wp-content\\\/uploads\\\/2016\\\/08\\\/fhg_iese_logo.png\",\"contentUrl\":\"https:\\\/\\\/www.iese.fraunhofer.de\\\/blog\\\/wp-content\\\/uploads\\\/2016\\\/08\\\/fhg_iese_logo.png\",\"width\":183,\"height\":50,\"caption\":\"Fraunhofer IESE\"},\"image\":{\"@id\":\"https:\\\/\\\/www.iese.fraunhofer.de\\\/blog\\\/#\\\/schema\\\/logo\\\/image\\\/\"},\"sameAs\":[\"https:\\\/\\\/www.facebook.com\\\/FraunhoferIESE\\\/\",\"https:\\\/\\\/x.com\\\/FraunhoferIESE\",\"https:\\\/\\\/www.linkedin.com\\\/company\\\/fraunhoferiese\\\/\",\"https:\\\/\\\/www.youtube.com\\\/c\\\/FraunhoferIESE\"]},{\"@type\":\"Person\",\"@id\":\"https:\\\/\\\/www.iese.fraunhofer.de\\\/blog\\\/#\\\/schema\\\/person\\\/96d3d2ce62a13a8ee470ea948601ff28\",\"name\":\"Dr. Julien Siebert\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"de\",\"@id\":\"https:\\\/\\\/www.iese.fraunhofer.de\\\/blog\\\/wp-content\\\/uploads\\\/2020\\\/10\\\/siebert_julien_8513_blog-96x96.jpg299e210bdf5631c16552ac66c10e3e56\",\"url\":\"https:\\\/\\\/www.iese.fraunhofer.de\\\/blog\\\/wp-content\\\/uploads\\\/2020\\\/10\\\/siebert_julien_8513_blog-96x96.jpg\",\"contentUrl\":\"https:\\\/\\\/www.iese.fraunhofer.de\\\/blog\\\/wp-content\\\/uploads\\\/2020\\\/10\\\/siebert_julien_8513_blog-96x96.jpg\",\"caption\":\"Dr. Julien Siebert\"},\"description\":\"Julien Siebert is working as senior AI Expert in the Data Science department of Fraunhofer IESE. He studied Artificial Intelligence and Engineering Science and got his PhD in Computer Science. His professional interests include data science processes, artificial intelligence and complex systems.\",\"url\":\"https:\\\/\\\/www.iese.fraunhofer.de\\\/blog\\\/author\\\/julien-siebert\\\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Causal inference: An introduction on how to separate causal effects from spurious correlations in data - Blog des Fraunhofer IESE","description":"In this blog post, we give an introduction on causal inference methods for separating causal effects from spurious correlations in data.","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/www.iese.fraunhofer.de\/blog\/causal-inference\/","og_locale":"de_DE","og_type":"article","og_title":"Causal inference: An introduction on how to separate causal effects from spurious correlations in data - Blog des Fraunhofer IESE","og_description":"In this blog post, we give an introduction on causal inference methods for separating causal effects from spurious correlations in data.","og_url":"https:\/\/www.iese.fraunhofer.de\/blog\/causal-inference\/","og_site_name":"Fraunhofer IESE","article_publisher":"https:\/\/www.facebook.com\/FraunhoferIESE\/","article_published_time":"2023-01-03T12:54:55+00:00","article_modified_time":"2024-02-26T09:56:55+00:00","og_image":[{"width":800,"height":600,"url":"https:\/\/www.iese.fraunhofer.de\/blog\/wp-content\/uploads\/2022\/09\/iStock-483804887.jpg","type":"image\/jpeg"}],"author":"Dr. Julien Siebert","twitter_card":"summary_large_image","twitter_creator":"@FraunhoferIESE","twitter_site":"@FraunhoferIESE","twitter_misc":{"Verfasst von":"Dr. Julien Siebert","Gesch\u00e4tzte Lesezeit":"7\u00a0Minuten","Written by":"Dr. Julien Siebert"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/www.iese.fraunhofer.de\/blog\/causal-inference\/#article","isPartOf":{"@id":"https:\/\/www.iese.fraunhofer.de\/blog\/causal-inference\/"},"author":{"name":"Dr. Julien Siebert","@id":"https:\/\/www.iese.fraunhofer.de\/blog\/#\/schema\/person\/96d3d2ce62a13a8ee470ea948601ff28"},"headline":"Causal inference: An introduction on how to separate causal effects from spurious correlations in data","datePublished":"2023-01-03T12:54:55+00:00","dateModified":"2024-02-26T09:56:55+00:00","mainEntityOfPage":{"@id":"https:\/\/www.iese.fraunhofer.de\/blog\/causal-inference\/"},"wordCount":1411,"publisher":{"@id":"https:\/\/www.iese.fraunhofer.de\/blog\/#organization"},"image":{"@id":"https:\/\/www.iese.fraunhofer.de\/blog\/causal-inference\/#primaryimage"},"thumbnailUrl":"https:\/\/www.iese.fraunhofer.de\/blog\/wp-content\/uploads\/2022\/09\/iStock-483804887.jpg","keywords":["Data Analytics","Data Science","Dependable AI \/ Verl\u00e4ssliche KI","English","Machine Learning"],"articleSection":["K\u00fcnstliche Intelligenz"],"inLanguage":"de"},{"@type":"WebPage","@id":"https:\/\/www.iese.fraunhofer.de\/blog\/causal-inference\/","url":"https:\/\/www.iese.fraunhofer.de\/blog\/causal-inference\/","name":"Causal inference: An introduction on how to separate causal effects from spurious correlations in data - Blog des Fraunhofer IESE","isPartOf":{"@id":"https:\/\/www.iese.fraunhofer.de\/blog\/#website"},"primaryImageOfPage":{"@id":"https:\/\/www.iese.fraunhofer.de\/blog\/causal-inference\/#primaryimage"},"image":{"@id":"https:\/\/www.iese.fraunhofer.de\/blog\/causal-inference\/#primaryimage"},"thumbnailUrl":"https:\/\/www.iese.fraunhofer.de\/blog\/wp-content\/uploads\/2022\/09\/iStock-483804887.jpg","datePublished":"2023-01-03T12:54:55+00:00","dateModified":"2024-02-26T09:56:55+00:00","description":"In this blog post, we give an introduction on causal inference methods for separating causal effects from spurious correlations in data.","breadcrumb":{"@id":"https:\/\/www.iese.fraunhofer.de\/blog\/causal-inference\/#breadcrumb"},"inLanguage":"de","potentialAction":[{"@type":"ReadAction","target":["https:\/\/www.iese.fraunhofer.de\/blog\/causal-inference\/"]}]},{"@type":"ImageObject","inLanguage":"de","@id":"https:\/\/www.iese.fraunhofer.de\/blog\/causal-inference\/#primaryimage","url":"https:\/\/www.iese.fraunhofer.de\/blog\/wp-content\/uploads\/2022\/09\/iStock-483804887.jpg","contentUrl":"https:\/\/www.iese.fraunhofer.de\/blog\/wp-content\/uploads\/2022\/09\/iStock-483804887.jpg","width":800,"height":600,"caption":"3D illustration."},{"@type":"BreadcrumbList","@id":"https:\/\/www.iese.fraunhofer.de\/blog\/causal-inference\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Startseite","item":"https:\/\/www.iese.fraunhofer.de\/blog\/"},{"@type":"ListItem","position":2,"name":"Causal inference: An introduction on how to separate causal effects from spurious correlations in data"}]},{"@type":"WebSite","@id":"https:\/\/www.iese.fraunhofer.de\/blog\/#website","url":"https:\/\/www.iese.fraunhofer.de\/blog\/","name":"Fraunhofer IESE","description":"Blog des Fraunhofer-Institut f\u00fcr Experimentelles Software Engineering","publisher":{"@id":"https:\/\/www.iese.fraunhofer.de\/blog\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/www.iese.fraunhofer.de\/blog\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"de"},{"@type":"Organization","@id":"https:\/\/www.iese.fraunhofer.de\/blog\/#organization","name":"Fraunhofer IESE","url":"https:\/\/www.iese.fraunhofer.de\/blog\/","logo":{"@type":"ImageObject","inLanguage":"de","@id":"https:\/\/www.iese.fraunhofer.de\/blog\/#\/schema\/logo\/image\/","url":"https:\/\/www.iese.fraunhofer.de\/blog\/wp-content\/uploads\/2016\/08\/fhg_iese_logo.png","contentUrl":"https:\/\/www.iese.fraunhofer.de\/blog\/wp-content\/uploads\/2016\/08\/fhg_iese_logo.png","width":183,"height":50,"caption":"Fraunhofer IESE"},"image":{"@id":"https:\/\/www.iese.fraunhofer.de\/blog\/#\/schema\/logo\/image\/"},"sameAs":["https:\/\/www.facebook.com\/FraunhoferIESE\/","https:\/\/x.com\/FraunhoferIESE","https:\/\/www.linkedin.com\/company\/fraunhoferiese\/","https:\/\/www.youtube.com\/c\/FraunhoferIESE"]},{"@type":"Person","@id":"https:\/\/www.iese.fraunhofer.de\/blog\/#\/schema\/person\/96d3d2ce62a13a8ee470ea948601ff28","name":"Dr. Julien Siebert","image":{"@type":"ImageObject","inLanguage":"de","@id":"https:\/\/www.iese.fraunhofer.de\/blog\/wp-content\/uploads\/2020\/10\/siebert_julien_8513_blog-96x96.jpg299e210bdf5631c16552ac66c10e3e56","url":"https:\/\/www.iese.fraunhofer.de\/blog\/wp-content\/uploads\/2020\/10\/siebert_julien_8513_blog-96x96.jpg","contentUrl":"https:\/\/www.iese.fraunhofer.de\/blog\/wp-content\/uploads\/2020\/10\/siebert_julien_8513_blog-96x96.jpg","caption":"Dr. Julien Siebert"},"description":"Julien Siebert is working as senior AI Expert in the Data Science department of Fraunhofer IESE. He studied Artificial Intelligence and Engineering Science and got his PhD in Computer Science. His professional interests include data science processes, artificial intelligence and complex systems.","url":"https:\/\/www.iese.fraunhofer.de\/blog\/author\/julien-siebert\/"}]}},"jetpack_featured_media_url":"https:\/\/www.iese.fraunhofer.de\/blog\/wp-content\/uploads\/2022\/09\/iStock-483804887.jpg","jetpack_sharing_enabled":true,"_links":{"self":[{"href":"https:\/\/www.iese.fraunhofer.de\/blog\/wp-json\/wp\/v2\/posts\/9576","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.iese.fraunhofer.de\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.iese.fraunhofer.de\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.iese.fraunhofer.de\/blog\/wp-json\/wp\/v2\/users\/66"}],"replies":[{"embeddable":true,"href":"https:\/\/www.iese.fraunhofer.de\/blog\/wp-json\/wp\/v2\/comments?post=9576"}],"version-history":[{"count":45,"href":"https:\/\/www.iese.fraunhofer.de\/blog\/wp-json\/wp\/v2\/posts\/9576\/revisions"}],"predecessor-version":[{"id":11811,"href":"https:\/\/www.iese.fraunhofer.de\/blog\/wp-json\/wp\/v2\/posts\/9576\/revisions\/11811"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.iese.fraunhofer.de\/blog\/wp-json\/wp\/v2\/media\/9629"}],"wp:attachment":[{"href":"https:\/\/www.iese.fraunhofer.de\/blog\/wp-json\/wp\/v2\/media?parent=9576"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.iese.fraunhofer.de\/blog\/wp-json\/wp\/v2\/categories?post=9576"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.iese.fraunhofer.de\/blog\/wp-json\/wp\/v2\/tags?post=9576"},{"taxonomy":"author","embeddable":true,"href":"https:\/\/www.iese.fraunhofer.de\/blog\/wp-json\/wp\/v2\/coauthors?post=9576"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}