更改

添加12,225字节、 2021年5月25日 (二) 15:18

Moved page from wikipedia:en:Rubin causal model (history)

此词条暂由彩云小译翻译，翻译字数共430，未经人工整理和审校，带来阅读不便，请见谅。

{{short description|Method of statistical analysis}}

{{Format footnotes|date=February 2021}}

The '''Rubin causal model''' ('''RCM'''), also known as the '''Neyman–Rubin causal model''',<ref name="sekhon">{{cite book |last=Sekhon |first=Jasjeet |chapter=The Neyman–Rubin Model of Causal Inference and Estimation via Matching Methods |title=The Oxford Handbook of Political Methodology |year=2007 |chapter-url=http://sekhon.berkeley.edu/papers/SekhonOxfordHandbook.pdf }}</ref> is an approach to the [[statistical analysis]] of [[Causality|cause and effect]] based on the [[Conceptual framework|framework]] of [[counterfactual conditional|potential outcomes]], named after [[Donald Rubin]]. The name "Rubin causal model" was first coined by [[Paul W. Holland]].<ref name="holland:causal86">{{cite journal |last=Holland |first=Paul W. |title=Statistics and Causal Inference |journal=[[Journal of the American Statistical Association|J. Amer. Statist. Assoc.]] |volume=81 |issue=396 |year=1986 |pages=945–960 |jstor=2289064 |doi=10.1080/01621459.1986.10478354}}</ref> The potential outcomes framework was first proposed by [[Jerzy Neyman]] in his 1923 Master's thesis,<ref name="neyman:masters">Neyman, Jerzy. ''Sur les applications de la theorie des probabilites aux experiences agricoles: Essai des principes.'' Master's Thesis (1923). Excerpts reprinted in English, Statistical Science, Vol. 5, pp. 463–472. ([[Dorota Dabrowska|D. M. Dabrowska]], and T. P. Speed, Translators.)</ref> though he discussed it only in the context of completely randomized experiments.<ref name="Jasa1">{{cite journal |last=Rubin |first=Donald |year=2005 |title=Causal Inference Using Potential Outcomes |journal=[[Journal of the American Statistical Association|J. Amer. Statist. Assoc.]] |volume=100 |issue=469 |pages=322–331 |doi=10.1198/016214504000001880 }}</ref> Rubin extended it into a general framework for thinking about causation in both observational and experimental studies.<ref name="sekhon"/>

The Rubin causal model (RCM), also known as the Neyman–Rubin causal model, is an approach to the statistical analysis of cause and effect based on the framework of potential outcomes, named after Donald Rubin. The name "Rubin causal model" was first coined by Paul W. Holland. The potential outcomes framework was first proposed by Jerzy Neyman in his 1923 Master's thesis, though he discussed it only in the context of completely randomized experiments. Rubin extended it into a general framework for thinking about causation in both observational and experimental studies. A randomized experiment assigns people randomly to treatments: college or no college. Because of this random assignment, the groups are (on average) equivalent, and the difference in income at age 40 can be attributed to the college assignment since that was the only difference between the groups. An estimate of the average causal effect (also referred to as the average treatment effect) can then be obtained by computing the difference in means between the treated (college-attending) and control (not-college-attending) samples.

虚拟事实模型分析法，也称为 Neyman-虚拟事实模型分析法，是一种基于潜在结果框架的因果统计分析方法，以 Donald Rubin 的名字命名。虚拟事实模型的名字是由 Paul w. Holland 首创的。潜在结果框架最早是由 Jerzy Neyman 在他1923年的硕士论文中提出的，尽管他只是在完全随机化实验的背景下讨论它。鲁宾把它扩展到一个普遍的框架，用来思考观察和实验研究中的因果关系。一个随机实验随机分配人参加治疗: 上大学或不上大学。由于这种随机分配，这些群体(平均)是相等的，40岁时的收入差异可以归因于大学分配，因为这是这些群体之间唯一的差异。平均因果效应(也称为平均治疗效应)的估计可以通过计算治疗(就读大学)和对照(非就读大学)样本之间的平均值差异来获得。

==Introduction==

In many circumstances, however, randomized experiments are not possible due to ethical or practical concerns. In such scenarios there is a non-random assignment mechanism. This is the case for the example of college attendance: people are not randomly assigned to attend college. Rather, people may choose to attend college based on their financial situation, parents' education, and so on. Many statistical methods have been developed for causal inference, such as propensity score matching. These methods attempt to correct for the assignment mechanism by finding control units similar to treatment units.

然而，在许多情况下，由于伦理或实际的考虑，随机试验是不可能的。在这种情况下，有一个非随机分配机制。这就是大学出勤率的例子: 人们并不是随机分配到大学的。相反，人们可能会根据自己的经济状况、父母的教育程度等因素选择上大学。许多因果推断的统计方法已经被开发出来，比如倾向评分匹配。这些方法试图通过寻找类似于处理单元的控制单元来纠正分配机制。

The Rubin causal model is based on the idea of potential outcomes. For example, a person would have a particular income at age 40 if he had attended college, whereas he would have a different income at age 40 if he had not attended college. To measure the causal effect of going to college for this person, we need to compare the outcome for the same individual in both alternative futures. Since it is impossible to see both potential outcomes at once, one of the potential outcomes is always missing. This dilemma is the "fundamental problem of [[causal inference]]".

Because of the fundamental problem of causal inference, unit-level causal effects cannot be directly observed. However, randomized experiments allow for the estimation of population-level causal effects.<ref name=":01">{{cite journal |last=Rubin |first=Donald |title=Estimating Causal Effects of Treatments in Randomized and Nonrandomized Studies |journal=[[Journal of Educational Psychology|J. Educ. Psychol.]] |volume=66 |issue=5 |year=1974 |pages=688–701 [p. 689] |doi=10.1037/h0037350 }}</ref> A randomized experiment assigns people randomly to treatments: college or no college. Because of this random assignment, the groups are (on average) equivalent, and the difference in income at age 40 can be attributed to the college assignment since that was the only difference between the groups. An estimate of the '''average causal effect''' (also referred to as the '''[[average treatment effect]]''') can then be obtained by computing the difference in means between the treated (college-attending) and control (not-college-attending) samples.

Rubin defines a causal effect:

鲁宾定义了一种因果效应:

In many circumstances, however, randomized experiments are not possible due to ethical or practical concerns. In such scenarios there is a non-random assignment mechanism. This is the case for the example of college attendance: people are not randomly assigned to attend college. Rather, people may choose to attend college based on their financial situation, parents' education, and so on. Many statistical methods have been developed for causal inference, such as [[propensity score matching]]. These methods attempt to correct for the assignment mechanism by finding control units similar to treatment units.

<blockquote>

< 封锁报价 >

Intuitively, the causal effect of one treatment, E, over another, C, for a particular unit and an interval of time from <math>t_1</math> to <math>t_2</math> is the difference between what would have happened at time <math>t_2</math> if the unit had been exposed to E initiated at <math>t_1</math> and what would have happened at <math>t_2</math> if the unit had been exposed to C initiated at <math>t_1</math>: 'If an hour ago I had taken two aspirins instead of just a glass of water, my headache would now be gone,' or 'because an hour ago I took two aspirins instead of just a glass of water, my headache is now gone.' Our definition of the causal effect of the E versus C treatment will reflect this intuitive meaning." and other techniques for causal inference. For more on the connections between the Rubin causal model, structural equation modeling, and other statistical methods for causal inference, see Morgan and Winship (2007).

直观上，一种治疗方法 e 对另一种治疗方法 c 的因果关系,对于一个特定的单位和一段时间间隔，如果这个单位在 < math > t _ 1 </math > 到 < math > t _ 2 </math > 之间暴露于 e，那么在 < math > t _ 1 </math > 之前会发生什么，如果这个单位在 < math > t _ 1 </math > 之前暴露于 c，那么在 < math > t _ 2 </math > 之前会发生什么，如果一个小时之前我吃了两片阿司匹林而不是一杯水，我的头痛现在就会消失，或者因为一小时前我吃了两片阿司匹林而不是一杯水，现在我的头痛好了我们对 e 与 c 治疗的因果关系的定义将反映这一直观意义。”以及其他因果推理技术。要了解更多关于虚拟事实模型、结构方程模型和其他因果推断统计方法之间的联系，请参见 Morgan 和 Winship (2007)。

==An extended example==

Rubin defines a causal effect:

<blockquote>

Intuitively, the causal effect of one treatment, E, over another, C, for a particular unit and an interval of time from <math>t_1</math> to <math>t_2</math> is the difference between what would have happened at time <math>t_2</math> if the unit had been exposed to E initiated at <math>t_1</math> and what would have happened at <math>t_2</math> if the unit had been exposed to C initiated at <math>t_1</math>: 'If an hour ago I had taken two aspirins instead of just a glass of water, my headache would now be gone,' or 'because an hour ago I took two aspirins instead of just a glass of water, my headache is now gone.' Our definition of the causal effect of the E versus C treatment will reflect this intuitive meaning."<ref name=":01"/>

</blockquote>

According to the RCM, the causal effect of your taking or not taking aspirin one hour ago is the difference between how your head would have felt in case 1 (taking the aspirin) and case 2 (not taking the aspirin). If your headache would remain without aspirin but disappear if you took aspirin, then the causal effect of taking aspirin is headache relief. In most circumstances, we are interested in comparing two futures, one generally termed "treatment" and the other "control". These labels are somewhat arbitrary.

===Potential outcomes===

Suppose that Joe is participating in an FDA test for a new hypertension drug. If we were omniscient, we would know the outcomes for Joe under both treatment (the new drug) and control (either no treatment or the current standard treatment). The causal effect, or treatment effect, is the difference between these two potential outcomes.

{| class="wikitable" align="center"

! subject !! <math>Y_t(u)</math> !! <math>Y_c(u)</math> !! <math>Y_t(u) - Y_c(u)</math>

|-

! Joe

Category:Causal inference

类别: 因果推理

|130 || 135 || −5

Category:Statistical models

类别: 统计模型

|}

Category:Econometric models

类别: 计量经济学模型

Category:Observational study

类别: 观察性研究

<math>Y_t(u)</math> is Joe's [[blood pressure]] if he takes the new pill. In general, this notation expresses the potential outcome which results from a treatment, ''t'', on a unit, ''u''. Similarly, <math>Y_c(u)</math> is the effect of a different treatment, ''c'' or control, on a unit, ''u''. In this case, <math>Y_c(u)</math> is Joe's blood pressure if he doesn't take the pill. <math>Y_t(u) - Y_c(u)</math> is the causal effect of taking the new drug.

Category:Experiments

分类: 实验

<noinclude>

<small>This page was moved from [[wikipedia:en:Rubin causal model]]. Its edit history can be viewed at [[鲁宾因果框架/edithistory]]</small></noinclude>

[[Category:待整理页面]]

Moonscar

管理员

1,592

个编辑

更改

鲁宾因果模型 (查看源代码)

2021年5月25日 (二) 15:18的版本