“匹配”的版本间的差异

2021年5月28日 (五) 10:57的版本

此词条暂由彩云小译翻译，翻译字数共400，未经人工整理和审校，带来阅读不便，请见谅。

Matching is a statistical technique which is used to evaluate the effect of a treatment by comparing the treated and the non-treated units in an observational study or quasi-experiment (i.e. when the treatment is not randomly assigned). The goal of matching is, for every treated unit, to find one (or more) non-treated unit(s) with similar observable characteristics against whom the effect of the treatment can be assessed. By matching treated units to similar non-treated units, matching enables a comparison of outcomes among treated and non-treated units to estimate the effect of the treatment reducing bias due to confounding.^[1]^[2]^[3] Propensity score matching, an early matching technique, was developed as part of the Rubin causal model,^[4] but has been shown to increase model dependence, bias, inefficiency, and power and is no longer recommended compared to other matching methods.^[5]

Matching is a statistical technique which is used to evaluate the effect of a treatment by comparing the treated and the non-treated units in an observational study or quasi-experiment (i.e. when the treatment is not randomly assigned). The goal of matching is, for every treated unit, to find one (or more) non-treated unit(s) with similar observable characteristics against whom the effect of the treatment can be assessed. By matching treated units to similar non-treated units, matching enables a comparison of outcomes among treated and non-treated units to estimate the effect of the treatment reducing bias due to confounding. Propensity score matching, an early matching technique, was developed as part of the Rubin causal model, but has been shown to increase model dependence, bias, inefficiency, and power and is no longer recommended compared to other matching methods.

匹配是一种统计方法，在观察研究或准实验研究（即）内经处理及未经处理的单位，以评估处理的效果。。匹配的目标是，对于每个治疗单位，找到一个(或多个)非治疗单位具有类似的可观察特征，对照治疗的效果可以进行评估。通过将治疗单位与类似的非治疗单位进行匹配，匹配可以使治疗单位与非治疗单位之间的结果进行比较，从而评估治疗减少混杂造成的偏倚的效果。早期的匹配技术---- 倾向评分匹配匹配技术，是作为虚拟事实模型匹配技术的一部分而发展起来的，但已经被证明会增加模型依赖性、偏倚、低效和功率，与其他匹配方法相比不再被推荐使用。

Matching has been promoted by Donald Rubin.^[4] It was prominently criticized in economics by LaLonde (1986),^[6] who compared estimates of treatment effects from an experiment to comparable estimates produced with matching methods and showed that matching methods are biased. Dehejia and Wahba (1999) reevaluated LaLonde's critique and showed that matching is a good solution.^[7] Similar critiques have been raised in political science^[8] and sociology^[9] journals.

When the outcome of interest is binary, the most general tool for the analysis of matched data is conditional logistic regression as it handles strata of arbitrary size and continuous or binary treatments (predictors) and can control for covariates. In particular cases, simpler tests like paired difference test, McNemar test and Cochran-Mantel-Haenszel test are available.

当感兴趣的结果是二进制的时候，分析匹配数据最常用的工具是条件 Logit模型，因为它可以处理任意大小的层次和连续或二进制处理(谓词) ，并且可以控制协变量。在特定情况下，可以使用配对差异检验、 McNemar 检验和 cochran-mantel-haenzel 检验等更简单的检验。

Analysis

When the outcome of interest is continuous, estimation of the average treatment effect is performed.

当感兴趣的结果是连续的，估计的平均治疗效果进行。

When the outcome of interest is binary, the most general tool for the analysis of matched data is conditional logistic regression as it handles strata of arbitrary size and continuous or binary treatments (predictors) and can control for covariates. In particular cases, simpler tests like paired difference test, McNemar test and Cochran-Mantel-Haenszel test are available.

Matching can also be used to "pre-process" a sample before analysis via another technique, such as regression analysis.

匹配也可以用于“预处理”样品，然后再通过另一种技术进行分析，例如回归分析分析。

When the outcome of interest is continuous, estimation of the average treatment effect is performed.

Matching can also be used to "pre-process" a sample before analysis via another technique, such as regression analysis.^[10]

Overmatching is matching for an apparent mediator that actually is a result of the exposure. If the mediator itself is stratified, an obscured relation of the exposure to the disease would highly be likely to be induced.

过匹配是匹配的一个明显的中介，实际上是一个结果的曝光。如果中介者本身是分层的，则很可能诱发一种模糊的疾病暴露关系。

Overmatching

For example, matching the control group by gestation length and/or the number of multiple births when estimating perinatal mortality and birthweight after in vitro fertilization (IVF) is overmatching, since IVF itself increases the risk of premature birth and multiple birth.

例如，在估算新生儿死亡和体外受精后出生体重时，按妊娠期长度和/或多胎次数与对照组相匹配是过度匹配的，因为体外受精本身会增加早产和多胎的风险。

Overmatching is matching for an apparent mediator that actually is a result of the exposure. If the mediator itself is stratified, an obscured relation of the exposure to the disease would highly be likely to be induced.^[11] Overmatching thus causes statistical bias.^[11]

It may be regarded as a sampling bias in decreasing the external validity of a study, because the controls become more similar to the cases in regard to exposure than the general population.

它可以被看作是一个减少研究外部效度的抽样偏差，因为在暴露方面，控制变得比一般人群更加类似于案例。

For example, matching the control group by gestation length and/or the number of multiple births when estimating perinatal mortality and birthweight after in vitro fertilization (IVF) is overmatching, since IVF itself increases the risk of premature birth and multiple birth.^[12]

It may be regarded as a sampling bias in decreasing the external validity of a study, because the controls become more similar to the cases in regard to exposure than the general population.

References

↑ Rubin, Donald B. (1973). "Matching to Remove Bias in Observational Studies". Biometrics. 29 (1): 159–183. doi:10.2307/2529684. JSTOR 2529684.
↑ Anderson, Dallas W.; Kish, Leslie; Cornell, Richard G. (1980). "On Stratification, Grouping and Matching". Scandinavian Journal of Statistics. 7 (2): 61–66. JSTOR 4615774.
↑ Kupper, Lawrence L.; Karon, John M.; Kleinbaum, David G.; Morgenstern, Hal; Lewis, Donald K. (1981). "Matching in Epidemiologic Studies: Validity and Efficiency Considerations". Biometrics. 37 (2): 271–291. CiteSeerX 10.1.1.154.1197. doi:10.2307/2530417. JSTOR 2530417. PMID 7272415.
↑ ^4.0 ^4.1 Rosenbaum, Paul R.; Rubin, Donald B. (1983). "The Central Role of the Propensity Score in Observational Studies for Causal Effects". Biometrika. 70 (1): 41–55. doi:10.1093/biomet/70.1.41.
↑ King, Gary; Nielsen, Richard (October 2019). "Why Propensity Scores Should Not Be Used for Matching". Political Analysis (in English). 27 (4): 435–454. doi:10.1017/pan.2019.11. ISSN 1047-1987.
↑ Matching has been promoted by Donald Rubin. who compared estimates of treatment effects from an experiment to comparable estimates produced with matching methods and showed that matching methods are biased. Dehejia and Wahba (1999) reevaluated LaLonde's critique and showed that matching is a good solution. Similar critiques have been raised in political science and sociology journals. 配对活动由唐纳德•鲁宾(Donald Rubin)推动。他们将实验中的治疗效果评估与匹配方法产生的可比评估进行了比较，并表明匹配方法是有偏差的。德赫加和瓦巴(1999)重新评价了拉隆德的批评，并指出匹配是一个很好的解决方案。类似的批评在政治学和社会学期刊上也被提出。 LaLonde, Robert J. (1986). "Evaluating the Econometric Evaluations of Training Programs with Experimental Data". American Economic Review. 76 (4): 604–620. JSTOR 1806062.
↑ Dehejia, R. H.; Wahba, S. (1999). "Causal Effects in Nonexperimental Studies: Reevaluating the Evaluation of Training Programs" (PDF). Journal of the American Statistical Association. 94 (448): 1053–1062. doi:10.1080/01621459.1999.10473858.
↑ Arceneaux, Kevin; Gerber, Alan S.; Green, Donald P. (2006). "Comparing Experimental and Matching Methods Using a Large-Scale Field Experiment on Voter Mobilization". Political Analysis. 14 (1): 37–62. doi:10.1093/pan/mpj001.
↑ Arceneaux, Kevin; Gerber, Alan S.; Green, Donald P. (2010). "A Cautionary Note on the Use of Matching to Estimate Causal Effects: An Empirical Example Comparing Matching Estimates to an Experimental Benchmark". Sociological Methods & Research. 39 (2): 256–282. doi:10.1177/0049124110378098. S2CID 37012563.
↑ Ho, Daniel E.; Imai, Kosuke; King, Gary; Stuart, Elizabeth A. (2007). "Matching as Nonparametric Preprocessing for Reducing Model Dependence in Parametric Causal Inference". Political Analysis. 15 (3): 199–236. doi:10.1093/pan/mpl013.
↑ ^11.0 ^11.1 Marsh, J. L.; Hutton, J. L.; Binks, K. (2002). "Removal of radiation dose response effects: an example of over-matching". British Medical Journal. 325 (7359): 327–330. doi:10.1136/bmj.325.7359.327. PMC 1123834. PMID 12169512.
↑ Gissler, M.; Hemminki, E. (1996). "The danger of overmatching in studies of the perinatal mortality and birthweight of infants born after assisted conception". Eur J Obstet Gynecol Reprod Biol. 69 (2): 73–75. doi:10.1016/0301-2115(95)02517-0. PMID 8902436.

2021年5月25日 (二) 15:20的版本（查看源代码） Moonscar（讨论 \| 贡献）（Moved page from wikipedia:en:Matching (statistics) (history)）		2021年5月28日 (五) 10:57的版本（查看源代码） Frombtotop（讨论 \| 贡献）下一编辑→
第5行：		第5行：
	Matching is a statistical technique which is used to evaluate the effect of a treatment by comparing the treated and the non-treated units in an observational study or quasi-experiment (i.e. when the treatment is not randomly assigned). The goal of matching is, for every treated unit, to find one (or more) non-treated unit(s) with similar observable characteristics against whom the effect of the treatment can be assessed. By matching treated units to similar non-treated units, matching enables a comparison of outcomes among treated and non-treated units to estimate the effect of the treatment reducing bias due to confounding. Propensity score matching, an early matching technique, was developed as part of the Rubin causal model, but has been shown to increase model dependence, bias, inefficiency, and power and is no longer recommended compared to other matching methods.		Matching is a statistical technique which is used to evaluate the effect of a treatment by comparing the treated and the non-treated units in an observational study or quasi-experiment (i.e. when the treatment is not randomly assigned). The goal of matching is, for every treated unit, to find one (or more) non-treated unit(s) with similar observable characteristics against whom the effect of the treatment can be assessed. By matching treated units to similar non-treated units, matching enables a comparison of outcomes among treated and non-treated units to estimate the effect of the treatment reducing bias due to confounding. Propensity score matching, an early matching technique, was developed as part of the Rubin causal model, but has been shown to increase model dependence, bias, inefficiency, and power and is no longer recommended compared to other matching methods.

−	配对是一项统计技术，用以比较观察性研究或准实验内经处理及未经处理的单位，以评估处理的效果。当治疗不是随机分配的)。匹配的目标是，对于每个治疗单位，找到一个(或多个)非治疗单位具有类似的可观察特征，对照治疗的效果可以进行评估。通过将治疗单位与类似的非治疗单位进行匹配，匹配可以使治疗单位与非治疗单位之间的结果进行比较，从而评估治疗减少混杂造成的偏倚的效果。早期的匹配技术---- 倾向评分匹配匹配技术，是作为虚拟事实模型匹配技术的一部分而发展起来的，但已经被证明会增加模型依赖性、偏倚、低效和功率，与其他匹配方法相比不再被推荐使用。	+	匹配是一种统计方法，在观察研究或准实验研究（即）内经处理及未经处理的单位，以评估处理的效果。。匹配的目标是，对于每个治疗单位，找到一个(或多个)非治疗单位具有类似的可观察特征，对照治疗的效果可以进行评估。通过将治疗单位与类似的非治疗单位进行匹配，匹配可以使治疗单位与非治疗单位之间的结果进行比较，从而评估治疗减少混杂造成的偏倚的效果。早期的匹配技术---- 倾向评分匹配匹配技术，是作为虚拟事实模型匹配技术的一部分而发展起来的，但已经被证明会增加模型依赖性、偏倚、低效和功率，与其他匹配方法相比不再被推荐使用。

“匹配”的版本间的差异

2021年5月28日 (五) 10:57的版本

目录

Analysis

Overmatching

See also

References

Further reading

导航菜单

搜索