“匹配”的版本间的差异

2021年6月27日 (日) 10:06的版本

作为一种统计技术，匹配 Matching通过在观察研究 Observational Study或准实验研究 Quasi-experiment（即 处理 Treatment 是非随机分配的）中比较已处理和未处理的单元，以评估处理的效果。匹配的目标是，对于每个处理单元，找到一个(或多个)具有相似可观察特征的未处理单元，以评估处理效果。通过处理单元与相似未处理单元的匹配，匹配技术可以比较处理单元与未处理单元的不同结果，从而评估处理效应，减少混杂效应带来的偏差。^[1]^[2]^[3] 倾向值匹配 Propensity Score Matching，一种早期的匹配技术，是作为 鲁宾因果模型 Rubin Causal Model^[4]的一部分发展起来的，但已被证明会增加模型依赖性、偏差、无效性和 计算量 power ，与其他匹配方法相比不再推荐使用。^[5]

匹配由 唐纳德•鲁宾 Donald Rubin ^[4]推动，在经济学中主要受到 拉隆德 LaLonde（1986）^[6]的批评。LaLonde比较了一个实验中的处理效果估计和运用匹配方法产生的可比估计，表明匹配方法是有偏的。 德赫加和瓦巴 Dehejia and Wahba (1999)重新评估了LaLonde的批评，并指出匹配是一个很好的解决方案。^[7]政治学^[8]和社会学期刊^[9]上也提出了类似的批评。

分析

当感兴趣的结果是二元变量时，分析匹配数据最常用的工具是条件Logistic回归模型，因为它可以处理 任意大小的层次和连续或二元处理变量(自变量) strata of arbitrary size and continuous or binary treatments (predictors) ，并且可以控制协变量。在特定情况下，可以使用 配对差异检验 paired difference test、 McNemar 检验和 Cochran-Mantel-Haenzel 检验等更简单的检验。

当感兴趣的结果是连续的，对平均处理效 Average Treatment Effect 应进行估计。

匹配也可用于在通过其他技术分析之前（例如回归分析）“预处理”样本。^[10]

过匹配

过匹配是对表面是中介变量、实际上是暴露的结果进行匹配。如果中介变量本身是分层的，则很可能引致一种暴露与疾病的令人费解的关系。^[11] 过匹配因此导致统计偏误。^[11]

例如，在估计体外受精（IVF）后的围产期死亡率和出生体重时，按妊娠期和/或多胎数来匹配对照组就是过度匹配，因为IVF本身会增加早产和多胎的风险。^[12]

它可以被看作是一个降低研究外部效度的抽样偏误，因为相比一般人群，对照组在暴露方面变得更类似于病例。

另见

倾向得分匹配

参考文献

↑ Rubin, Donald B. (1973). "Matching to Remove Bias in Observational Studies". Biometrics. 29 (1): 159–183. doi:10.2307/2529684. JSTOR 2529684.
↑ Anderson, Dallas W.; Kish, Leslie; Cornell, Richard G. (1980). "On Stratification, Grouping and Matching". Scandinavian Journal of Statistics. 7 (2): 61–66. JSTOR 4615774.
↑ Kupper, Lawrence L.; Karon, John M.; Kleinbaum, David G.; Morgenstern, Hal; Lewis, Donald K. (1981). "Matching in Epidemiologic Studies: Validity and Efficiency Considerations". Biometrics. 37 (2): 271–291. CiteSeerX 10.1.1.154.1197. doi:10.2307/2530417. JSTOR 2530417. PMID 7272415.
↑ ^4.0 ^4.1 Rosenbaum, Paul R.; Rubin, Donald B. (1983). "The Central Role of the Propensity Score in Observational Studies for Causal Effects". Biometrika. 70 (1): 41–55. doi:10.1093/biomet/70.1.41.
↑ King, Gary; Nielsen, Richard (October 2019). "Why Propensity Scores Should Not Be Used for Matching". Political Analysis (in English). 27 (4): 435–454. doi:10.1017/pan.2019.11. ISSN 1047-1987.
↑ LaLonde, Robert J. (1986). "Evaluating the Econometric Evaluations of Training Programs with Experimental Data". American Economic Review. 76 (4): 604–620. JSTOR 1806062.
↑ Dehejia, R. H.; Wahba, S. (1999). "Causal Effects in Nonexperimental Studies: Reevaluating the Evaluation of Training Programs" (PDF). Journal of the American Statistical Association. 94 (448): 1053–1062. doi:10.1080/01621459.1999.10473858.
↑ Arceneaux, Kevin; Gerber, Alan S.; Green, Donald P. (2006). "Comparing Experimental and Matching Methods Using a Large-Scale Field Experiment on Voter Mobilization". Political Analysis. 14 (1): 37–62. doi:10.1093/pan/mpj001.
↑ Arceneaux, Kevin; Gerber, Alan S.; Green, Donald P. (2010). "A Cautionary Note on the Use of Matching to Estimate Causal Effects: An Empirical Example Comparing Matching Estimates to an Experimental Benchmark". Sociological Methods & Research. 39 (2): 256–282. doi:10.1177/0049124110378098.
↑ Ho, Daniel E.; Imai, Kosuke; King, Gary; Stuart, Elizabeth A. (2007). "Matching as Nonparametric Preprocessing for Reducing Model Dependence in Parametric Causal Inference". Political Analysis. 15 (3): 199–236. doi:10.1093/pan/mpl013.
↑ ^11.0 ^11.1 Marsh, J. L.; Hutton, J. L.; Binks, K. (2002). "Removal of radiation dose response effects: an example of over-matching". British Medical Journal. 325 (7359): 327–330. doi:10.1136/bmj.325.7359.327. PMC 1123834. PMID 12169512.
↑ Gissler, M.; Hemminki, E. (1996). "The danger of overmatching in studies of the perinatal mortality and birthweight of infants born after assisted conception". Eur J Obstet Gynecol Reprod Biol. 69 (2): 73–75. doi:10.1016/0301-2115(95)02517-0. PMID 8902436.

进一步阅读

Angrist, Joshua D.; Pischke, Jörn-Steffen (2009). "Regression Meets Matching". Mostly Harmless Econometrics: An Empiricist's Companion. Princeton University Press. pp. 69–80. ISBN 978-0-691-12034-8.

编者推荐

本中文词条由Frombtotop翻译，薄荷编辑，欢迎在讨论页面留言。

本词条内容源自wikipedia及公开资料，遵守 CC3.0协议。

[1] Rubin, Donald B. (1973). "Matching to Remove Bias in Observational Studies". Biometrics. 29 (1): 159–183. doi:10.2307/2529684. JSTOR 2529684.

[2] Anderson, Dallas W.; Kish, Leslie; Cornell, Richard G. (1980). "On Stratification, Grouping and Matching". Scandinavian Journal of Statistics. 7 (2): 61–66. JSTOR 4615774.

[3] Kupper, Lawrence L.; Karon, John M.; Kleinbaum, David G.; Morgenstern, Hal; Lewis, Donald K. (1981). "Matching in Epidemiologic Studies: Validity and Efficiency Considerations". Biometrics. 37 (2): 271–291. CiteSeerX 10.1.1.154.1197. doi:10.2307/2530417. JSTOR 2530417. PMID 7272415.

[Rosenbaum_Rubin-4] 4.0 ^4.1 Rosenbaum, Paul R.; Rubin, Donald B. (1983). "The Central Role of the Propensity Score in Observational Studies for Causal Effects". Biometrika. 70 (1): 41–55. doi:10.1093/biomet/70.1.41.

[5] King, Gary; Nielsen, Richard (October 2019). "Why Propensity Scores Should Not Be Used for Matching". Political Analysis (in English). 27 (4): 435–454. doi:10.1017/pan.2019.11. ISSN 1047-1987.

[6] LaLonde, Robert J. (1986). "Evaluating the Econometric Evaluations of Training Programs with Experimental Data". American Economic Review. 76 (4): 604–620. JSTOR 1806062.

[7] Dehejia, R. H.; Wahba, S. (1999). "Causal Effects in Nonexperimental Studies: Reevaluating the Evaluation of Training Programs" (PDF). Journal of the American Statistical Association. 94 (448): 1053–1062. doi:10.1080/01621459.1999.10473858.

[8] Arceneaux, Kevin; Gerber, Alan S.; Green, Donald P. (2006). "Comparing Experimental and Matching Methods Using a Large-Scale Field Experiment on Voter Mobilization". Political Analysis. 14 (1): 37–62. doi:10.1093/pan/mpj001.

[9] Arceneaux, Kevin; Gerber, Alan S.; Green, Donald P. (2010). "A Cautionary Note on the Use of Matching to Estimate Causal Effects: An Empirical Example Comparing Matching Estimates to an Experimental Benchmark". Sociological Methods & Research. 39 (2): 256–282. doi:10.1177/0049124110378098.

[10] Ho, Daniel E.; Imai, Kosuke; King, Gary; Stuart, Elizabeth A. (2007). "Matching as Nonparametric Preprocessing for Reducing Model Dependence in Parametric Causal Inference". Political Analysis. 15 (3): 199–236. doi:10.1093/pan/mpl013.

[marsh-11] 11.0 ^11.1 Marsh, J. L.; Hutton, J. L.; Binks, K. (2002). "Removal of radiation dose response effects: an example of over-matching". British Medical Journal. 325 (7359): 327–330. doi:10.1136/bmj.325.7359.327. PMC 1123834. PMID 12169512.

[12] Gissler, M.; Hemminki, E. (1996). "The danger of overmatching in studies of the perinatal mortality and birthweight of infants born after assisted conception". Eur J Obstet Gynecol Reprod Biol. 69 (2): 73–75. doi:10.1016/0301-2115(95)02517-0. PMID 8902436.

[1]

[2]

[3]

[4]

[5]

[6]

[7]

[8]

[9]

[10]

[11]

[12]

@@ 第3行： / 第3行： @@
 |description='通过在观察研究或准实验研究中比较已处理和未处理的单元，以评估处理的效果
 }}
-作为一种统计技术，'''<font color="#ff8000"> 匹配 Matching</font>'''通过在'''<font color="#ff8000"> 观察研究 Observational Study</font>'''或'''<font color="#ff8000"> 准实验研究 Quasi-experiment</font>'''（即'''<font color="#ff8000"> 处理 Treatment </font>'''是非随机分配的）中比较已处理和未处理的单元，以评估处理的效果。匹配的目标是，对于每个处理单元，找到一个(或多个)具有相似可观察特征的未处理单元，以评估处理效果。通过处理单元与相似未处理单元的匹配，匹配技术可以比较处理单元与未处理单元的不同结果，从而评估处理效应，减少混杂效应带来的偏差。<ref>{{cite journal | doi=10.2307/2529684 | last=Rubin | first=Donald B. | title=Matching to Remove Bias in Observational Studies | journal=Biometrics | volume=29 | issue=1 | year=1973 | pages=159–183 | jstor=2529684}}</ref><ref>{{cite journal | title=On Stratification, Grouping and Matching | last=Anderson | first=Dallas W. |author2=Kish, Leslie |author3=Cornell, Richard G.  | journal=Scandinavian Journal of Statistics | volume=7 | issue=2 | year=1980 | pages=61–66 | jstor=4615774}}</ref><ref>{{cite journal | doi=10.2307/2530417 | title=Matching in Epidemiologic Studies: Validity and Efficiency Considerations | last=Kupper | first=Lawrence L. |author2=Karon, John M. |author3=Kleinbaum, David G. |author4=Morgenstern, Hal |author5= Lewis, Donald K.  | journal=Biometrics | volume=37 | issue=2 | year=1981 | pages=271–291 | jstor=2530417 | pmid=7272415| citeseerx=10.1.1.154.1197 }}</ref>'''<font color="#ff8000"> 倾向值匹配 Propensity Score Matching</font>'''，一种早期的匹配技术，是作为'''<font color="#ff8000"> 鲁宾因果模型 Rubin Causal Model</font>'''<ref name="Rosenbaum Rubin">{{cite journal |last1=Rosenbaum |first1=Paul R. |last2=Rubin |first2=Donald B. |title=The Central Role of the Propensity Score in Observational Studies for Causal Effects |journal=Biometrika |year=1983 |volume=70 |issue=1 |pages=41–55 |doi=10.1093/biomet/70.1.41 |doi-access=free }}</ref>的一部分发展起来的，但已被证明会增加模型依赖性、偏差、无效性和'''<font color="#32cd32"> 计算量 power </font>'''，与其他匹配方法相比不再推荐使用。<ref>{{Cite journal|last1=King|first1=Gary|last2=Nielsen|first2=Richard|date=October 2019|title=Why Propensity Scores Should Not Be Used for Matching|url=https://www.cambridge.org/core/product/identifier/S1047198719000111/type/journal_article|journal=Political Analysis|language=en|volume=27|issue=4|pages=435–454|doi=10.1017/pan.2019.11|issn=1047-1987|doi-access=free}}</ref>
+作为一种统计技术，'''匹配 Matching'''通过在'''<font color="#ff8000">观察研究 Observational Study</font>'''或'''<font color="#ff8000">准实验研究 Quasi-experiment</font>'''（即'''<font color="#ff8000"> 处理 Treatment </font>'''是非随机分配的）中比较已处理和未处理的单元，以评估处理的效果。匹配的目标是，对于每个处理单元，找到一个(或多个)具有相似可观察特征的未处理单元，以评估处理效果。通过处理单元与相似未处理单元的匹配，匹配技术可以比较处理单元与未处理单元的不同结果，从而评估处理效应，减少混杂效应带来的偏差。<ref>{{cite journal | doi=10.2307/2529684 | last=Rubin | first=Donald B. | title=Matching to Remove Bias in Observational Studies | journal=Biometrics | volume=29 | issue=1 | year=1973 | pages=159–183 | jstor=2529684}}</ref><ref>{{cite journal | title=On Stratification, Grouping and Matching | last=Anderson | first=Dallas W. |author2=Kish, Leslie |author3=Cornell, Richard G.  | journal=Scandinavian Journal of Statistics | volume=7 | issue=2 | year=1980 | pages=61–66 | jstor=4615774}}</ref><ref>{{cite journal | doi=10.2307/2530417 | title=Matching in Epidemiologic Studies: Validity and Efficiency Considerations | last=Kupper | first=Lawrence L. |author2=Karon, John M. |author3=Kleinbaum, David G. |author4=Morgenstern, Hal |author5= Lewis, Donald K.  | journal=Biometrics | volume=37 | issue=2 | year=1981 | pages=271–291 | jstor=2530417 | pmid=7272415| citeseerx=10.1.1.154.1197 }}</ref>'''<font color="#ff8000"> 倾向值匹配 Propensity Score Matching</font>'''，一种早期的匹配技术，是作为'''<font color="#ff8000"> 鲁宾因果模型 Rubin Causal Model</font>'''<ref name="Rosenbaum Rubin">{{cite journal |last1=Rosenbaum |first1=Paul R. |last2=Rubin |first2=Donald B. |title=The Central Role of the Propensity Score in Observational Studies for Causal Effects |journal=Biometrika |year=1983 |volume=70 |issue=1 |pages=41–55 |doi=10.1093/biomet/70.1.41 |doi-access=free }}</ref>的一部分发展起来的，但已被证明会增加模型依赖性、偏差、无效性和'''<font color="#32cd32"> 计算量 power </font>'''，与其他匹配方法相比不再推荐使用。<ref>{{Cite journal|last1=King|first1=Gary|last2=Nielsen|first2=Richard|date=October 2019|title=Why Propensity Scores Should Not Be Used for Matching|url=https://www.cambridge.org/core/product/identifier/S1047198719000111/type/journal_article|journal=Political Analysis|language=en|volume=27|issue=4|pages=435–454|doi=10.1017/pan.2019.11|issn=1047-1987|doi-access=free}}</ref>
@@ 第14行： / 第14行： @@
-当感兴趣的结果是连续的，对　'''<font color="#ff8000"> 平均处理效 Average Treatment Effect </font>'''应进行估计。
+当感兴趣的结果是连续的，对'''<font color="#ff8000">平均处理效 Average Treatment Effect </font>'''应进行估计。
@@ 第44行： / 第44行： @@
 *{{cite book |last1=Angrist |first1=Joshua D. |last2=Pischke |first2=Jörn-Steffen |chapter=Regression Meets Matching |title=Mostly Harmless Econometrics: An Empiricist's Companion |publisher=Princeton University Press |year=2009 |isbn=978-0-691-12034-8 |pages=69–80 }}
+==编者推荐==
 ----
-本中文词条由[[用户: Sikongpop|Sikongpop]]用户参与编译，[[用户:LFZ|LFZ]]参与审校，[[用户:薄荷|薄荷]]编辑，欢迎在讨论页面留言。
+本中文词条由[[用户: Frombtotop|Frombtotop]]翻译，[[用户:薄荷|薄荷]]编辑，欢迎在讨论页面留言。

“匹配”的版本间的差异

2021年6月27日 (日) 10:06的版本

目录

分析

过匹配

另见

参考文献

进一步阅读

编者推荐

导航菜单

搜索