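Changes

Added 25,895 bytes, 15:19, Tuesday, 25 May 2021
This entry was provisionally machine-translated by Caiyun Xiaoyi (895 characters translated) and has not yet been manually edited or proofread. We apologize for any inconvenience to readers.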

[[File:StratifiedRandomSampling.jpg|thumb|220x220px|Graphic breakdown of stratified random sampling]]


In [[statistics]], '''stratified randomization''' is a method of [[Sampling (statistics)|sampling]] that first divides the whole study [[Statistical population|population]] into [[Statistical population|subgroups]] sharing the same [[Variable and attribute (research)|attributes]] or characteristics, known as strata, and then draws a [[Simple random sample|simple random sample]] from each stratum, so that every element within a subgroup is selected [[Bias (statistics)|without bias]], randomly and entirely by chance, at every stage of the sampling process.<ref name=":3" /><ref>{{Citation|title=Simple random sample|date=2020-03-18|url=https://en.wikipedia.org/w/index.php?title=Simple_random_sample&oldid=946144051|work=Wikipedia|language=en|access-date=2020-04-07}}</ref> Stratified randomization is considered a subdivision of [[stratified sampling]], and should be adopted when shared attributes exist only partially and vary widely between subgroups of the investigated population, so that the subgroups require special consideration or clear distinction during sampling.<ref>{{Citation|title=Stratified sampling|date=2020-02-09|url=https://en.wikipedia.org/w/index.php?title=Stratified_sampling&oldid=939938944|work=Wikipedia|language=en|access-date=2020-04-07}}</ref> It should be distinguished from [[cluster sampling]], in which a simple random sample of several entire clusters is selected to represent the whole population, and from stratified systematic sampling, in which [[systematic sampling]] is carried out after stratification. Stratified random sampling is sometimes also known as "'''quota random sampling'''".<ref name=":3">{{Cite web|url=https://www.investopedia.com/ask/answers/032615/what-are-some-examples-stratified-random-sampling.asp|title=How Stratified Random Sampling Works|last=Nickolas|first=Steven|date=July 14, 2019|website=Investopedia|language=en|access-date=2020-04-07}}</ref>




== Steps for stratified randomization ==

Stratified randomization is most useful when the target population is [[Homogeneity and heterogeneity|heterogeneous]], because it shows how the trends or characteristics under study differ between strata.<ref name=":3" /> A stratified randomization is performed in the following eight steps:<ref name=":4">{{Cite web|url=https://www.statisticshowto.com/stratified-random-sample/|title=Stratified Random Sample: Definition, Examples|last=Stephanie|date=Dec 11, 2013|website=Statistics How To|language=en-US|access-date=2020-04-07}}</ref><ref name=":5">{{Cite web|url=https://www.questionpro.com/blog/stratified-random-sampling/|title=Stratified Random Sampling: Definition, Method and Examples|date=2018-03-13|website=QuestionPro|language=en|access-date=2020-04-07}}</ref>




# Define a target population.
# Define stratification [[Variable and attribute (research)|variables]] and decide the number of strata to be created. Common stratification variables include [[Ageing|age]], [[socioeconomic status]], [[nationality]], [[Race (human categorization)|race]], [[Educational stage|education level]] and others, and they should be in line with the research objective. Ideally, 4 to 6 strata should be used, as any increase in stratification variables raises the probability that some of them cancel out the impact of others.<ref name=":5" />
# Use a [[sampling frame]] to evaluate all the elements in the target population. Make changes afterwards based on [[Coverage probability|coverage]] and grouping.
# List all the elements and consider the sampling result. The strata should be [[Mutual exclusivity|mutually exclusive]] and together cover all members of the population, so that each member of the population falls into exactly one [[Uniqueness quantification|unique]] stratum, along with the other members that differ least from it.<ref name=":4" />
# Decide on the random sampling selection criteria. This can be done manually or with a purpose-built computer program.
# Assign a random and unique number to every element, then sort the elements by their assigned numbers.
# Review the size of each stratum and the [[Probability distribution|numerical distribution]] of all elements in every stratum. Determine the type of sampling, either proportional or disproportional stratified sampling.
# Carry out the selected random sampling as defined in step 5. At least one element must be chosen from each stratum so that the final sample includes representatives of every stratum. If two or more elements are selected from each stratum, the [[Margin of error|error margins]] of the collected data can be calculated.<ref name=":5" />
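
The steps above can be sketched in a few lines of Python. This is only an illustrative sketch, not a reference implementation: the function name <code>stratified_sample</code>, the toy population, the <code>age_group</code> variable and the 10% sampling fraction are all assumptions made for the example, and proportional allocation is assumed for step 7.

```python
import random
from collections import defaultdict


def stratified_sample(population, stratum_of, fraction, seed=0):
    """Proportional stratified random sample (illustrative sketch)."""
    rng = random.Random(seed)

    # Steps 2-4: place every element in exactly one stratum.
    strata = defaultdict(list)
    for element in population:
        strata[stratum_of(element)].append(element)

    sample = {}
    for name, members in strata.items():
        # Step 7: proportional allocation, but never fewer than one element.
        n = max(1, round(fraction * len(members)))
        # Step 6: give each element a random, unique number and sort by it.
        numbers = rng.sample(range(len(members)), len(members))
        ordered = [m for _, m in sorted(zip(numbers, members), key=lambda pair: pair[0])]
        # Step 8: keep the first n elements of the randomly ordered stratum.
        sample[name] = ordered[:n]
    return sample


# Hypothetical population of 200 people in two age strata (150 and 50 members).
people = [
    {"id": i, "age_group": "under 40" if i % 4 else "40 and over"}
    for i in range(200)
]
picked = stratified_sample(people, lambda p: p["age_group"], fraction=0.1)
for name, members in picked.items():
    print(name, len(members))
```

Passing a different <code>fraction</code> for each stratum would turn the same sketch into disproportional stratified sampling.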




== Techniques ==

[[File:Stratified sampling.PNG|thumb|Simple random sampling after stratification step]]


Stratified randomization selects one or more prognostic factors so that the subgroups have, on average, similar entry characteristics. The patient factors can be chosen accurately by examining the outcomes of previous studies.<ref>{{Cite journal|last=Sylvester|first=Richard|date=December 1982|title=Fundamentals of clinical trials|journal=Controlled Clinical Trials|volume=3|issue=4|pages=385–386|doi=10.1016/0197-2456(82)90029-0|issn=0197-2456}}</ref>



The number of subgroups can be calculated by multiplying the number of strata for each factor.  Factors are measured before or at the time of randomization and experimental subjects are divided into several subgroups or strata according to the results of measurements.<ref name=":0">{{Cite book|last=Pocock, Stuart J.|title=Clinical trials : a practical approach|publisher=John Wiley & Sons Ltd|date=Jul 1, 2013|isbn=978-1-118-79391-6|location=Chichester|oclc=894581169}}</ref>
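
As a small worked example of this multiplication, the sketch below enumerates the strata for two factors with two levels each. The factor names and levels are hypothetical and chosen only for illustration.

```python
from itertools import product
from math import prod

# Hypothetical prognostic factors and their levels.
factors = {
    "age": ["under 50", "50 and over"],
    "nodal status": ["negative", "positive"],
}

# Number of strata = product of the number of levels of each factor.
n_strata = prod(len(levels) for levels in factors.values())

# Each stratum is one combination of factor levels.
strata = list(product(*factors.values()))

print(n_strata)
for stratum in strata:
    print(stratum)
```

Adding a third factor with three levels would give 2 × 2 × 3 = 12 strata, which illustrates how quickly the number of strata grows with each added factor.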




Within each stratum, several randomization strategies can be applied, including [[Simple random sample|simple randomization]], [[Blocking (statistics)#Blocking used for nuisance factors that can be controlled|blocked randomization]], and [[Minimisation (clinical trials)|minimization]].




=== Simple randomization within strata ===


Simple randomization is considered the easiest method for allocating subjects in each stratum. Each subject is assigned to a group purely at random, independently of every other assignment. Although it is easy to conduct, simple randomization is usually applied only in strata that contain more than 100 samples, since a small sample size would make the assignment unequal.<ref name=":0" />
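
A minimal sketch of simple randomization within one stratum follows. The function name <code>simple_randomize</code>, the two treatment labels and the stratum sizes are assumptions made for illustration.

```python
import random
from collections import Counter


def simple_randomize(subject_ids, treatments=("A", "B"), seed=0):
    """Assign each subject to a treatment independently and uniformly at random."""
    rng = random.Random(seed)
    return {sid: rng.choice(treatments) for sid in subject_ids}


# One small and one large hypothetical stratum.
small = simple_randomize(range(10))
large = simple_randomize(range(1000))

print(Counter(small.values()))
print(Counter(large.values()))
```

Because every assignment is independent, nothing forces the group sizes to be equal. The relative imbalance tends to shrink as the stratum grows, which is the reason for the size threshold mentioned above.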




=== Block randomization within strata ===

[[Randomized block design|Block randomization]], sometimes called permuted block randomization, uses blocks to allocate subjects from the same stratum equally to each group in the study. In block randomization, the allocation ratio (the ratio of the number of subjects in one specific group to the number in the other groups) and the group sizes are specified. The block size must be a multiple of the number of treatments so that samples in each stratum can be assigned to treatment groups in the intended ratio.<ref name=":0" /> For instance, there should be 4 or 8 strata in a clinical trial concerning breast cancer where age and nodal status are two prognostic factors and each factor is split into two levels. The blocks can be assigned to samples in several ways, including a random list and a computer program.<ref>{{Cite web|url=https://www.sealedenvelope.com/help/redpill/latest/block/|title=Sealed Envelope {{!}} Random permuted blocks|date=Feb 25, 2020|website=www.sealedenvelope.com|access-date=2020-04-07}}</ref><ref>{{Citation|last1=Friedman|first1=Lawrence M.|title=Introduction to Clinical Trials|date=2010|work=Fundamentals of Clinical Trials|pages=1–18|publisher=Springer New York|isbn=978-1-4419-1585-6|last2=Furberg|first2=Curt D.|last3=DeMets|first3=David L.|doi=10.1007/978-1-4419-1586-3_1}}</ref>
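
The following sketch shows one way a permuted block list could be generated for each stratum by a computer program. It assumes a 1:1 allocation ratio, a fixed block size of 4 and four hypothetical strata; the function name <code>permuted_block_allocation</code> is likewise an assumption for the example.

```python
import random


def permuted_block_allocation(n_subjects, treatments=("A", "B"), block_size=4, seed=0):
    """Treatment sequence for one stratum, built from randomly permuted blocks."""
    if block_size % len(treatments) != 0:
        raise ValueError("block size must be a multiple of the number of treatments")
    rng = random.Random(seed)
    allocation = []
    while len(allocation) < n_subjects:
        # Each block contains every treatment equally often (1:1 ratio assumed).
        block = list(treatments) * (block_size // len(treatments))
        rng.shuffle(block)
        allocation.extend(block)
    return allocation[:n_subjects]


# A separate list per stratum (hypothetical strata from two two-level factors).
stratum_names = [
    "under 50 / node-negative",
    "under 50 / node-positive",
    "50 and over / node-negative",
    "50 and over / node-positive",
]
schedule = {
    name: permuted_block_allocation(8, seed=i)
    for i, name in enumerate(stratum_names)
}
for name, sequence in schedule.items():
    print(name, "".join(sequence))
```

Within every completed block the two treatments appear equally often, so the groups inside a stratum can never differ by more than half a block.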




If the available data cannot represent the overall subgroup population, all subgroup sizes are treated as equally important. In some applications, subgroup size is decided by the amount of data available rather than by scaling sample sizes to subgroup sizes, which introduces bias in the effects of factors. Where the data need to be stratified by variance and subgroup variances differ significantly, it cannot be guaranteed that each subgroup's sample size is proportional to the overall subgroup population.

Block randomization is commonly used in experiments with a relatively large sample size to avoid an imbalanced allocation of samples with important characteristics. In fields with strict randomization requirements, such as [[clinical trial]]s, the allocation can become predictable when the investigators are not blinded and the block size is small. Permuted block randomization within strata can also cause an imbalance of samples among strata as the number of strata increases while the sample size is limited. For instance, it is possible that no sample meets the characteristics of certain strata.<ref>{{Cite book|title=Fundamentals of clinical trials|others=Friedman, Lawrence M., 1942-, Furberg, Curt,, DeMets, David L., 1944-, Reboussin, David,, Granger, Christopher B.|date=27 August 2015|isbn=978-3-319-18539-2|edition=Fifth|location=New York|oclc=919463985}}</ref>

Stratified sampling cannot be applied if the population cannot be completely partitioned into strata, since the sample sizes would then be proportional to the samples available rather than to the overall subgroup population.



Assigning samples to subgroups can involve overlap if subjects meet the inclusion criteria of multiple strata, which could result in a misrepresentation of the population.

=== Minimization method ===

To guarantee the similarity of the treatment groups, the "minimization" method can be used, which is more direct than random permuted blocks within strata. In the minimization method, samples in each stratum are assigned to treatment groups based on the sum of samples already in each treatment group, which keeps the number of subjects balanced among the groups.<ref name=":0" /> If the sums for multiple treatment groups are the same, simple randomization is used to assign the treatment. In practice, the minimization method requires a daily record of treatment assignments by prognostic factor, which can be kept effectively with a set of index cards. The minimization method effectively avoids imbalance among groups but involves less randomness than block randomization, because a random draw is made only when the treatment sums are the same. A feasible solution is to apply an additional random list that gives the treatment group with the smaller sum of marginal totals a higher chance of assignment (e.g. ¾) and the other treatments a lower chance (e.g. ¼).<ref name=":1">{{Cite journal|last=Pocock|first=S. J.|date=March 1979|title=Allocation of Patients to Treatment in Clinical Trials|journal=Biometrics|volume=35|issue=1|pages=183–197|doi=10.2307/2529944|jstor=2529944|pmid=497334|issn=0006-341X}}</ref>
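
The assignment rule described above can be sketched as follows. This is a simplified illustration under stated assumptions, not the procedure of any particular trial: the function name <code>minimization_assign</code>, the factor names, the history of earlier patients and the way the remaining probability is shared among non-favoured treatments are all choices made for the example.

```python
import random


def minimization_assign(new_subject, assigned, treatments=("A", "B"),
                        p_favoured=0.75, rng=None):
    """Pick a treatment for new_subject by minimization with a weighted draw.

    new_subject: dict mapping factor name -> level
    assigned:    list of (subject dict, treatment) for earlier subjects
    """
    if rng is None:
        rng = random.Random(0)

    # Marginal total per treatment: earlier subjects on that treatment who
    # share a factor level with the new subject, summed over all factors.
    totals = {t: 0 for t in treatments}
    for subject, treatment in assigned:
        for factor, level in new_subject.items():
            if subject.get(factor) == level:
                totals[treatment] += 1

    lowest = min(totals.values())
    favoured = [t for t in treatments if totals[t] == lowest]
    others = [t for t in treatments if totals[t] != lowest]

    # Equal sums: fall back to simple randomization.
    if not others:
        return rng.choice(list(treatments))
    # Otherwise favour the treatment(s) with the smallest marginal total.
    if rng.random() < p_favoured:
        return rng.choice(favoured)
    return rng.choice(others)


# Hypothetical record of earlier assignments.
history = [
    ({"age": "under 50", "nodes": "negative"}, "A"),
    ({"age": "under 50", "nodes": "positive"}, "A"),
    ({"age": "50 and over", "nodes": "negative"}, "B"),
]
next_arm = minimization_assign({"age": "under 50", "nodes": "negative"}, history)
print(next_arm)
```

For the new patient in this example the marginal totals are 3 for treatment A and 1 for treatment B, so B is favoured with probability ¾. Setting <code>p_favoured=1.0</code> gives the fully deterministic variant, in which a random draw happens only on ties.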



== Application ==

[[File:Assessing the role of a confounder.png|thumb|219x219px|Confounding factors are important to consider in clinical trials]]

Stratified random sampling is useful and productive in situations that require different [[weighting]]s for specific strata. In this way, researchers can adjust the selection mechanism for each stratum to amplify or minimize the desired characteristics in the survey result.<ref>{{Cite web|url=https://www.thoughtco.com/stratified-sampling-3026731|title=Understanding Stratified Samples and How to Make Them|last=Crossman|first=Ashley|date=Jan 27, 2020|website=ThoughtCo|language=en|access-date=2020-04-07}}</ref>



Stratified randomization is helpful when researchers seek [[Association (statistics)|associations]] between two or more strata, as simple random sampling carries a larger chance of unequal representation of the target groups. It is also useful when researchers wish to eliminate [[Confounding|confounders]] in [[Observational study|observational studies]], as stratified random sampling allows the [[covariance]]s and the [[P-value|''p''-values]] to be adjusted for more accurate results.<ref>{{Cite book|last=Hennekens, Charles H.|title=Epidemiology in medicine|date=1987|publisher=Little, Brown|others=Buring, Julie E., Mayrent, Sherry L.|isbn=0-316-35636-0|edition=1st|location=Boston, Massachusetts|oclc=16890223}}</ref>



Stratified random sampling also offers a higher level of [[Accuracy and precision|statistical accuracy]] than simple random sampling, due to the high [[relevance]] of the elements chosen to represent the population.<ref name=":5" /> The differences within strata are much smaller than the differences between strata. Hence, because the differences between samples within each stratum are minimized, the [[standard deviation]] is reduced, giving a higher degree of accuracy and a smaller error in the final results. This effectively reduces the [[Sample size determination|sample size]] needed and increases the [[Cost-effectiveness analysis|cost-effectiveness]] of sampling when research funding is tight.
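
The variance argument can be made explicit with the standard textbook result for stratified sampling. The notation is introduced here for illustration and does not come from the cited sources: <math>N_h</math> is the size of stratum <math>h</math>, <math>N</math> the population size, <math>W_h = N_h/N</math> the stratum weight, <math>n_h</math> the number of elements sampled from the stratum, and <math>\bar{y}_h</math> and <math>S_h^2</math> the sample mean and the variance within the stratum. The stratified estimate of the population mean and its variance are

<math display="block">\bar{y}_{st} = \sum_{h=1}^{L} W_h \bar{y}_h , \qquad \operatorname{Var}(\bar{y}_{st}) = \sum_{h=1}^{L} W_h^2 \left(1 - \frac{n_h}{N_h}\right) \frac{S_h^2}{n_h} .</math>

Only the within-stratum variances <math>S_h^2</math> appear in the formula, so when strata are internally homogeneous the variance of the estimate is small even though the strata differ widely from one another.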



In real life, stratified random sampling can be applied to results of election polling, investigations into income disparities among social groups, or measurements of education opportunities across nations.<ref name=":3" />



== Stratified randomization in clinical trials ==

In [[clinical trial]]s, patients are stratified according to their social and individual backgrounds, or any other factors that are relevant to the study, to match each of these groups within the entire patient population. The aim is to balance the clinical and prognostic factors, as the trials would not produce valid results if the study design is not balanced.<ref>{{Cite book|last1=Polit|first1=DF|title=Nursing Research: Generating and Assessing Evidence for Nursing Practice, 9th ed.|last2=Beck|first2=CT|publisher=Lippincott Williams & Wilkins.|year=2012|location=Philadelphia, USA: Wolters Klower Health}}</ref> The stratified randomization step is extremely important as an attempt to ensure that no bias, deliberate or accidental, affects the representative nature of the patient sample under study.<ref>{{Cite web|url=https://www.omixon.com/patient-stratification-in-clinical-trials/|title=Patient Stratification in Clinical Trials|date=2014-12-01|website=Omixon {{!}} NGS for HLA|language=en-US|access-date=2020-04-26}}</ref> It increases the study power, especially in small clinical trials (n < 400), as the known clinical traits used for stratification are thought to affect the outcomes of the interventions.<ref>{{Cite web|url=https://www.statisticshowto.com/stratified-randomization/|title=Stratified Randomization in Clinical Trials|last=Stephanie|date=2016-05-20|website=Statistics How To|language=en-US|access-date=2020-04-26}}</ref> It helps prevent [[Type I and type II errors|type I errors]], which is highly valued in clinical studies.<ref name=":6">{{Cite journal|last=Kernan|first=W|date=Jan 1999|title=Stratified Randomization for Clinical Trials|journal=Journal of Clinical Epidemiology|volume=52|issue=1|pages=19–26|doi=10.1016/S0895-4356(98)00138-3|pmid=9973070}}</ref> It also has an important effect on sample size for active control equivalence trials and, in theory, facilitates [[subgroup analysis]] and [[interim analysis]].<ref name=":6" />



Category:Sampling (statistics)


== Advantage ==

Category:Sampling techniques


<noinclude>

<small>This page was moved from [[wikipedia:en:Stratified randomization]]. Its edit history can be viewed at [[分层随机试验/edithistory]]</small></noinclude>

[[Category:待整理页面]]