添加46,446字节
、 2020年9月8日 (二) 22:04
此词条暂由彩云小译翻译,未经人工整理和审校,带来阅读不便,请见谅。
{{short description|Mathematical function that describes the probability of occurrence of different possible outcomes in an experiment}}
{{other uses|Distribution (disambiguation){{!}}Distribution}}
{{Off topic|date=August 2020|Random variable|talksectionheading}}
In [[probability theory]] and [[statistics]], a '''probability distribution''' is the mathematical [[Function (mathematics)|function]] that gives the probabilities of occurrence of different possible '''outcomes''' for an [[Experiment (probability theory)|experiment]].<ref name=":02">{{Cite book|title=The Cambridge dictionary of statistics|last=Everitt, Brian.|date=2006|publisher=Cambridge University Press|isbn=978-0-511-24688-3|edition=3rd|location=Cambridge, UK|oclc=161828328}}</ref><ref>{{Cite book|title=Basic probability theory|last=Ash, Robert B.|date=2008|publisher=Dover Publications|isbn=978-0-486-46628-6|edition=Dover|location=Mineola, N.Y.|pages=66–69|oclc=190785258}}</ref> It is a mathematical description of a [[Randomness|random]] phenomenon in terms of its [[sample space]] and the [[Probability|probabilities]] of [[Event (probability theory)|events]] (subsets of the sample space).<ref name=":1">{{Cite book|title=Probability and statistics : the science of uncertainty|last=Evans, Michael (Michael John)|date=2010|publisher=W.H. Freeman and Co|others=Rosenthal, Jeffrey S. (Jeffrey Seth)|isbn=978-1-4292-2462-8|edition=2nd|location=New York|pages=38|oclc=473463742}}</ref>
In probability theory and statistics, a probability distribution is the mathematical function that gives the probabilities of occurrence of different possible outcomes for an experiment. It is a mathematical description of a random phenomenon in terms of its sample space and the probabilities of events (subsets of the sample space).
在概率论和统计学中,概率分布是一个数学函数,它给出了一个实验出现不同可能结果的概率。它是根据样本空间和事件概率(样本空间的子集)对随机现象的数学描述。
For instance, if the [[random variable]] {{mvar|X}} is used to denote the outcome of a coin toss ("the experiment"), then the probability distribution of {{mvar|X}} would take the value 0.5 for {{math|''X'' {{=}} heads}}, and 0.5 for {{math|''X'' {{=}} tails}} (assuming the coin is fair). Examples of random phenomena include the weather condition in a future date, the height of a person, the fraction of male students in a school, the results of a [[Survey methodology|survey]], etc.<ref name="ross">{{cite book|first=Sheldon M.|last=Ross|title=A first course in probability|publisher=Pearson|year=2010}}</ref>
For instance, if the random variable is used to denote the outcome of a coin toss ("the experiment"), then the probability distribution of would take the value 0.5 for heads}}, and 0.5 for tails}} (assuming the coin is fair). Examples of random phenomena include the weather condition in a future date, the height of a person, the fraction of male students in a school, the results of a survey, etc.
例如,如果使用随机变量来表示掷硬币的结果(“实验”) ,那么概率分布将取正面}的值0.5,反面}的值0.5(假设硬币是公平的)。随机现象的例子包括未来某一天的天气状况、一个人的身高、学校中男生的比例、调查结果等等。
A probability distribution is a mathematical function that has a [[sample space]] as its '''input''', and gives a '''probability''' as its output. The sample space is the [[Set (mathematics)|set]] of all possible [[Outcome (probability)|outcomes]] of a random phenomenon being observed; it may be the set of [[real numbers]] or a set of [[vector (mathematics)|vectors]], or it may be a list of non-numerical values. For example, the sample space of a coin flip would be {{math|{heads, tails} }}.
A probability distribution is a mathematical function that has a sample space as its input, and gives a probability as its output. The sample space is the set of all possible outcomes of a random phenomenon being observed; it may be the set of real numbers or a set of vectors, or it may be a list of non-numerical values. For example, the sample space of a coin flip would be .
概率分布是一个数学函数,它的输入是一个样本空间,输出是一个概率。样本空间是所观察到的随机现象的所有可能结果的集合; 它可能是一组实数或一组向量,也可能是一组非数值。例如,抛硬币的样本空间是。
Probability distributions are generally divided into two classes. A '''discrete probability distribution''' is applicable to the scenarios where the set of possible outcomes is [[discrete probability distribution|discrete]] (e.g. a coin toss or the roll of a dice), and the probabilities are here encoded by a discrete list of the probabilities of the outcomes, known as the [[probability mass function]]. On the other hand, '''continuous probability distributions''' are applicable to scenarios where the set of possible outcomes can take on values in a continuous range (e.g. real numbers), such as the temperature on a given day. In this case, probabilities are typically described by a [[probability density function]].<ref name="ross" /><ref>{{cite book|author1=DeGroot, Morris H.|author2=Schervish, Mark J.|title=Probability and Statistics|publisher=Addison-Wesley|year=2002}}</ref> The [[normal distribution]] is a commonly encountered continuous probability distribution. More complex experiments, such as those involving [[stochastic processes]] defined in [[continuous time]], may demand the use of more general [[probability measure]]s.
Probability distributions are generally divided into two classes. A discrete probability distribution is applicable to the scenarios where the set of possible outcomes is discrete (e.g. a coin toss or the roll of a dice), and the probabilities are here encoded by a discrete list of the probabilities of the outcomes, known as the probability mass function. On the other hand, continuous probability distributions are applicable to scenarios where the set of possible outcomes can take on values in a continuous range (e.g. real numbers), such as the temperature on a given day. In this case, probabilities are typically described by a probability density function. The normal distribution is a commonly encountered continuous probability distribution. More complex experiments, such as those involving stochastic processes defined in continuous time, may demand the use of more general probability measures.
概率分布一般分为两类。离散概率分布适用于一组可能的结果是离散的情况。抛硬币或掷骰子) ,这里的概率被编码为结果概率的离散列表,称为概率质量函数。另一方面,连续概率分布适用于一组可能的结果可以在一个连续的范围内取值的情况(例如:。实数) ,例如某一天的温度。在这种情况下,概率通常由概率密度函数描述。正态分布是一种常见的连续概率分布。更复杂的实验,例如那些涉及连续时间定义的随机过程的实验,可能需要使用更一般的概率测度。
A probability distribution whose sample space is one-dimensional (for example real numbers, list of labels, ordered labels or binary) is called [[Univariate distribution|univariate]], while a distribution whose sample space is a [[vector space]] of dimension 2 or more is called [[Multivariate distribution|multivariate]]. A univariate distribution gives the probabilities of a single [[random variable]] taking on various alternative values; a multivariate distribution (a [[joint probability distribution]]) gives the probabilities of a [[random vector]] – a list of two or more random variables – taking on various combinations of values. Important and commonly encountered univariate probability distributions include the [[binomial distribution]], the [[hypergeometric distribution]], and the [[normal distribution]]. The [[multivariate normal distribution]] is a commonly encountered multivariate distribution.
A probability distribution whose sample space is one-dimensional (for example real numbers, list of labels, ordered labels or binary) is called univariate, while a distribution whose sample space is a vector space of dimension 2 or more is called multivariate. A univariate distribution gives the probabilities of a single random variable taking on various alternative values; a multivariate distribution (a joint probability distribution) gives the probabilities of a random vector – a list of two or more random variables – taking on various combinations of values. Important and commonly encountered univariate probability distributions include the binomial distribution, the hypergeometric distribution, and the normal distribution. The multivariate normal distribution is a commonly encountered multivariate distribution.
一个样本空间为一维(例如实数、标签列表、有序标签或二进制)的概率分布被称为单变量,而样本空间为二维或更多向量空间的分布被称为多变量。单变量分布给出了单个随机变量取不同替代值的概率; 联合分布联合分布给出了一个随机向量的概率——一个由两个或多个随机变量组成的列表——取值的各种组合。重要的和常见的单变量概率分布包括二项分布、超几何分布和正态分布。多变量正态分布是一种常见的联合分布。
==Introduction==
[[File:Dice Distribution (bar).svg|thumb|250px|right|The [[probability mass function]] (pmf) ''p''(''S'') specifies the probability distribution for the sum ''S'' of counts from two [[dice]]. For example, the figure shows that ''p''(11) = 2/36 = 1/18. The pmf allows the computation of probabilities of events such as ''P''(''S'' > 9) = 1/12 + 1/18 + 1/36 = 1/6, and all other probabilities in the distribution.]]
The [[probability mass function (pmf) p(S) specifies the probability distribution for the sum S of counts from two dice. For example, the figure shows that p(11) = 2/36 = 1/18. The pmf allows the computation of probabilities of events such as P(S > 9) = 1/12 + 1/18 + 1/36 = 1/6, and all other probabilities in the distribution.]]
[[概率质量函数(pmf) p (s)]指定两个骰子计数总和 s 的概率分布。例如,图中显示 p (11) = 2/36 = 1/18。Pmf 允许计算事件的概率,如 p (s > 9) = 1/12 + 1/18 + 1/36 = 1/6,以及分布中的所有其他概率。]
To define probability distributions for the simplest cases, it is necessary to distinguish between '''discrete''' and '''continuous''' [[random variable]]s. In the discrete case, it is sufficient to specify a [[probability mass function]] <math>p</math> assigning a probability to each possible outcome: for example, when throwing a fair [[Dice|die]], each of the six values 1 to 6 has the probability 1/6. The probability of an [[Event (probability theory)|event]] is then defined to be the sum of the probabilities of the outcomes that satisfy the event; for example, the probability of the event "the dice rolls an even value" is
To define probability distributions for the simplest cases, it is necessary to distinguish between discrete and continuous random variables. In the discrete case, it is sufficient to specify a probability mass function p assigning a probability to each possible outcome: for example, when throwing a fair die, each of the six values 1 to 6 has the probability 1/6. The probability of an event is then defined to be the sum of the probabilities of the outcomes that satisfy the event; for example, the probability of the event "the dice rolls an even value" is
为了定义最简单的概率分布,有必要区分离散和连续的随机变量。在离散情况下,指定一个概率质量函数 p 就足够了,它为每个可能的结果赋予一个概率: 例如,当投掷一个公平骰子时,6个值中的每一个的概率为1到6。然后将事件的概率定义为满足事件的结果的概率之和; 例如,事件”骰子掷出偶数值”的概率是
:<math>p(2) + p(4) + p(6) = 1/6+1/6+1/6=1/2.</math>
p(2) + p(4) + p(6) = 1/6+1/6+1/6=1/2.
P (2) + p (4) + p (6) = 1/6 + 1/6 + 1/6 = 1/2.
In contrast, when a random variable takes values from a continuum then typically, any individual outcome has probability zero and only events that include infinitely many outcomes, such as intervals, can have positive probability. For example, the probability that a given object weighs ''exactly'' 500 g is zero, because the probability of measuring exactly 500 g tends to zero as the accuracy of our measuring instruments increases. Nevertheless, in quality control one might demand that the probability of a "500 g" package containing between 490 g and 510 g should be no less than 98%, and this demand is less sensitive to the accuracy of measurement instruments.
In contrast, when a random variable takes values from a continuum then typically, any individual outcome has probability zero and only events that include infinitely many outcomes, such as intervals, can have positive probability. For example, the probability that a given object weighs exactly 500 g is zero, because the probability of measuring exactly 500 g tends to zero as the accuracy of our measuring instruments increases. Nevertheless, in quality control one might demand that the probability of a "500 g" package containing between 490 g and 510 g should be no less than 98%, and this demand is less sensitive to the accuracy of measurement instruments.
相比之下,当一个随机变量从一个连续体中取值时,那么通常情况下,任何单个结果的概率为零,只有包含无限多个结果的事件,例如间隔,才有正的概率。例如,一个给定的物体重量正好是500克的概率为零,因为随着我们测量仪器精度的提高,正好测量500克的概率趋向于零。然而,在质量控制方面,人们可能会要求包装在490克至510克之间的“500克”包装的可能性不低于98% ,而这一要求对测量仪器的准确性不太敏感。
Continuous probability distributions can be described in several ways. The [[probability density function]] describes the [[infinitesimal]] probability of any given value, and the probability that the outcome lies in a given interval can be computed by [[Integration (mathematics)|integrating]] the probability density function over that interval. The probability that the possible values lie in some fixed interval can be related to the way sums converge to an integral; therefore, continuous probability is based on the definition of an integral.
Continuous probability distributions can be described in several ways. The probability density function describes the infinitesimal probability of any given value, and the probability that the outcome lies in a given interval can be computed by integrating the probability density function over that interval. The probability that the possible values lie in some fixed interval can be related to the way sums converge to an integral; therefore, continuous probability is based on the definition of an integral.
连续概率分布可以用几种方法来描述。概率密度函数概率描述了任意给定值的无穷小概率,并且结果在给定区间内的概率可以通过在该区间上积分概率密度函数概率来计算。可能值位于某一固定区间的概率可以与和收敛于积分的方式有关,因此,连续概率是基于积分的定义。
[[File:Combined Cumulative Distribution Graphs.png|thumb|455x455px|On the left is the probability density function. On the right is the cumulative distribution function, which is the area under the probability density curve.]]
On the left is the probability density function. On the right is the cumulative distribution function, which is the area under the probability density curve.
左边是概率密度函数。右边是累积分布函数,它是概率密度曲线下面的区域。
The [[cumulative distribution function]] describes the probability that the random variable is no larger than a given value; the probability that the outcome lies in a given interval can be computed by taking the difference between the values of the cumulative distribution function at the endpoints of the interval. The cumulative distribution function is the [[antiderivative]] of the probability density function provided that the latter function exists. The cumulative distribution function is the area under the [[probability density function]] from minus infinity <math>\infty</math> to <math>x</math> as described by the picture to the right.<ref>{{Cite book|title=A modern introduction to probability and statistics : understanding why and how|date=2005|publisher=Springer|others=Dekking, Michel, 1946-|isbn=978-1-85233-896-1|location=London|oclc=262680588}}</ref>
The cumulative distribution function describes the probability that the random variable is no larger than a given value; the probability that the outcome lies in a given interval can be computed by taking the difference between the values of the cumulative distribution function at the endpoints of the interval. The cumulative distribution function is the antiderivative of the probability density function provided that the latter function exists. The cumulative distribution function is the area under the probability density function from minus infinity \infty to x as described by the picture to the right.
累积分布函数指标描述了随机变量不大于给定值的概率; 结果在给定区间内的概率可以通过计算区间终点的累积分布函数差来计算。累积分布函数函数是概率密度函数函数的反导函数,前提是后者存在。正如右边图片所描述的那样,累积分布函数是从负无穷到 x 的概率密度函数下面的区域。
[[File:Standard deviation diagram.svg|right|thumb|250px|The [[probability density function]] (pdf) of the [[normal distribution]], also called Gaussian or "bell curve", the most important continuous random distribution. As notated on the figure, the probabilities of intervals of values correspond to the area under the curve.]]
The [[probability density function (pdf) of the normal distribution, also called Gaussian or "bell curve", the most important continuous random distribution. As notated on the figure, the probabilities of intervals of values correspond to the area under the curve.]]
正态分布的[概率密度函数(pdf) ,也称为高斯或钟形曲线,最重要的连续随机分布。如图所示,值间隔的概率对应于曲线下面积。]
== Terminology ==
Some key concepts and terms, widely used in the literature on the topic of probability distributions, are listed below.<ref name=":02" />
Some key concepts and terms, widely used in the literature on the topic of probability distributions, are listed below. the regions close to the bounds of the random variable, if the pmf or pdf are relatively low therein. Usually has the form X > a, X < b or a union thereof.
一些关键的概念和术语,广泛用于文献的主题概率分布,列出如下。接近随机变量界限的区域,如果 pmf 或 pdf 相对较低的话。通常具有形式 x > a,x < b 或它们的结合。
=== Functions for discrete variables ===
Well-known discrete probability distributions used in statistical modeling include the Poisson distribution, the Bernoulli distribution, the binomial distribution, the geometric distribution, and the negative binomial distribution.
用于统计建模的著名离散概率分布包括泊松分佈、伯努利分布、二项分布、几何分佈和负二项分布。
*'''Probability function''': describes the probability distribution of a discrete random variable.
*'''[[Probability mass function|Probability mass function (pmf)]]:''' function that gives the probability that a discrete random variable is equal to some value.
*'''[[Frequency distribution]]''': a table that displays the frequency of various outcomes '''in a sample'''.
For a discrete random variable X, let u0, u1, ... be the values it can take with non-zero probability. Denote
对于一个离散的随机变量 x,设 u0,u1,... 是它在非零概率情况下可以取的值。表示
*'''Relative frequency distribution''': a [[frequency distribution]] where each value has been divided (normalized) by a number of outcomes in a [[Sample (statistics)|sample]] i.e. sample size.
*'''Discrete probability distribution function''': general term to indicate the way the total probability of 1 is distributed over '''all''' various possible outcomes (i.e. over entire population) for discrete random variable.
\Omega_i=X^{-1}(u_i)= \{\omega: X(\omega)=u_i\},\, i=0, 1, 2, \dots
Omega _ i = x ^ {-1}(u _ i) = { Omega: x (Omega) = u _ i } ,i = 0,1,2,点
*'''[[Cumulative distribution function]]''': function evaluating the [[probability]] that <math>X</math> will take a value less than or equal to <math>x</math> for a discrete random variable.
*'''[[Categorical distribution]]''': for discrete random variables with a finite set of values.
These are disjoint sets, and for such sets
这些是不相交的集合,对于这样的集合
=== Functions for continuous variables ===
P\left(\bigcup_i \Omega_i\right)=\sum_i P(\Omega_i)=\sum_i P(X=u_i)=1.
P 左(bigcup _ i Omega _ i 右) = sum _ i p (Omega _ i) = sum _ i p (x = u _ i) = 1。
* '''[[Probability density function]] (pdf):''' function whose value at any given sample (or point) in the [[sample space]] (the set of possible values taken by the random variable) can be interpreted as providing a ''relative likelihood'' that the value of the random variable would equal that sample.
It follows that the probability that X takes any value except for u0, u1, ... is zero, and thus one can write X as
由此可见,除了 u0、 u1、 ... 之外,x 取任何值的概率为零,因此可以将 x 写为
* '''Continuous probability distribution function''': most often reserved for continuous random variables.
* '''[[Cumulative distribution function]]''': function evaluating the [[probability]] that <math>X</math> will take a value less than or equal to <math>x</math> for continuous variable.
X(\omega)=\sum_i u_i 1_{\Omega_i}(\omega)
X (Omega) = sum _ i u _ i 1{ Omega _ i }(Omega)
===Basic terms===
except on a set of probability zero, where 1_A is the indicator function of A. This may serve as an alternative definition of discrete random variables.
除了在一组概率为零的情况下,其中1 a 是 a 的指示函数。这可以作为离散随机变量的另一种定义。
* [[Mode (statistics)|'''Mode''']]: for a discrete random variable, the value with highest probability; for a continuous random variable, a location at which the probability density function has a local peak.
* [[Support (mathematics)|'''Support''']]: set of values that can be assumed, with non-zero probability, by the random variable.
* '''Tail''':<ref name='tail'>More information and examples can be found in the articles [[Heavy-tailed distribution]], [[Long-tailed distribution]], [[fat-tailed distribution]]</ref> the regions close to the bounds of the random variable, if the pmf or pdf are relatively low therein. Usually has the form <math>X > a</math>, <math>X < b</math> or a union thereof.
*'''Head''':<ref name='tail' /> the region where the pmf or pdf is relatively high. Usually has the form <math>a < X < b</math>.
* '''[[Expected value]]''' or '''mean''': the [[weighted average]] of the possible values, using their probabilities as their weights; or the continuous analog thereof.
A continuous probability distribution is a probability distribution whose support is an uncountable set, such as an interval in the real line. They are uniquely characterized by a cumulative density function that can be used to calculate the probability for each subset of the support. There are many examples of continuous probability distributions: normal, uniform, chi-squared, and others.
一个连续的概率分布是一个支持不可数集的概率分布,比如实数行中的一个区间。它们是唯一拥有属性的一个累积密度函数,可以用来计算支持的每个子集的概率。有许多连续概率分布的例子: 正态、均匀、卡方等等。
*'''[[Median]]''': the value such that the set of values less than the median, and the set greater than the median, each have probabilities no greater than one-half.
*'''[[Variance]]''': the second moment of the pmf or pdf about the mean; an important measure of the [[Statistical dispersion|dispersion]] of the distribution.
A random variable X has a continuous probability distribution if there is a function f: \mathbb{R} \rightarrow [0, \infty) such that for each interval I \subset \mathbb{R} the probability of X belonging to I is given by the integral of f over I. For example, if I = [a, b] then we would have:
如果有一个函数 f: mathbb { r } right tarrow [0,infty) ,对于每个区间 i 子集,x 属于 i 的概率由 f 在 i 上的积分给出,则 x 有一个连续的概率分布。例如,如果 i = [ a,b ] ,那么我们将得到:
* '''[[Standard deviation]]''': the square root of the variance, and hence another measure of dispersion.
<math>
《数学》
* [[Symmetric probability distribution|'''Symmetry''']]: a property of some distributions in which the portion of the distribution to the left of a specific value(usually the median) is a mirror image of the portion to its right.
\operatorname{P}\left[a \le X \le b\right] = \int_a^b f(x) \, dx
操作符{ p }左[ a le x le b 右] = int _ a ^ b f (x) ,dx
*'''[[Skewness]]''': a measure of the extent to which a pmf or pdf "leans" to one side of its mean. The third [[standardized moment]] of the distribution.
</math>
数学
*'''[[Kurtosis]]''': a measure of the "fatness" of the tails of a pmf or pdf. The fourth standardized moment of the distribution.
In particular, the probability for X to take any single value a (that is a \le X \le a) is zero, because an integral with coinciding upper and lower limits is always equal to zero. A variable that satisfies the above is called continuous random variable. Its cumulative density function is defined as
特别地,x 取任何单个值 a (即 le x le a)的概率为零,因为一个与上下限重合的积分总是等于零。满足上述条件的变量称为连续随机变量。其累积密度函数定义为
<math>
《数学》
==Discrete probability distribution==
F(x) = \operatorname{P}\left[-\infty < X \le x\right] = \int_{-\infty}^x f(x) \, dx
F (x) = 操作者名{ p }左[-infty < x le x right ] = int _ {-infty } ^ x f (x) ,dx
{{See also|Probability mass function|Categorical distribution}}
</math>
数学
which, by this definition, has the properties:
根据这个定义,它具有以下属性:
[[File:Discrete probability distrib.svg|right|thumb|The probability mass function of a discrete probability distribution. The probabilities of the [[Singleton (mathematics)|singleton]]s {1}, {3}, and {7} are respectively 0.2, 0.5, 0.3. A set not containing any of these points has probability zero.]]
<ul>
< ul >
[[File:Discrete probability distribution.svg|right|thumb|The [[cumulative distribution function|cdf]] of a discrete probability distribution, ...]]
<li style="margin: 0.7rem 0;">F(x) is non-decreasing;</li>
“边际”0.7rem 0“ f (x)是不减的
[[File:Normal probability distribution.svg|right|thumb|... of a continuous probability distribution, ...]]
<li style="margin: 0.7rem 0;">0 \le F(x) \le 1;</li>
“利风” = “利润” : 0.7rem 0; “0” > 0 le f (x) le 1;
[[File:Mixed probability distribution.svg|right|thumb|... of a distribution which has both a continuous part and a discrete part.]]
<li style="margin: 0.7rem 0;">\lim_{x \rightarrow -\infty} F(x) = 0 and \lim_{x \rightarrow \infty} F(x) = 1;</li>
< li style = " margin: 0.7rem 0; " > lim { x right tarrow-infty } f (x) = 0 and lim { x right tarrow infty } f (x) = 1; </li >
<li style="margin: 0.7rem 0;">P(a \le X < b) = F(b) - F(a); and</li>
< li style = " margin: 0.7rem 0; " > p (a le x < b) = f (b)-f (a) ; </li >
A '''discrete probability distribution''' is a probability distribution that can take on a countable number of values.<ref>{{Cite book|title=Probability and stochastics|last=Erhan|first=Çınlar|date=2011|publisher=Springer|isbn=9780387878591|location=New York|pages=51|oclc=710149819}}</ref> For the probabilities to add up to 1, they have to decline to zero fast enough. For example, if <math>\operatorname{P}(X=n) = \tfrac{1}{2^n}</math> for ''n'' = 1, 2, ..., the sum of probabilities would be 1/2 + 1/4 + 1/8 + ... = 1.
<li style="margin: 0.7rem 0;">F(x) is continuous (due to the Riemann integral properties).</li>
< li style = " margin: 0.7rem 0; " > f (x)是连续的(由于黎曼积分属性)。 </li >
</ul>
</ul >
Well-known discrete probability distributions used in statistical modeling include the [[Poisson distribution]], the [[Bernoulli distribution]], the [[binomial distribution]], the [[geometric distribution]], and the [[negative binomial distribution]].<ref name=":1" /> Additionally, the [[Uniform distribution (discrete)|discrete uniform distribution]] is commonly used in computer programs that make equal-probability random selections between a number of choices.
It is also possible to think in the opposite direction, which allows more flexibility. Say F(x) is a function that satisfies all but the last of the properties above, then F represents the cumulative density function for some random variable: a discrete random variable if F is a step function, and a continuous random variable otherwise. This allows for continuous distributions that has a cumulative density function, but not a probability density function, such as the Cantor distribution.
也可以朝相反的方向思考,这样可以有更大的灵活性。假设 f (x)是满足上述所有性质的函数,那么 f 表示某个随机变量的累积密度函数: 如果 f 是阶跃函数,则为离散随机变量,否则为连续随机变量。这允许具有累积密度函数的连续分布,而不是概率密度函数分布,例如 Cantor 分布。
When a [[Sample (statistics)|sample]] (a set of observations) is drawn from a larger population, the sample points have an [[empirical distribution function|empirical distribution]] that is discrete and that provides information about the population distribution.
It is often necessary to generalize the above definition for more arbitrary subsets of the real line. In these contexts, a continuous probability distribution is defined as a probability distribution with a cumulative distribution function that is absolutely continuous. Equivalently, it is a probability distribution on the real numbers that is absolutely continuous with respect to the Lebesgue measure. Such distributions can be represented by their probability density functions. If X is such an absolutely continuous random variable, then it has a probability density function f(x), and its probability of falling into a Lebesgue-measurable set A \subset \mathbb{R} is:
通常有必要将上述定义推广到实线的更多任意子集。在这些情况下,一个连续的概率分布被定义为一个具有绝对连续的概率分布累积分布函数。等价地,它是实数上的一个概率分布,对于勒贝格测度是绝对连续的。这种分布可以用它们的概率密度函数来表示。如果 x 是这样一个绝对连续的随机变量,那么它有一个概率密度函数 f (x) ,它落入 lebesgue 可测集 a 子集数学{ r }的概率是:
<math>
《数学》
===Measure theoretic formulation===
\operatorname{P}\left[X \in A\right] = \int_A f(x) \, d\mu
操作数名{ p }左[ x 在 a 右] = int _ a f (x) ,d mu
</math>
数学
A [[measurable function]] <math>X \colon A \to B </math> between a [[probability space]] <math>(A, \mathcal A, P)</math> and a [[measurable space]] <math>(B, \mathcal B) </math> is called a '''discrete random variable''' provided that its image is a countable set. In this case measurability of <math>X</math> means that the pre-images of singleton sets are measurable, i.e., <math>X^{-1}(\{b\}) \in \mathcal A</math> for all <math>b \in B</math>.
where \mu is the Lebesgue measure.
勒贝格测度在哪里。
The latter requirement induces a [[probability mass function]] <math>f_X \colon X(A) \to \mathbb R</math> via <math> f_X(b):=P(X^{-1}(\{b\}))</math>. Since the pre-images of disjoint sets
are disjoint,
Note on terminology: some authors use the term "continuous distribution" to denote distributions whose cumulative distribution functions are continuous, rather than absolutely continuous. These distributions are the ones \mu such that \mu\{x\}\,=\,0 for all \,x. This definition includes the (absolutely) continuous distributions defined above, but it also includes singular distributions, which are neither absolutely continuous nor discrete nor a mixture of those, and do not have a density. An example is given by the Cantor distribution.
关于术语的说明: 一些作者使用术语“连续分布”来表示分布,其累积分布函数是连续的,而不是绝对连续的。这些分布的 μ 使得 mu { x } ,= ,0 for all,x。这个定义包括上面定义的(绝对)连续分布,但也包括奇异分布,它既不是绝对连续的,也不是离散的,也不是这些分布的混合物,并且没有密度。给出了一个由 Cantor 分布表示的例子。
:<math>\sum_{b \in X(A)} f_X(b) = \sum_{b \in X(A)} P(X^{-1} (\{b\})) = P \left( \bigcup_{b \in X(A)} X^{-1}(\{b\}) \right) = P(A)=1.</math>
This recovers the definition given above.
===Cumulative distribution function===
Equivalently to the above, a discrete random variable can be defined as a random variable whose [[cumulative distribution function]] (cdf) increases only by [[jump discontinuity|jump discontinuities]]—that is, its cdf increases only where it "jumps" to a higher value, and is constant between those jumps. Note however that the points where the cdf jumps may form a dense set of the real numbers. The points where jumps occur are precisely the values which the random variable may take.
In the measure-theoretic formalization of probability theory, a random variable is defined as a measurable function X from a probability space (\Omega, \mathcal{F}, \mathbb{P}) to a measurable space (\mathcal{X},\mathcal{A}). Given that probabilities of events of the form \{\omega\in\Omega\mid X(\omega)\in A\} satisfy Kolmogorov's probability axioms, the probability distribution of X is the pushforward measure X_*\mathbb{P} of X , which is a probability measure on (\mathcal{X},\mathcal{A}) satisfying X_*\mathbb{P} = \mathbb{P}X^{-1}.
在概率论的度量理论形式化中,一个随机变量被定义为一个可测函数 x,从一个概率空间(Omega,mathcal { f } ,mathbb { p })到一个可测空间(mathcal { x } ,mathcal { a })。考虑到 a }中 Omega 中 x (Omega)中 ω 的事件概率满足 Kolmogorov 的概率公理,x 的概率分布是 x 的 x * mathbb { p }的前推测度,它是关于(cal { x } ,mathcal { a })满足 x * mathbb { p } = mathbb { p } x ^ {-1}的机率量测。
===Delta-function representation===
Consequently, a discrete probability distribution is often represented as a generalized [[probability density function]] involving [[Dirac delta function]]s, which substantially unifies the treatment of continuous and discrete distributions. This is especially useful when dealing with probability distributions involving both a continuous and a discrete part.<ref>{{Cite journal|last=Khuri|first=André I.|date=March 2004|title=Applications of Dirac's delta function in statistics|journal=International Journal of Mathematical Education in Science and Technology|language=en|volume=35|issue=2|pages=185–195|doi=10.1080/00207390310001638313|issn=0020-739X}}</ref>
===Indicator-function representation===
Most algorithms are based on a pseudorandom number generator that produces numbers X that are uniformly distributed in the half-open interval [0,1). These random variates X are then transformed via some algorithm to create a new random variate having the required probability distribution. With this source of uniform pseudo-randomness, realizations of any random variable can be generated.
大多数算法都是基于一个伪随机数生成器,它产生的数字 x 在半开区间内均匀分布。这些随机变量 x 然后通过一些算法转换,创建一个新的具有所需概率分布的随机变量。有了这种均匀的伪随机性,任何随机变量都可以被实现。
For a discrete random variable ''X'', let ''u''<sub>0</sub>, ''u''<sub>1</sub>, ... be the values it can take with non-zero probability. Denote
For example, suppose U has a uniform distribution between 0 and 1. To construct a random Bernoulli variable for some 0 < p < 1, we define
例如,假设 u 在0和1之间有一个均匀分布。为了构造一个0 < p < 1的随机贝努利变量,我们定义了
:<math>\Omega_i=X^{-1}(u_i)= \{\omega: X(\omega)=u_i\},\, i=0, 1, 2, \dots</math>
<math>{\displaystyle X ={\begin{cases}1,&{\mbox{if }}U<p\\0,&{\mbox{if }}U\geq p\end{cases}}}
{ displaystyle x = { begin { cases }1,& { mbox { if } u < p 0,& { mbox { if } u geq p end { cases }}
These are [[disjoint set]]s, and for such sets
</math>
数学
:<math>P\left(\bigcup_i \Omega_i\right)=\sum_i P(\Omega_i)=\sum_i P(X=u_i)=1.</math>
so that
所以
It follows that the probability that ''X'' takes any value except for ''u''<sub>0</sub>, ''u''<sub>1</sub>, ... is zero, and thus one can write ''X'' as
<math>\textrm{P}(X=1) = \textrm{P}(U<p) = p,
< math > textrm { p }(x = 1) = textrm { p }(u < p) = p,
\textrm{P}(X=0) = \textrm{P}(U\geq p) = 1-p.</math>
文本{ p }(x = 0) = 文本{ p }(u geq p) = 1-p. </math >
:<math>X(\omega)=\sum_i u_i 1_{\Omega_i}(\omega)</math>
This random variable X has a Bernoulli distribution with parameter p.
这个随机变量 x 有一个带参数 p 的伯努利分布。
except on a set of probability zero, where <math>1_A</math> is the [[indicator function]] of ''A''. This may serve as an alternative definition of discrete random variables.
==Continuous probability distribution==
{{See also|Probability density function}}
A '''continuous probability distribution''' is a probability distribution whose support is an uncountable set, such as an interval in the real line.<ref>{{Cite book|title=Introduction to probability models|author1=Sheldon M. Ross|date=2010|publisher=Elsevier}}</ref> They are uniquely characterized by a [[cumulative density function]]{{dn|date=August 2020}} that can be used to calculate the probability for each subset of the support. There are many examples of continuous probability distributions: [[normal distribution|normal]], [[Uniform distribution (continuous)|uniform]], [[Chi-squared distribution|chi-squared]], and [[List of probability distributions#Continuous distributions|others]].
A random variable <math>X</math> has a continuous probability distribution if there is a function <math>f: \mathbb{R} \rightarrow [0, \infty)</math> such that for each interval <math>I \subset \mathbb{R}</math> the probability of <math>X</math> belonging to <math>I</math> is given by the integral of <math>f</math> over <math>I</math>.<ref>Chapter 3.2 of {{harvp|DeGroot, Morris H.|Schervish, Mark J.|2002}}</ref> For example, if <math>I = [a, b]</math> then we would have:
:<math>
\operatorname{P}\left[a \le X \le b\right] = \int_a^b f(x) \, dx
</math>
In particular, the probability for <math>X</math> to take any single value <math>a</math> (that is <math>a \le X \le a</math>) is zero, because an [[integral]] with coinciding upper and lower limits is always equal to zero. A variable that satisfies the above is called '''continuous random variable'''. Its cumulative density function is defined as
:<math>
F(x) = \operatorname{P}\left[-\infty < X \le x\right] = \int_{-\infty}^x f(x) \, dx
</math>
which, by this definition, has the properties:
<ul>
<li style="margin: 0.7rem 0;"><math>F(x)</math> is non-decreasing;</li>
<li style="margin: 0.7rem 0;"><math>0 \le F(x) \le 1</math>;</li>
<li style="margin: 0.7rem 0;"><math>\lim_{x \rightarrow -\infty} F(x) = 0</math> and <math>\lim_{x \rightarrow \infty} F(x) = 1</math>;</li>
<li style="margin: 0.7rem 0;"><math>P(a \le X < b) = F(b) - F(a)</math>; and</li>
<li style="margin: 0.7rem 0;"><math>F(x)</math> is continuous (due to the [[Riemann integral]] properties).</li>
</ul>
It is also possible to think in the opposite direction, which allows more flexibility. Say <math>F(x)</math> is a function that satisfies all but the last of the properties above, then <math>F</math> represents the cumulative density function for some random variable: a discrete random variable if <math>F</math> is a step function, and a continuous random variable otherwise.<ref>See Theorem 2.1 of {{harvp|Vapnik|1998}}, or [[Lebesgue's decomposition theorem]]. The section [[#Delta-function_representation]] may also be of interest.</ref> This allows for continuous distributions that has a cumulative density function, but not a probability density function, such as the [[Cantor distribution]].
It is often necessary to generalize the above definition for more arbitrary subsets of the real line. In these contexts, a continuous probability distribution is defined as a probability distribution with a cumulative distribution function that is [[absolute continuity|absolutely continuous]]. Equivalently, it is a probability distribution on the [[real numbers]] that is [[absolute continuity|absolutely continuous]] with respect to the [[Lebesgue measure]]. Such distributions can be represented by their [[probability density function]]s. If <math>X</math> is such an absolutely continuous random variable, then it has a [[probability density function]] <math>f(x)</math>, and its probability of falling into a Lebesgue-measurable set <math>A \subset \mathbb{R}</math> is:
: <math>
\operatorname{P}\left[X \in A\right] = \int_A f(x) \, d\mu
</math>
where <math>\mu</math> is the Lebesgue measure.
Note on terminology: some authors use the term "continuous distribution" to denote distributions whose cumulative distribution functions are [[continuous function|continuous]], rather than [[absolute continuity|absolutely continuous]]. These distributions are the ones <math>\mu</math> such that <math>\mu\{x\}\,=\,0</math> for all <math>\,x</math>. This definition includes the (absolutely) continuous distributions defined above, but it also includes [[singular distribution]]s, which are neither absolutely continuous nor discrete nor a mixture of those, and do not have a density. An example is given by the [[Cantor distribution]].
== [[Andrey Kolmogorov|Kolmogorov]] definition ==
{{Main|Probability space|Probability measure}}
In the [[measure theory|measure-theoretic]] formalization of [[probability theory]], a [[random variable]] is defined as a [[measurable function]] <math>X</math> from a [[probability space]] <math>(\Omega, \mathcal{F}, \mathbb{P})</math> to a [[measurable space]] <math>(\mathcal{X},\mathcal{A})</math>. Given that probabilities of events of the form <math>\{\omega\in\Omega\mid X(\omega)\in A\}</math> satisfy [[Probability axioms|Kolmogorov's probability axioms]], the '''probability distribution of ''X''''' is the [[pushforward measure]] <math>X_*\mathbb{P}</math> of <math>X</math> , which is a [[probability measure]] on <math>(\mathcal{X},\mathcal{A})</math> satisfying <math>X_*\mathbb{P} = \mathbb{P}X^{-1}</math>.<ref>{{Cite book|title=Probability theory : an analytic view|last=W.|first=Stroock, Daniel|date=1999|publisher=Cambridge University Press|isbn=978-0521663496|edition= Rev.|location=Cambridge [England]|pages=11|oclc=43953136}}</ref><ref>{{Cite book|title=Foundations of the theory of probability|last=Kolmogorov|first=Andrey|publisher=Chelsea Publishing Company|year=1950|isbn=|location=New York, USA|pages=21–24|orig-year=1933}}</ref><ref>{{Cite web|url=https://mathcs.clarku.edu/~djoyce/ma217/axioms.pdf|title=Axioms of Probability|last=Joyce|first=David|date=2014|website=Clark University|url-status=live|archive-url=|archive-date=|access-date=December 5, 2019}}</ref>
==Random number generation==
{{Main|Pseudo-random number sampling}}
Most algorithms are based on a [[pseudorandom number generator]] that produces numbers ''X'' that are uniformly distributed in the [[half-open interval]] [0,1). These [[random variate]]s ''X'' are then transformed via some algorithm to create a new random variate having the required probability distribution. With this source of uniform pseudo-randomness, realizations of any random variable can be generated.<ref name=":0">{{Citation|last=Dekking|first=Frederik Michel|title=Why probability and statistics?|date=2005|work=A Modern Introduction to Probability and Statistics|pages=1–11|publisher=Springer London|isbn=978-1-85233-896-1|last2=Kraaikamp|first2=Cornelis|last3=Lopuhaä|first3=Hendrik Paul|last4=Meester|first4=Ludolf Erwin|doi=10.1007/1-84628-168-7_1}}</ref>
For example, suppose <math>U</math> has a uniform distribution between 0 and 1. To construct a random Bernoulli variable for some <math>0 < p < 1</math>, we define
<math>{\displaystyle X ={\begin{cases}1,&{\mbox{if }}U<p\\0,&{\mbox{if }}U\geq p\end{cases}}}
</math>
Category:Mathematical and quantitative methods (economics)
类别: 数学和定量方法(经济学)
so that
it:Variabile casuale#Distribuzione di probabilità
它: Variabile casuale # distributione di 盖率 à
<noinclude>
<small>This page was moved from [[wikipedia:en:Probability distribution]]. Its edit history can be viewed at [[概率分布/edithistory]]</small></noinclude>
[[Category:待整理页面]]