更改

重尾分布 (查看源代码)

2021年3月26日 (五) 14:04的版本

添加40字节、 2021年3月26日 (五) 14:04

无编辑摘要

第16行：第16行：

在使用“重尾” Heavy-tailed一词时仍存在一些歧义。于是就出现了另外两种定义。

−

~~有一些作者使用该术语来指代并非所有幂矩都是有限的那些分布，也有一些作者使用这个术语来指代那些没有有限方差的分布。~~

+

有一些作者使用该术语来指代并非所有阶矩都是有限的那些分布，也有一些作者使用这个术语来指代那些没有有限方差的分布。

−

~~在这里，给出的是最常用的定义，包括替代定义所涵盖的所有分布，以及具有所有幂矩但通常被认为是重尾分布的~~对数正态分布 long-normal distributions 。（有时“重尾”用于任何具有比正态分布更重的尾巴的分布。）

+

在这里，给出的是最常用的定义，包括现有定义所涵盖的所有分布，以及具有所有幂矩但通常被认为是重尾分布的对数正态分布 long-normal distributions 。（有时“重尾”用于任何具有比正态分布更重的尾巴的分布。）

第27行：第27行： −

如果''<math>X</math>''~~的矩生成函数~~, ''MX''(''<math>t</math>'')对于所有''<math>t</math>'' > 0都是无限的，则具有分布函数''<math>F</math>''的随机变量''<math>X</math>''的分布被称为重尾（右）。<ref name="ReferenceA">Rolski, Schmidli, Scmidt, Teugels, ''Stochastic Processes for Insurance and Finance'', 1999</ref>

+

如果''<math>X</math>''的矩母函数, ''MX''(''<math>t</math>'')对于所有''<math>t</math>'' > 0都是无限的，则具有分布函数''<math>F</math>''的随机变量''<math>X</math>''的分布被称为重尾（右）。<ref name="ReferenceA">Rolski, Schmidli, Scmidt, Teugels, ''Stochastic Processes for Insurance and Finance'', 1999</ref>

第65行：第65行： −

如果对于所有<math>''t''>0</math>，则称具有分布函数<math>''F''</math>的随机变量<math>''X''</math>的分布为有较长的右尾，

+

如果对于所有<math>t>0</math>，则称具有分布函数<math>F</math>的随机变量<math>X</math>的分布为有较长的右尾，

:<math>

第83行：第83行： −

~~对于右尾长尾分布量具有直观的解释，即如果长尾量超过某个高水平，则概率将接近1，它将超过其他更高的水平。~~

+

对于右尾长尾分布的随机变量有一个直观的解释，即在长尾分布随机变量尾部取值已超过某个高水平的条件下，它将超过其他更高水平的概率接近于1。

第91行：第91行： −

次指数性是根据概率分布的卷积 Convolution 定义的。对于具有共同分布函数<math>F</math>~~的两个独立且分布均匀的随机变量~~<math> X_1,X_2</math>，<math>F</math>与自身的卷积，<math>F^{*2}</math>是卷积的平方，使用Lebesgue–Stieltjes积分，方法如下：

+

次指数性是根据概率分布的卷积 Convolution 定义的。对于具有共同分布函数<math>F</math>的两个独立同分布的随机变量<math> X_1,X_2</math>，<math>F</math>与自身的卷积，<math>F^{*2}</math>是卷积的平方，使用Lebesgue–Stieltjes积分，方法如下：

第99行：第99行： −

''n''~~倍卷积~~<math>F^{*n}</math>定义如下：

+

''n''重卷积<math>F^{*n}</math>定义如下：

第111行：第111行： −

~~如果满足以下条件，则正半线上的分布~~<math>F</math>~~为次指数~~<ref name="Asmussen"/><ref>{{Cite web|url=https://www.researchgate.net/publication/242637603_A_Theorem_on_Sums_of_Independent_Positive_Random_Variables_and_Its_Applications_to_Branching_Random_Processes|title=A Theorem on Sums of Independent Positive Random Variables and Its Applications to Branching Random Processes|last=Chistyakov|first=V. P.|date=1964|website=ResearchGate|language=en|archive-url=|archive-date=|access-date=April 7, 2019}}</ref><ref>{{Cite web|url=https://projecteuclid.org/download/pdf_1/euclid.aop/1176996225|title=The Class of Subexponential Distributions|last=Teugels|first=Jozef L.|authorlink=|date=1975|website=|publisher=Annals of Probability|publication-place=[[KU Leuven|University of Louvain]]|archive-url=|archive-date=|access-date=April 7, 2019}}</ref>

+

如果满足以下条件，分布<math>F</math>在正半轴上是为次指数的<ref name="Asmussen"/><ref>{{Cite web|url=https://www.researchgate.net/publication/242637603_A_Theorem_on_Sums_of_Independent_Positive_Random_Variables_and_Its_Applications_to_Branching_Random_Processes|title=A Theorem on Sums of Independent Positive Random Variables and Its Applications to Branching Random Processes|last=Chistyakov|first=V. P.|date=1964|website=ResearchGate|language=en|archive-url=|archive-date=|access-date=April 7, 2019}}</ref><ref>{{Cite web|url=https://projecteuclid.org/download/pdf_1/euclid.aop/1176996225|title=The Class of Subexponential Distributions|last=Teugels|first=Jozef L.|authorlink=|date=1975|website=|publisher=Annals of Probability|publication-place=[[KU Leuven|University of Louvain]]|archive-url=|archive-date=|access-date=April 7, 2019}}</ref>

第138行：第138行： −

如果分布<math>F I([0,\infty))</math>为实数，则<math>F</math>为整个实数上的次指数分布。<ref>{{cite journal | last = Willekens | first = E. | title = Subexponentiality on the real line | journal = Technical Report | publisher = K.U. Leuven | year = 1986}}</ref>此时<math>I([0,\infty))</math>~~是正半轴的指标函数。或者，当且仅当~~<math>X^+ = \max(0,X)</math>~~是次指数时，实数上支持的随机变量~~<math>X</math>~~才是次指数。~~

+

如果分布<math>F I([0,\infty))</math>为实数，则<math>F</math>为整个实数上的次指数分布。<ref>{{cite journal | last = Willekens | first = E. | title = Subexponentiality on the real line | journal = Technical Report | publisher = K.U. Leuven | year = 1986}}</ref>此时<math>I([0,\infty))</math>是正半轴的示性函数。或者，当且仅当<math>X^+ = \max(0,X)</math>是次指数时，支撑集为实数轴的随机变量<math>X</math>是次指数的。

第160行：第160行：

双尾的包括：

* 柯西分布 Cauchy distribution本身就是稳定分布和t分布的特例；

−

* 稳定分布族 The family of stable distributions<ref>{{cite web |author=John P. Nolan | title=Stable Distributions: Models for Heavy Tailed Data| year=2009 | url=http://academic2.american.edu/~jpnolan/stable/chap1.pdf | accessdate=2009-02-21}}</ref>，但该族中正态分布的特殊情况除外。一些稳定的分布是单面的（或有半线的支持），例如莱维分布。另请参见具有长尾分布和波动性聚类的财务模型。

+

* 稳定分布族 The family of stable distributions<ref>{{cite web |author=John P. Nolan | title=Stable Distributions: Models for Heavy Tailed Data| year=2009 | url=http://academic2.american.edu/~jpnolan/stable/chap1.pdf | accessdate=2009-02-21}}</ref>，但该族中正态分布的特殊情况除外。一些稳定的分布是单面的（或以是半轴为支持），例如莱维分布。另请参见具有长尾分布和波动性聚类的财务模型。

* t分布

*偏对数正态级联分布 The skew lognormal cascade distribution。<ref>{{cite web | author=Stephen Lihn | title=Skew Lognormal Cascade Distribution | year=2009 | url=http://www.skew-lognormal-cascade-distribution.org/ | access-date=2009-06-12 | archive-url=https://web.archive.org/web/20140407075213/http://www.skew-lognormal-cascade-distribution.org/ | archive-date=2014-04-07 | url-status=dead }}</ref>

第166行：第166行：

==与胖尾分布的关系 ==

−

胖尾分布是指对于较大的<math>x</math>~~，概率密度函数为~~<math>x^{-a}</math>~~趋于零。由于这样的幂总是受到指数分布概率密度函数的限制，因此，胖尾分布始终是重尾分布。~~

+

胖尾分布是指对于较大的<math>x</math>，以幂律的速度<math>x^{-a}</math>趋向于0。由于这样的幂总是受到指数分布概率密度函数的限制，因此，胖尾分布始终是重尾分布。

但是，某些分布的尾部趋近于零的速率比指数函数慢（表示它们是重尾），而比幂快（表示它们不是胖尾）。例如对数正态分布<ref>Stephen Lihn (2009). "Skew Lognormal Cascade Distribution". Archived from the original on 2014-04-07. Retrieved 2009-06-12.</ref>。当然，许多其他的重尾分布，例如对数逻辑分布和帕累托分布也属于胖尾分布。

第199行：第199行： −

其中<math>X_{(n-k(n)+1,n)}=\max \left(X_{n-k(n)+1},\ldots ,X_{n}\right)</math>。 ~~此估计量的概率收敛到~~<math>\xi</math>。

+

其中<math>X_{(n-k(n)+1,n)}=\max \left(X_{n-k(n)+1},\ldots ,X_{n}\right)</math>。此估计量的概率依概率收敛到<math>\xi</math>。

第206行：第206行： −

令<math>(X_t , t \geq 1)</math>为具有分布函数<math>F \in D(H(\xi))</math>~~独立且均匀分布的随机变量序列，其分布函数为广义极值分布~~<math> H </math>的最大吸引域，其中<math>\xi \in \mathbb{R}</math>。样本路径为<math>{X_t: 1 \leq t \leq n}</math>，其中<math>n</math>为样本大小。如果<math>\{k(n)\}</math>是中间阶数序列，即<math>k(n) \in \{1,\ldots,n-1\}, </math>，<math>k(n) \to \infty</math>和<math>k(n)/n \to 0</math>，则Hill尾指数估计器为<ref>Hill B.M. (1975) A simple general approach to inference about the tail of a distribution. Ann. Stat., v. 3, 1163–1174.</ref>：

+

令<math>(X_t , t \geq 1)</math>为具有分布函数<math>F \in D(H(\xi))</math>独立同分布的随机变量序列，其分布函数为广义极值分布<math> H </math>的最大吸引域，其中<math>\xi \in \mathbb{R}</math>。样本路径为<math>{X_t: 1 \leq t \leq n}</math>，其中<math>n</math>为样本大小。如果<math>\{k(n)\}</math>是中间阶数序列，即<math>k(n) \in \{1,\ldots,n-1\}, </math>，<math>k(n) \to \infty</math>和<math>k(n)/n \to 0</math>，则Hill尾指数估计器为<ref>Hill B.M. (1975) A simple general approach to inference about the tail of a distribution. Ann. Stat., v. 3, 1163–1174.</ref>：

思无涯咿呀咿呀

管理员

2,443

个编辑

更改

重尾分布 (查看源代码)

2021年3月26日 (五) 14:04的版本

导航菜单

搜索