In the study of graphs and networks, the degree of a node in a network is the number of connections it has to other nodes and the degree distribution is the probability distribution of these degrees over the whole network.

The degree of a node in a network (sometimes referred to incorrectly as the connectivity) is the number of connections or edges the node has to other nodes. If a network is directed, meaning that edges point in one direction from one node to another node, then nodes have two different degrees, the in-degree, which is the number of incoming edges, and the out-degree, which is the number of outgoing edges.

The degree distribution P(k) of a network is then defined to be the fraction of nodes in the network with degree k. Thus if there are n nodes in total in a network and n_k of them have degree k, we have P(k) = n_k/n.
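The definition P(k) = n_k/n can be illustrated with a minimal sketch. The function name `degree_distribution` and the small edge list are hypothetical, chosen only for illustration, and the graph is assumed to be undirected with every node appearing in at least one edge.

```python
from collections import Counter

def degree_distribution(edges):
    """Return {k: P(k)} for an undirected graph given as a list of (u, v) edges."""
    degree = Counter()
    for u, v in edges:
        degree[u] += 1
        degree[v] += 1
    n = len(degree)                     # assumption: isolated nodes do not occur
    counts = Counter(degree.values())   # n_k: number of nodes with degree k
    return {k: n_k / n for k, n_k in sorted(counts.items())}

# Hypothetical 5-node graph: a star centred on node 0 plus one extra edge.
edges = [(0, 1), (0, 2), (0, 3), (0, 4), (1, 2)]
P = degree_distribution(edges)
print(P)
```

In this example two nodes have degree 1, two have degree 2, and one has degree 4, so the values of P(k) sum to one.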

The same information is also sometimes presented in the form of a cumulative degree distribution C, the fraction of nodes with degree smaller than k, or of the complementary cumulative degree distribution 1 − C, the fraction of nodes with degree greater than or equal to k.
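A short sketch of both cumulative forms follows, using the convention stated above (C counts degrees strictly smaller than k). The function name and the example distribution are hypothetical.

```python
def cumulative_distributions(P):
    """Given {k: P(k)}, return (C, CC).

    C[k]  is the fraction of nodes with degree <  k,
    CC[k] is the fraction of nodes with degree >= k, i.e. 1 - C[k].
    """
    C, CC = {}, {}
    running = 0.0
    for k in range(0, max(P) + 1):
        C[k] = running              # probability mass strictly below k
        CC[k] = 1.0 - running       # probability mass at or above k
        running += P.get(k, 0.0)
    return C, CC

P = {1: 0.4, 2: 0.4, 4: 0.2}        # hypothetical degree distribution
C, CC = cumulative_distributions(P)
for k in sorted(C):
    print(k, C[k], CC[k])
```

The complementary form is often the one plotted for heavy-tailed data, because it is non-increasing in k and less noisy in the tail than P(k) itself.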

Observed degree distributions

The degree distribution is very important in studying both real networks, such as the Internet and social networks, and theoretical networks. The simplest network model, the Erdős–Rényi random graph, in which each pair of the n nodes is independently connected (or not) with probability p (or 1 − p), has a binomial distribution of degrees k:

[math]\displaystyle{ P(k) = {n-1\choose k} p^k (1 - p)^{n-1-k}, }[/math]



(or Poisson in the limit of large n, if the average degree [math]\displaystyle{ \langle k\rangle=p(n-1) }[/math] is held fixed). Most networks in the real world, however, have degree distributions very different from this. Most are highly right-skewed, meaning that a large majority of nodes have low degree but a small number, known as "hubs", have high degree. Some networks, notably the Internet, the World Wide Web, and some social networks, have been argued to have degree distributions that approximately follow a power law:

[math]\displaystyle{ P(k)\sim k^{-\gamma} }[/math]

, where γ is a constant. Such networks are called scale-free networks and have attracted particular attention for their structural and dynamical properties. However, some recent studies based on real-world data sets claim that, although most observed networks have fat-tailed degree distributions, they deviate from being scale-free.
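Returning to the random-graph model at the start of this section, the following sketch compares the binomial degree distribution with its Poisson limit while the average degree p(n − 1) is held fixed. The function names, the chosen mean degree of 4, and the network sizes are assumptions made for illustration.

```python
from math import comb, exp, factorial

def binomial_degree_pmf(n, p, k):
    """P(k) for a random graph with n nodes and connection probability p."""
    return comb(n - 1, k) * p**k * (1 - p)**(n - 1 - k)

def poisson_pmf(mean, k):
    """Poisson probability of k with the given mean."""
    return exp(-mean) * mean**k / factorial(k)

mean_degree = 4.0                      # held fixed while n grows
gaps = []
for n in (10, 100, 10_000):
    p = mean_degree / (n - 1)          # so that <k> = p(n - 1) stays constant
    gap = max(
        abs(binomial_degree_pmf(n, p, k) - poisson_pmf(mean_degree, k))
        for k in range(10)
    )
    gaps.append(gap)
    print(n, gap)
```

The largest pointwise difference between the two distributions shrinks as n grows, which is the sense in which the degree distribution becomes Poisson for large n.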

Excess degree distribution

Excess degree distribution is the probability distribution, for a node reached by following an edge, of the number of other edges attached to that node.[6] In other words, it is the distribution of outgoing links from a node reached by following a link.

Suppose a network has a degree distribution [math]\displaystyle{ P(k) }[/math]. If we select one node (randomly or not) and move to one of its neighbors (assuming it has at least one), then the probability that the node reached has [math]\displaystyle{ k }[/math] neighbors is not given by [math]\displaystyle{ P(k) }[/math]. The reason is that, whenever some node is selected in a heterogeneous network, following one of its edges is more likely to lead to a hub, because high-degree nodes have more edges through which they can be reached. The true probability that such a node has [math]\displaystyle{ k }[/math] other edges, besides the one followed, is [math]\displaystyle{ q(k) }[/math], which is called the excess degree distribution. In the configuration model, in which correlations between the nodes are ignored and every node is assumed to be connected to any other node in the network with the same probability, the excess degree distribution can be found as:

[math]\displaystyle{ q(k) = \frac{k+1}{\langle k \rangle}P(k+1), }[/math]



where [math]\displaystyle{ {\langle k \rangle} }[/math] is the mean degree (average degree) of the model. It follows that the average degree of a neighbor of a node is greater than the average degree of a node. In social networks, this means that your friends, on average, have more friends than you. This is known as the friendship paradox. It can be shown that a network can have a giant component if its average excess degree is larger than one:

[math]\displaystyle{ \sum_k kq(k) \gt 1 \Rightarrow {\langle k^2 \rangle}-2{\langle k \rangle}\gt 0 }[/math]



Bear in mind that the last two equations hold only for the configuration model; to derive the excess degree distribution of a real-world network, we should also take degree correlations into account.[6]
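The configuration-model formulas of this section can be sketched in a few lines. The function names and the example degree distribution are hypothetical; the code applies q(k) = (k + 1)P(k + 1)/⟨k⟩ and the giant-component condition ⟨k²⟩ − 2⟨k⟩ > 0 and does not account for degree correlations.

```python
def excess_degree_distribution(P):
    """q(k) = (k + 1) P(k + 1) / <k> for a degree distribution {k: P(k)}."""
    mean_k = sum(k * pk for k, pk in P.items())
    # A node of degree k >= 1 reached along an edge has k - 1 other edges.
    return {k - 1: k * pk / mean_k for k, pk in P.items() if k >= 1}

def has_giant_component(P):
    """Configuration-model criterion <k^2> - 2<k> > 0."""
    mean_k = sum(k * pk for k, pk in P.items())
    mean_k2 = sum(k * k * pk for k, pk in P.items())
    return mean_k2 - 2 * mean_k > 0

P = {1: 0.4, 2: 0.4, 4: 0.2}                      # hypothetical degree distribution
q = excess_degree_distribution(P)
mean_k = sum(k * pk for k, pk in P.items())
mean_excess = sum(k * qk for k, qk in q.items())
mean_neighbor_degree = mean_excess + 1            # add back the edge that was followed

print(q, mean_k, mean_neighbor_degree, has_giant_component(P))
```

For this distribution the mean degree is 2 while the mean degree of a neighbor is 2.6, an instance of the friendship paradox, and the mean excess degree of 1.6 exceeds one, so the criterion is satisfied.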

The Generating Functions Method

Generating functions can be used to calculate different properties of random networks. Given the degree distribution and the excess degree distribution of some network, [math]\displaystyle{ P(k) }[/math] and [math]\displaystyle{ q(k) }[/math] respectively, it is possible to write two power series in the following forms:

[math]\displaystyle{ G_0(x) = \textstyle \sum_{k} \displaystyle P(k)x^k }[/math] and [math]\displaystyle{ G_1(x) = \textstyle \sum_{k} \displaystyle q(k)x^k = \textstyle \sum_{k} \displaystyle \frac{k}{\langle k \rangle}P(k)x^{k-1} }[/math]



[math]\displaystyle{ G_1(x) }[/math] can also be obtained from derivatives of [math]\displaystyle{ G_0(x) }[/math]:

[math]\displaystyle{ G_1(x) = \frac{G'_0(x)}{G'_0(1)} }[/math]



If we know the generating function for a probability distribution [math]\displaystyle{ P(k) }[/math] then we can recover the values of [math]\displaystyle{ P(k) }[/math] by differentiating:

[math]\displaystyle{ P(k) = \frac{1}{k!} {\operatorname{d}^k\!G\over\operatorname{d}\!x^k}\biggl \vert _{x=0} }[/math]



Some properties, e.g. the moments, can be easily calculated from [math]\displaystyle{ G_0(x) }[/math] and its derivatives:

  • [math]\displaystyle{ {\langle k \rangle} = G'_0(1) }[/math]



  • [math]\displaystyle{ {\langle k^2 \rangle} = G''_0(1) + G'_0(1) }[/math]



And in general[6]:

  • [math]\displaystyle{ {\langle k^m \rangle} = \Biggl[{\biggl(x{\operatorname{d}\over\operatorname{d}\!x}\biggr)^m}G_0(x)\Biggr]_{x=1} }[/math]
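The relations above can be checked numerically with a minimal sketch that stores a generating function as its list of power-series coefficients, so that `G0[k]` is P(k). All helper names and the example distribution are hypothetical, and the approach assumes a distribution with finite support so that the series is a polynomial.

```python
from math import factorial

def evaluate(coeffs, x):
    """Evaluate sum_k coeffs[k] * x**k."""
    return sum(c * x**k for k, c in enumerate(coeffs))

def derivative(coeffs):
    """Coefficients of d/dx applied to the polynomial."""
    return [k * c for k, c in enumerate(coeffs)][1:]

def x_d_dx(coeffs):
    """Coefficients of (x d/dx) applied to the polynomial."""
    return [k * c for k, c in enumerate(coeffs)]

def moment(coeffs, m):
    """<k^m> = [(x d/dx)^m G_0(x)] evaluated at x = 1."""
    for _ in range(m):
        coeffs = x_d_dx(coeffs)
    return evaluate(coeffs, 1.0)

def recover_P(coeffs, k):
    """P(k) = (1/k!) d^k G / dx^k evaluated at x = 0."""
    for _ in range(k):
        coeffs = derivative(coeffs)
    return (coeffs[0] if coeffs else 0.0) / factorial(k)

G0 = [0.0, 0.4, 0.4, 0.0, 0.2]                  # hypothetical P(0), ..., P(4)
dG0 = derivative(G0)
mean_k = evaluate(dG0, 1.0)                     # <k> = G_0'(1)
G1 = [c / mean_k for c in dG0]                  # G_1(x) = G_0'(x) / G_0'(1)
second_moment = evaluate(derivative(dG0), 1.0) + mean_k   # G_0''(1) + G_0'(1)

print(mean_k, second_moment, moment(G0, 2), G1)
```

The coefficients of `G1` are the excess degree probabilities q(k), and `moment(G0, 2)` agrees with the value obtained from the second and first derivatives.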



For Poisson-distributed random networks, such as the ER graph, [math]\displaystyle{ G_1(x) = G_0(x) }[/math], which is why the theory of random networks of this type is especially simple. The probability distributions for the 1st and 2nd-nearest neighbors are generated by the functions [math]\displaystyle{ G_0(x) }[/math] and [math]\displaystyle{ G_0(G_1(x)) }[/math]. By extension, the distribution of [math]\displaystyle{ m }[/math]-th neighbors is generated by:


[math]\displaystyle{ G_0\bigl(G_1(...G_1(x)...)\bigr) }[/math], with [math]\displaystyle{ m-1 }[/math] iterations of the function [math]\displaystyle{ G_1 }[/math] acting on itself.

The average number of 1st neighbors, [math]\displaystyle{ c_1 }[/math], is [math]\displaystyle{ {\langle k \rangle} = {dG_0(x)\over dx}|_{x=1} }[/math] and the average number of 2nd neighbors is: [math]\displaystyle{ c_2 = \biggl[ {d\over dx}G_0\big(G_1(x)\big)\biggl]_{x=1} = G_1'(1)G'_0\big(G_1(1)\big) = G_1'(1)G'_0(1) = G''_0(1) }[/math]
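As a consistency check on the statements above, the Poisson case can be worked out explicitly. The sketch assumes a Poisson degree distribution with mean [math]\displaystyle{ \langle k \rangle }[/math], that is [math]\displaystyle{ P(k) = e^{-\langle k \rangle}\langle k \rangle^k/k! }[/math], and uses only the formulas already given:

[math]\displaystyle{ G_0(x) = \sum_{k} e^{-\langle k \rangle}\frac{\langle k \rangle^k}{k!}x^k = e^{\langle k \rangle (x-1)} }[/math]

[math]\displaystyle{ G_1(x) = \frac{G'_0(x)}{G'_0(1)} = \frac{\langle k \rangle e^{\langle k \rangle (x-1)}}{\langle k \rangle} = G_0(x) }[/math]

[math]\displaystyle{ c_2 = G''_0(1) = \langle k \rangle^2 }[/math]

In this case the average number of 2nd neighbors is simply the square of the average number of 1st neighbors.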



Degree distribution for directed networks

In/out degree distribution for Wikipedia's hyperlink graph (logarithmic scales)

In a directed network, each node has some in-degree [math]\displaystyle{ k_{in} }[/math] and some out-degree [math]\displaystyle{ k_{out} }[/math], which are the numbers of links that run into and out of that node, respectively. If [math]\displaystyle{ P(k_{in}, k_{out}) }[/math] is the probability that a randomly chosen node has in-degree [math]\displaystyle{ k_{in} }[/math] and out-degree [math]\displaystyle{ k_{out} }[/math], then the generating function assigned to this joint probability distribution can be written with two variables [math]\displaystyle{ x }[/math] and [math]\displaystyle{ y }[/math] as:

[math]\displaystyle{ \mathcal{G}(x,y) = \sum_{k_{in},k_{out}} \displaystyle P({k_{in},k_{out}})x^{k_{in}}y^{k_{out}} . }[/math]



Since every link in a directed network must leave some node and enter another, the net average number of links entering a node is zero. Therefore,

[math]\displaystyle{ \langle{k_{in}-k_{out}}\rangle =\sum_{k_{in},k_{out}} \displaystyle (k_{in}-k_{out})P({k_{in},k_{out}}) = 0 }[/math],



which implies that the generating function must satisfy:

[math]\displaystyle{ {\partial \mathcal{G}\over\partial x}\vert _{x,y=1} = {\partial \mathcal{G}\over\partial y}\vert _{x,y=1} = c, }[/math]



where [math]\displaystyle{ c }[/math] is the mean degree (both in and out) of the nodes in the network; [math]\displaystyle{ \langle{k_{in}}\rangle = \langle{k_{out}}\rangle = c. }[/math]



Using the function [math]\displaystyle{ \mathcal{G}(x,y) }[/math], we can again find the generating functions for the in/out-degree distribution and the in/out-excess degree distribution, as before. [math]\displaystyle{ G^{in}_0(x) }[/math] can be defined as the generating function for the number of arriving links at a randomly chosen node, and [math]\displaystyle{ G^{in}_1(x) }[/math] as the generating function for the number of arriving links at a node reached by following a randomly chosen link. We can also define generating functions [math]\displaystyle{ G^{out}_0(y) }[/math] and [math]\displaystyle{ G^{out}_1(y) }[/math] for the number leaving such a node:[7]

  • [math]\displaystyle{ G^{in}_0(x) = \mathcal{G}(x,1) }[/math]



  • [math]\displaystyle{ G^{in}_1(x) = \frac{1}{c} {\partial \mathcal{G}\over\partial x}\vert _{y=1} }[/math]



  • [math]\displaystyle{ G^{out}_0(y) = \mathcal{G}(1,y) }[/math]



  • [math]\displaystyle{ G^{out}_1(y) = \frac{1}{c} {\partial \mathcal{G}\over\partial y}\vert _{x=1} }[/math]



Here, the average number of 1st neighbors, [math]\displaystyle{ c }[/math] (previously denoted [math]\displaystyle{ c_1 }[/math]), is [math]\displaystyle{ {\partial \mathcal{G}\over\partial x}\biggl \vert _{x,y=1} = {\partial \mathcal{G}\over\partial y}\biggl \vert _{x,y=1} }[/math], and the average number of 2nd neighbors reachable from a randomly chosen node is given by [math]\displaystyle{ c_2 = G_1'(1)G'_0(1) ={\partial^2 \mathcal{G}\over\partial x\partial y}\biggl \vert _{x,y=1} }[/math]. These are also the numbers of 1st and 2nd neighbors from which a random node can be reached, since these equations are manifestly symmetric in [math]\displaystyle{ x }[/math] and [math]\displaystyle{ y }[/math].
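The directed-network quantities can be illustrated with a small sketch that builds the joint distribution P(k_in, k_out) from an edge list and evaluates the derivatives of the joint generating function at x = y = 1 as moments. The function names and the four-node example graph are hypothetical.

```python
from collections import Counter

def joint_degree_distribution(edges):
    """Return {(k_in, k_out): probability} for a list of (source, target) edges."""
    k_in, k_out = Counter(), Counter()
    nodes = set()
    for source, target in edges:
        k_out[source] += 1
        k_in[target] += 1
        nodes.update((source, target))
    counts = Counter((k_in[v], k_out[v]) for v in nodes)
    return {pair: c / len(nodes) for pair, c in counts.items()}

def joint_generating_function(P, x, y):
    """Evaluate the sum of P(k_in, k_out) * x**k_in * y**k_out."""
    return sum(p * x**kin * y**kout for (kin, kout), p in P.items())

def mixed_moment(P, a, b):
    """<k_in^a k_out^b>: (1, 0) and (0, 1) give the first partial derivatives
    of the generating function at (1, 1); (1, 1) gives the mixed second one."""
    return sum(p * kin**a * kout**b for (kin, kout), p in P.items())

edges = [(0, 1), (0, 2), (1, 2), (2, 0), (3, 0)]   # hypothetical directed graph
P = joint_degree_distribution(edges)
mean_in = mixed_moment(P, 1, 0)
mean_out = mixed_moment(P, 0, 1)
c2 = mixed_moment(P, 1, 1)

print(mean_in, mean_out, c2)
```

Both means equal the number of edges divided by the number of nodes (here 5/4), as required by the constraint that every link leaves one node and enters another.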



  1. Barabási, Albert-László; Albert, Réka (1999-10-15). "Emergence of Scaling in Random Networks". Science. 286 (5439): 509–512. arXiv:cond-mat/9910332. Bibcode:1999Sci...286..509B. doi:10.1126/science.286.5439.509. ISSN 0036-8075. PMID 10521342.
  2. Albert, Réka; Barabási, Albert-László (2000-12-11). "Topology of Evolving Networks: Local Events and Universality" (PDF). Physical Review Letters. 85 (24): 5234–5237. arXiv:cond-mat/0005085. Bibcode:2000PhRvL..85.5234A. doi:10.1103/physrevlett.85.5234. hdl:2047/d20000695. ISSN 0031-9007. PMID 11102229.
  3. Dorogovtsev, S. N.; Mendes, J. F. F.; Samukhin, A. N. (2001-05-21). "Size-dependent degree distribution of a scale-free growing network". Physical Review E. 63 (6): 062101. arXiv:cond-mat/0011115. Bibcode:2001PhRvE..63f2101D. doi:10.1103/physreve.63.062101. ISSN 1063-651X. PMID 11415146.
  4. Pachon, Angelica; Sacerdote, Laura; Yang, Shuyi (2018). "Scale-free behavior of networks with the copresence of preferential and uniform attachment rules". Physica D: Nonlinear Phenomena. 371: 1–12. arXiv:1704.08597. Bibcode:2018PhyD..371....1P. doi:10.1016/j.physd.2018.01.005.
  5. Holme, Petter (2019-03-04). "Rare and everywhere: Perspectives on scale-free networks". Nature Communications. 10 (1): 1016. Bibcode:2019NatCo..10.1016H. doi:10.1038/s41467-019-09038-8. ISSN 2041-1723. PMC 6399274. PMID 30833568.
  6. Newman, Mark (2018-10-18). Networks. 1. Oxford University Press. doi:10.1093/oso/9780198805090.001.0001. ISBN 978-0-19-880509-0. http://www.oxfordscholarship.com/view/10.1093/oso/9780198805090.001.0001/oso-9780198805090.
  7. Newman, M. E. J.; Strogatz, S. H.; Watts, D. J. (2001-07-24). "Random graphs with arbitrary degree distributions and their applications". Physical Review E. 64 (2): 026118. doi:10.1103/PhysRevE.64.026118. ISSN 1063-651X.

