Soft configuration model

Random graph model in applied mathematics

Theory

Network types

Features
Clique Component Cut Cycle Data structure Edge Loop Neighborhood Path Vertex Adjacency list / matrix Incidence list / matrix
Types
Bipartite Complete Directed Hyper Labeled Multi Random Weighted

Models

Topology
Random graph Erdős–Rényi Barabási–Albert Bianconi–Barabási Fitness model Watts–Strogatz Exponential random (ERGM) Random geometric (RGG) Hyperbolic (HGN) Hierarchical Stochastic block Blockmodeling Maximum entropy Soft configuration LFR Benchmark
Dynamics
Boolean network agent based Epidemic/SIR

Lists
Categories

v
t
e

In applied mathematics, the soft configuration model (SCM) is a random graph model subject to the principle of maximum entropy under constraints on the expectation of the degree sequence of sampled graphs.^[1] Whereas the configuration model (CM) uniformly samples random graphs of a specific degree sequence, the SCM only retains the specified degree sequence on average over all network realizations; in this sense the SCM has very relaxed constraints relative to those of the CM ("soft" rather than "sharp" constraints^[2]). The SCM for graphs of size $n$ {\displaystyle n} has a nonzero probability of sampling any graph of size $n$ {\displaystyle n}, whereas the CM is restricted to only graphs having precisely the prescribed connectivity structure.

Model formulation

[edit ]

The SCM is a statistical ensemble of random graphs $G$ {\displaystyle G} having $n$ {\displaystyle n} vertices ( $n=|V(G)|$ {\displaystyle n=|V(G)|}) labeled $\{v_{j}\}_{j=1}^{n}=V(G)$ {\displaystyle \{v_{j}\}_{j=1}^{n}=V(G)}, producing a probability distribution on ${\mathcal {G}}_{n}$ {\displaystyle {\mathcal {G}}_{n}} (the set of graphs of size $n$ {\displaystyle n}). Imposed on the ensemble are $n$ {\displaystyle n} constraints, namely that the ensemble average of the degree $k_{j}$ {\displaystyle k_{j}} of vertex $v_{j}$ {\displaystyle v_{j}} is equal to a designated value ${\widehat {k}}_{j}$ {\displaystyle {\widehat {k}}_{j}}, for all $v_{j}\in V(G)$ {\displaystyle v_{j}\in V(G)}. The model is fully parameterized by its size $n$ {\displaystyle n} and expected degree sequence $\{{\widehat {k}}_{j}\}_{j=1}^{n}$ {\displaystyle \{{\widehat {k}}_{j}\}_{j=1}^{n}}. These constraints are both local (one constraint associated with each vertex) and soft (constraints on the ensemble average of certain observable quantities), and thus yields a canonical ensemble with an extensive number of constraints.^[2] The conditions $\langle k_{j}\rangle ={\widehat {k}}_{j}$ {\displaystyle \langle k_{j}\rangle ={\widehat {k}}_{j}} are imposed on the ensemble by the method of Lagrange multipliers (see Maximum-entropy random graph model).

Derivation of the probability distribution

[edit ]

The probability $\mathbb {P} _{\text{SCM}}(G)$ {\displaystyle \mathbb {P} _{\text{SCM}}(G)} of the SCM producing a graph $G$ {\displaystyle G} is determined by maximizing the Gibbs entropy $S[G]$ {\displaystyle S[G]} subject to constraints $\langle k_{j}\rangle ={\widehat {k}}_{j},\ j=1,\ldots ,n$ {\displaystyle \langle k_{j}\rangle ={\widehat {k}}_{j},\ j=1,\ldots ,n} and normalization $\sum _{G\in {\mathcal {G}}_{n}}\mathbb {P} _{\text{SCM}}(G)=1$ {\displaystyle \sum _{G\in {\mathcal {G}}_{n}}\mathbb {P} _{\text{SCM}}(G)=1}. This amounts to optimizing the multi-constraint Lagrange function below:

{\begin{aligned}&{\mathcal {L}}\left(\alpha ,\{\psi _{j}\}_{j=1}^{n}\right)\\[6pt]={}&-\sum _{G\in {\mathcal {G}}_{n}}\mathbb {P} _{\text{SCM}}(G)\log \mathbb {P} _{\text{SCM}}(G)+\alpha \left(1-\sum _{G\in {\mathcal {G}}_{n}}\mathbb {P} _{\text{SCM}}(G)\right)+\sum _{j=1}^{n}\psi _{j}\left({\widehat {k}}_{j}-\sum _{G\in {\mathcal {G}}_{n}}\mathbb {P} _{\text{SCM}}(G)k_{j}(G)\right),\end{aligned}}

{\displaystyle {\begin{aligned}&{\mathcal {L}}\left(\alpha ,\{\psi _{j}\}_{j=1}^{n}\right)\\[6pt]={}&-\sum _{G\in {\mathcal {G}}_{n}}\mathbb {P} _{\text{SCM}}(G)\log \mathbb {P} _{\text{SCM}}(G)+\alpha \left(1-\sum _{G\in {\mathcal {G}}_{n}}\mathbb {P} _{\text{SCM}}(G)\right)+\sum _{j=1}^{n}\psi _{j}\left({\widehat {k}}_{j}-\sum _{G\in {\mathcal {G}}_{n}}\mathbb {P} _{\text{SCM}}(G)k_{j}(G)\right),\end{aligned}}}

where $\alpha$ {\displaystyle \alpha } and $\{\psi _{j}\}_{j=1}^{n}$ {\displaystyle \{\psi _{j}\}_{j=1}^{n}} are the $n+1$ {\displaystyle n+1} multipliers to be fixed by the $n+1$ {\displaystyle n+1} constraints (normalization and the expected degree sequence). Setting to zero the derivative of the above with respect to $\mathbb {P} _{\text{SCM}}(G)$ {\displaystyle \mathbb {P} _{\text{SCM}}(G)} for an arbitrary $G\in {\mathcal {G}}_{n}$ {\displaystyle G\in {\mathcal {G}}_{n}} yields

0={\frac {\partial {\mathcal {L}}\left(\alpha ,\{\psi _{j}\}_{j=1}^{n}\right)}{\partial \mathbb {P} _{\text{SCM}}(G)}}=-\log \mathbb {P} _{\text{SCM}}(G)-1-\alpha -\sum _{j=1}^{n}\psi _{j}k_{j}(G)\ \Rightarrow \ \mathbb {P} _{\text{SCM}}(G)={\frac {1}{Z}}\exp \left[-\sum _{j=1}^{n}\psi _{j}k_{j}(G)\right],

{\displaystyle 0={\frac {\partial {\mathcal {L}}\left(\alpha ,\{\psi _{j}\}_{j=1}^{n}\right)}{\partial \mathbb {P} _{\text{SCM}}(G)}}=-\log \mathbb {P} _{\text{SCM}}(G)-1-\alpha -\sum _{j=1}^{n}\psi _{j}k_{j}(G)\ \Rightarrow \ \mathbb {P} _{\text{SCM}}(G)={\frac {1}{Z}}\exp \left[-\sum _{j=1}^{n}\psi _{j}k_{j}(G)\right],}

the constant $Z:=e^{\alpha +1}=\sum _{G\in {\mathcal {G}}_{n}}\exp \left[-\sum _{j=1}^{n}\psi _{j}k_{j}(G)\right]=\prod _{1\leq i<j\leq n}\left(1+e^{-(\psi _{i}+\psi _{j})}\right)$ {\displaystyle Z:=e^{\alpha +1}=\sum _{G\in {\mathcal {G}}_{n}}\exp \left[-\sum _{j=1}^{n}\psi _{j}k_{j}(G)\right]=\prod _{1\leq i<j\leq n}\left(1+e^{-(\psi _{i}+\psi _{j})}\right)}^[3] being the partition function normalizing the distribution; the above exponential expression applies to all $G\in {\mathcal {G}}_{n}$ {\displaystyle G\in {\mathcal {G}}_{n}}, and thus is the probability distribution. Hence we have an exponential family parameterized by $\{\psi _{j}\}_{j=1}^{n}$ {\displaystyle \{\psi _{j}\}_{j=1}^{n}}, which are related to the expected degree sequence $\{{\widehat {k}}_{j}\}_{j=1}^{n}$ {\displaystyle \{{\widehat {k}}_{j}\}_{j=1}^{n}} by the following equivalent expressions:

\langle k_{q}\rangle =\sum _{G\in {\mathcal {G}}_{n}}k_{q}(G)\mathbb {P} _{\text{SCM}}(G)=-{\frac {\partial \log Z}{\partial \psi _{q}}}=\sum _{j\neq q}{\frac {1}{e^{\psi _{q}+\psi _{j}}+1}}={\widehat {k}}_{q},\ q=1,\ldots ,n.

{\displaystyle \langle k_{q}\rangle =\sum _{G\in {\mathcal {G}}_{n}}k_{q}(G)\mathbb {P} _{\text{SCM}}(G)=-{\frac {\partial \log Z}{\partial \psi _{q}}}=\sum _{j\neq q}{\frac {1}{e^{\psi _{q}+\psi _{j}}+1}}={\widehat {k}}_{q},\ q=1,\ldots ,n.}

References

[edit ]

^ van der Hoorn, Pim; Gabor Lippner; Dmitri Krioukov (2017年10月10日). "Sparse Maximum-Entropy Random Graphs with a Given Power-Law Degree Distribution". arXiv:1705.10261 .
^ ^a ^b Garlaschelli, Diego; Frank den Hollander; Andrea Roccaverde (January 30, 2018). "Coviariance structure behind breaking of ensemble equivalence in random graphs" (PDF). Archived (PDF) from the original on February 4, 2023. Retrieved September 14, 2018.
^ Park, Juyong; M.E.J. Newman (2004年05月25日). "The statistical mechanics of networks". arXiv:cond-mat/0405566 .

Retrieved from "https://en.wikipedia.org/w/index.php?title=Soft_configuration_model&oldid=1195898888"