2021年9月9日 (四) 16:53的版本

此词条暂由彩云小译翻译，翻译字数共974，未经人工整理和审校，带来阅读不便，请见谅。

西尔韦斯特方程是控制论中的矩阵方程，形式如下^[1]:

[math]\displaystyle{ A X + X B = C. }[/math]

其中A，B，C为已知的矩阵，问题是要找到能够满足方程的矩阵X。所有矩阵的系数都是复数。为了使方程有意义，矩阵的行和列需要满足一定条件，A 和 B 都要是方阵，大小分别是n和m，而X和C要是n行m列的矩阵，n和m也可以相等，四个矩阵都是大小相同的方阵。

当且仅当 A 和-b 没有共同的本征值时，西尔韦斯特有唯一解 X 。更一般地，方程 AX + XB = c 也可以视为(可能无限维中)巴拿赫空间中有界算子的方程。在这种情况下，此情形下，有唯一解 X 的充分必要条件几乎相同: A 和-B 的谱不相交^[2]。

解的存在及唯一

Using the Kronecker product notation and the vectorization operator , we can rewrite Sylvester's equation in the form

where [math]\displaystyle{ A }[/math] is of dimension [math]\displaystyle{ n\! \times\! n }[/math], [math]\displaystyle{ B }[/math] is of dimension [math]\displaystyle{ m\!\times\!m }[/math], [math]\displaystyle{ X }[/math] of dimension [math]\displaystyle{ n\!\times\!m }[/math] and [math]\displaystyle{ I_k }[/math] is the [math]\displaystyle{ k \times k }[/math] identity matrix. In this form, the equation can be seen as a linear system of dimension [math]\displaystyle{ mn \times mn }[/math].

利用克罗内克积符号和向量化算子操作符[math]\displaystyle{ \operatorname{vec} }[/math]，我们可以以下形式重写韦斯特方程:

[math]\displaystyle{ (I_m \otimes A + B^T \otimes I_n) \operatorname{vec}X = \operatorname{vec}C, }[/math]

其中 A 是 n x m的矩阵，B 是维数 m x m的矩阵，X 是n x m的矩阵，[math]\displaystyle{ I_k }[/math] 是 [math]\displaystyle{ k \times k }[/math]的单位矩阵。在这种形式下，该方程可以看作是一个大小为[math]\displaystyle{ mn \times mn }[/math]的线性系统^[3]。然而，不建议为了数值解重写这种形式的方程，因为这个版本计算代价较高，并且存在病态。

定理。给定矩阵 [math]\displaystyle{ A\in \mathbb{C}^{n\times n} }[/math]和[math]\displaystyle{ B\in \mathbb{C}^{m\times m} }[/math]，韦斯特方程 [math]\displaystyle{ AX+XB=C }[/math]对任意 [math]\displaystyle{ C\in\mathbb{C}^{n\times m} }[/math]有唯一解 X 当且仅当 A 和-B 不共享任何特征值。

证明。方程[math]\displaystyle{ AX+XB=C }[/math]是一个m和n未知的线性系统，涉及和未知数等量的方程。因此，对于任意给定的 C，它是唯一可解的当且仅当齐次方程 [math]\displaystyle{ AX+XB=0 }[/math]仅存在平凡解[math]\displaystyle{ 0 }[/math]。

(i)假设 A 和-B 不共享任何特征值。设 X 是上述齐次方程的解。于是有 [math]\displaystyle{ AX=X(-B) }[/math]，它可以被数学归纳法提升到 [math]\displaystyle{ A^kX = X(-B)^k }[/math] 对任意 [math]\displaystyle{ k \ge 0 }[/math]都成立。因此，对于任意多项式 p，[math]\displaystyle{ p(A) X = X p(-B) }[/math]，特别地，设 [math]\displaystyle{ p }[/math] 是 A 的特征多项式。那么由 Cayley-Hamilton 定理可得[math]\displaystyle{ p(A)=0 }[/math]。同时谱映射定理告诉我们[math]\displaystyle{ \sigma(p(-B)) = p(\sigma(-B)), }[/math]，其中 [math]\displaystyle{ \sigma(\cdot) }[/math]表示矩阵的谱。由于 A 和-B 不共享任何特征值，[math]\displaystyle{ p(\sigma(-B)) }[/math]不包含零，因此 [math]\displaystyle{ p(-B) }[/math]是非奇异的。从而得到[math]\displaystyle{ X= 0 }[/math]。这证明了定理的“当”部分。

(ii)现在假设 A 和-B 有一个相同的特征值 λ。设 u 是 A 的对应右特征向量，v 是-B 的对应左特征向量，并且 [math]\displaystyle{ X=u{v}^* }[/math]。于是[math]\displaystyle{ X\neq 0 }[/math]，而[math]\displaystyle{ AX+XB = A(uv^*)-(uv^*)(-B) = \lambda uv^*-\lambda uv^* = 0. }[/math]。因此，X 是上述齐次方程的非平凡解。这证明了定理的“仅当”部分得证。

Q.E.D.

作为谱映射定理的另一种形式，证明的第(i)部分[math]\displaystyle{ \mathrm{Im}(uv^*) }[/math]的非奇异性也可以用贝祖特的互素多项式恒等式来证明。设 q 是-B 的特征多项式。由于 A 和-B 不共享任何特征值，所以 p 和 q 互素。因此存在多项式 f 和 g，使得 [math]\displaystyle{ p(z)f(z)+q(z)g(z)\equiv 1 }[/math]。利用 Cayley-Hamilton 定理，可得[math]\displaystyle{ q(-B)=0 }[/math]，这意味着 [math]\displaystyle{ p(-B) }[/math]是非奇异的。

这个定理对实矩阵仍然成立，但需要注意的是考虑它们的复特征值。充分条件的证明仍然适用; 对于必要条件的证明，请注意 [math]\displaystyle{ \mathrm{Re}(uv^*) }[/math]和 [math]\displaystyle{ \mathrm{Im}(uv^*) }[/math] 都满足齐次方程 AX + XB = 0，而它们不能同时为零。

Roth消去法则

给定两个大小分别为n 和 m 的复方阵 A 和 B，以及大小为 n × m 的矩阵 C，我们可以确认下列两个大小为 n + m 的方阵\begin{bmatrix} A & C \\ 0 & B \end{bmatrix}和\begin{bmatrix} A & 0 \\0&B \end{bmatrix}是否相似。当存在矩阵 X 使得 AX-XB = C ，这两个矩阵就是相似的。换句话说，X 是维斯特方程的解。这就是众所周知的Roth消去法则^[4]。

一种简便的检查方式如下：如果AX-XB = C，那么：

[math]\displaystyle{ \begin{bmatrix}I_n & X \\ 0 & I_m \end{bmatrix} \begin{bmatrix} A&C\\0&B \end{bmatrix} \begin{bmatrix} I_n & -X \\ 0& I_m \end{bmatrix} = \begin{bmatrix} A&0\\0&B \end{bmatrix}. }[/math]

Roth消去法则无法一般化到巴拿赫空间中的无穷维有界算子中。^[5]

数值解

韦斯特方程数值解的一个经典算法是 Bartels-Stewart 算法，该算法通过 QR 算法将矩阵A 和 B 转化为舒尔形式，然后通过逆向取代法求解三角矩阵。在LAPACK，或是GNU Octave的lyap函数中，该算法的计算代价是[math]\displaystyle{ \mathcal{O}(n^3) }[/math]^[6]。也可以参阅该语言中的 sylvester 函数。自 GNU Octave Version 4.0以来，不推荐使用syl 命令。^[7]^[8]在某些特定的图像处理应用程序中，导出的韦斯特方程有会有解析解^[9]。

Notes

↑ This equation is also commonly written in the equivalent form of AX − XB = C.
↑ Bhatia and Rosenthal, 1997
↑ However, rewriting the equation in this form is not advised for the numerical solution since this version is costly to solve and can be ill-conditioned.
↑ Gerrish, F; Ward, A.G.B (Nov 1998). "Sylvester's matrix equation and Roth's removal rule". The Mathematical Gazette. 82 (495): 423–430. doi:10.2307/3619888. JSTOR 3619888.
↑ Bhatia and Rosenthal, p.3
↑ "Function Reference: Lyap".
↑ "Functions of a Matrix (GNU Octave (version 4.4.1))".
↑ The syl command is deprecated since GNU Octave Version 4.0
↑ Wei, Q.; Dobigeon, N.; Tourneret, J.-Y. (2015). "Fast Fusion of Multi-Band Images Based on Solving a Sylvester Equation". IEEE. 24 (11): 4109–4121. arXiv:1502.03121. Bibcode:2015ITIP...24.4109W. doi:10.1109/TIP.2015.2458572. PMID 26208345.

References

Sylvester, J. (1884). "Sur l'equations en matrices [math]\displaystyle{ px = xq }[/math]". C. R. Acad. Sci. Paris. 99 (2): 67–71, 115–116.
Bartels, R. H.; Stewart, G. W. (1972). "Solution of the matrix equation [math]\displaystyle{ AX +XB = C }[/math]". Comm. ACM. 15 (9): 820–826. doi:10.1145/361573.361582.
Bhatia, R.; Rosenthal, P. (1997). "How and why to solve the operator equation [math]\displaystyle{ AX -XB = Y }[/math] ?". Bull. London Math. Soc. 29 (1): 1–21. doi:10.1112/S0024609396001828.
Lee, S.-G.; Vu, Q.-P. (2011). "Simultaneous solutions of Sylvester equations and idempotent matrices separating the joint spectrum". Linear Algebra Appl. 435 (9): 2097–2109. doi:10.1016/j.laa.2010.09.034.
Wei, Q.; Dobigeon, N.; Tourneret, J.-Y. (2015). "Fast Fusion of Multi-Band Images Based on Solving a Sylvester Equation". IEEE Transactions on Image Processing. 24 (11): 4109–4121. arXiv:1502.03121. Bibcode:2015ITIP...24.4109W. doi:10.1109/TIP.2015.2458572. PMID 26208345.
Birkhoff and MacLane. A survey of Modern Algebra. Macmillan. pp. 213, 299.

外部链接

Online solver for arbitrary sized matrices.
Mathematica function to solve the Sylvester equation
MATLAB function to solve the Sylvester equation

任意大小矩阵的在线求解器。
Mathematica 函数求解 Sylvester 方程
MATLAB 函数求解 Sylvester 方程

Category:Matrices Category:Control theory

范畴: 矩阵范畴: 控制论

This page was moved from wikipedia:en:Sylvester equation. Its edit history can be viewed at 韦斯特方程/edithistory

[1] This equation is also commonly written in the equivalent form of AX − XB = C.

[2] Bhatia and Rosenthal, 1997

[3] However, rewriting the equation in this form is not advised for the numerical solution since this version is costly to solve and can be ill-conditioned.

[4] Gerrish, F; Ward, A.G.B (Nov 1998). "Sylvester's matrix equation and Roth's removal rule". The Mathematical Gazette. 82 (495): 423–430. doi:10.2307/3619888. JSTOR 3619888.

[5] Bhatia and Rosenthal, p.3

[6] "Function Reference: Lyap".

[7] "Functions of a Matrix (GNU Octave (version 4.4.1))".

[8] The syl command is deprecated since GNU Octave Version 4.0

[9] Wei, Q.; Dobigeon, N.; Tourneret, J.-Y. (2015). "Fast Fusion of Multi-Band Images Based on Solving a Sylvester Equation". IEEE. 24 (11): 4109–4121. arXiv:1502.03121. Bibcode:2015ITIP...24.4109W. doi:10.1109/TIP.2015.2458572. PMID 26208345.

[1]

[2]

[3]

[4]

[5]

[6]

[7]

[8]

[9]

@@ 第9行： / 第9行： @@
 当且仅当 ''A'' 和-''b'' 没有共同的本征值时，西尔韦斯特有唯一解 ''X'' 。更一般地，方程 AX + XB = c 也可以视为(可能无限维中)巴拿赫空间中有界算子的方程。在这种情况下，此情形下，有唯一解 ''X'' 的充分必要条件几乎相同:  ''A'' 和-''B'' 的谱不相交<ref>Bhatia and Rosenthal, 1997</ref>。
-==Existence and uniqueness of the solutions==
+==解的存在及唯一==
-Using the [[Kronecker product]] notation and the [[Vectorization (mathematics)|vectorization operator]] <math>\operatorname{vec}</math>, we can rewrite Sylvester's equation in the form
+Using the [[Kronecker product]] notation and the [[Vectorization (mathematics)|vectorization operator]] , we can rewrite Sylvester's equation in the form
-:<math> (I_m \otimes A +  B^T \otimes I_n) \operatorname{vec}X = \operatorname{vec}C,</math>
-where <math>A</math> is of dimension <math>n\! \times\! n</math>, <math>B</math> is of dimension <math>m\!\times\!m</math>, <math>X</math> of dimension <math>n\!\times\!m</math> and <math>I_k</math> is the <math>k \times k</math> [[identity matrix]]. In this form, the equation can be seen as a [[linear system]] of dimension <math>mn \times mn</math>.<ref>However, rewriting the equation in this form is not advised for the numerical solution since this version is costly to solve and can be [[ill-conditioned]].</ref>
-Using the Kronecker product notation and the vectorization operator \operatorname{vec}, we can rewrite Sylvester's equation in the form
+where <math>A</math> is of dimension <math>n\! \times\! n</math>, <math>B</math> is of dimension <math>m\!\times\!m</math>, <math>X</math> of dimension <math>n\!\times\!m</math> and <math>I_k</math> is the <math>k \times k</math> [[identity matrix]]. In this form, the equation can be seen as a [[linear system]] of dimension <math>mn \times mn</math>.
-: (I_m \otimes A +  B^T \otimes I_n) \operatorname{vec}X = \operatorname{vec}C,
-where A is of dimension n\! \times\! n, B is of dimension m\!\times\!m, X of dimension n\!\times\!m and I_k is the k \times k identity matrix. In this form, the equation can be seen as a linear system of dimension mn \times mn.However, rewriting the equation in this form is not advised for the numerical solution since this version is costly to solve and can be ill-conditioned.
-使用克罗内克积符号和向量运算符操作符{ vec } ，我们可以以下形式重写 Sylvester 的方程: (i m 乘以 a + b ^ t 乘以 i n)操作符名{ vec } x = 操作符名{ vec } c，其中 a 是 n 维数！时代周刊！N，b 是维数 m! 乘以! m，x 是维数 n! 乘以! m，i _ k 是 k 乘以 k 的单位矩阵。在这种形式下，该方程可以看作是一个维数为 mn 乘以 mn 的线性方程组。然而，重写这种形式的方程不建议数值解，因为这个版本是昂贵的解决，可以病态。
-'''Theorem.'''
+利用克罗内克积符号和向量化算子操作符<math>\operatorname{vec}</math>，我们可以以下形式重写韦斯特方程:
-Given matrices <math>A\in \mathbb{C}^{n\times n}</math> and <math>B\in \mathbb{C}^{m\times m}</math>, the Sylvester equation <math>AX+XB=C</math> has a unique solution <math>X\in \mathbb{C}^{n\times m}</math> for any <math>C\in\mathbb{C}^{n\times m}</math> if and only if <math>A</math> and <math>-B</math> do not share any eigenvalue.
+:<math> (I_m \otimes A +  B^T \otimes I_n) \operatorname{vec}X = \operatorname{vec}C,</math>
+其中 ''A'' 是 n x m的矩阵，''B'' 是维数 m x m的矩阵，X 是n x m的矩阵，<math>I_k</math> 是 <math>k \times k</math>的单位矩阵。在这种形式下，该方程可以看作是一个大小为<math>mn \times mn</math>的线性系统<ref>However, rewriting the equation in this form is not advised for the numerical solution since this version is costly to solve and can be [[ill-conditioned]].</ref>。然而，不建议为了数值解重写这种形式的方程，因为这个版本计算代价较高，并且存在病态。
-Theorem.
-Given matrices A\in \mathbb{C}^{n\times n} and B\in \mathbb{C}^{m\times m}, the Sylvester equation AX+XB=C has a unique solution X\in \mathbb{C}^{n\times m} for any C\in\mathbb{C}^{n\times m} if and only if A and -B do not share any eigenvalue.
-定理。给定矩阵 a 在数学上{ c } ^ { n 次 n }和 b 在数学上{ c } ^ { m 次 m } ，Sylvester 方程 ax + xb = c 对任意 c 在数学上{ c } ^ { n 次 m }有唯一解 x 当且仅当 a 和-b 不共享任何特征值。
-'''Proof.'''
-The equation <math>AX+XB=C</math> is a linear system with <math>mn</math> unknowns and the same amount of equations. Hence it is uniquely solvable for any given <math>C</math> if and only if the homogeneous equation
-<math>
-AX+XB=0
-</math>
-admits only the trivial solution <math>0</math>.
-Proof.
+'''定理'''。给定矩阵 <math>A\in \mathbb{C}^{n\times n}</math>和<math>B\in \mathbb{C}^{m\times m}</math>，韦斯特方程 <math>AX+XB=C</math>对任意  <math>C\in\mathbb{C}^{n\times m}</math>有唯一解 ''X'' 当且仅当 ''A'' 和''-B'' 不共享任何特征值。
-The equation AX+XB=C is a linear system with mn unknowns and the same amount of equations. Hence it is uniquely solvable for any given C if and only if the homogeneous equation
+'''证明'''。方程<math>AX+XB=C</math>是一个m和n未知的线性系统，涉及和未知数等量的方程。因此，对于任意给定的 C，它是唯一可解的当且仅当齐次方程 <math>
 AX+XB=0
+</math>仅存在平凡解<math>0</math>。
-admits only the trivial solution 0.
+(i)假设 ''A'' 和-''B'' 不共享任何特征值。设 ''X'' 是上述齐次方程的解。于是有 <math>AX=X(-B)</math>，它可以被数学归纳法提升到 <math>
-证据。方程 ax + xb = c 是一个线性方程组，其中有许多未知数和等量的方程。因此，对于任意给定的 c，它是唯一可解的当且仅当齐次方程 ax + xb = 0仅存在平凡解0。
-(i) Assume that <math>A</math> and <math>-B</math> do not share any eigenvalue. Let <math>X</math> be a solution to the abovementioned homogeneous equation. Then <math>AX=X(-B)</math>, which can be lifted to
-<math>
 A^kX = X(-B)^k
 </math>
-for each <math>k \ge 0</math>
+对任意 <math>k \ge 0</math>都成立。因此，对于任意多项式 p，<math>
-by mathematical induction. Consequently,
-<math>
-p(A) X = X p(-B)
-</math>
-for any polynomial <math>p</math>. In particular, let <math>p</math> be the characteristic polynomial of <math>A</math>. Then
-<math>p(A)=0</math>
-due to the [[Cayley-Hamilton theorem]]; meanwhile, the [[spectral mapping theorem]] tells us
-<math>
-\sigma(p(-B)) = p(\sigma(-B)),
-</math>
-where <math>\sigma(\cdot)</math> denotes the spectrum of a matrix. Since <math>A</math> and <math>-B</math> do not share any eigenvalue, <math>p(\sigma(-B))</math> does not contain zero, and hence <math>p(-B)</math> is nonsingular. Thus <math>X= 0</math> as desired. This proves the "if" part of the theorem.
-(i) Assume that A and -B do not share any eigenvalue. Let X be a solution to the abovementioned homogeneous equation. Then AX=X(-B), which can be lifted to
-A^kX = X(-B)^k
-for each k \ge 0
-by mathematical induction. Consequently,
 p(A) X = X p(-B)
+</math>，特别地，设 <math>p</math> 是 ''A'' 的特征多项式。那么由 Cayley-Hamilton 定理可得<math>p(A)=0</math>。同时谱映射定理告诉我们<math>
-for any polynomial p. In particular, let p be the characteristic polynomial of A. Then
-p(A)=0
-due to the Cayley-Hamilton theorem; meanwhile, the spectral mapping theorem tells us
 \sigma(p(-B)) = p(\sigma(-B)),
+</math>，其中 <math>\sigma(\cdot)</math>表示矩阵的谱。由于 ''A'' 和-''B'' 不共享任何特征值，<math>p(\sigma(-B))</math>不包含零，因此 <math>p(-B)</math>是非奇异的。从而得到<math>X= 0</math>。这证明了定理的“当”部分。
-where \sigma(\cdot) denotes the spectrum of a matrix. Since A and -B do not share any eigenvalue, p(\sigma(-B)) does not contain zero, and hence p(-B) is nonsingular. Thus X= 0 as desired. This proves the "if" part of the theorem.
+(ii)现在假设 ''A'' 和-''B'' 有一个相同的特征值 λ。设 u 是 ''A'' 的对应右特征向量，v 是-''B'' 的对应左特征向量，并且 <math>X=u{v}^*</math>。于是<math>X\neq 0</math>，而<math>
-(i)假设 a 和-b 不共享任何特征值。设 x 是上述齐次方程的解。然后 AX = x (- b) ，它可以被数学归纳法提升到 a ^ kX = x (- b) ^ k。因此，对于任意多项式 p，p (a) x = xp (- b) ，特别地，设 p 是 a 的特征多项式。然后由于 Cayley-Hamilton 定理 p (a) = 0，同时谱映射定理告诉我们 sigma (p (- b)) = p (sigma (- b)) ，其中 sigma (cdot)表示矩阵的谱。由于 a 和-b 不共享任何特征值，p (sigma (- b))不包含零，因此 p (- b)是非奇异的。因此 x = 0。这证明了定理的“如果”部分。
-(ii) Now assume that <math>A</math> and <math>-B</math> share an eigenvalue <math>\lambda</math>. Let <math>u</math> be a corresponding right eigenvector for <math>A</math>, <math>v</math> be a corresponding left eigenvector for <math>-B</math>, and <math>X=u{v}^*</math>. Then <math>X\neq 0</math>, and
-<math>
-AX+XB = A(uv^*)-(uv^*)(-B) = \lambda uv^*-\lambda uv^* = 0.
-</math>
-Hence <math>X</math> is a nontrivial solution to the aforesaid homogeneous equation, justifying the "only if" part of the theorem. '''Q.E.D.'''
-(ii) Now assume that A and -B share an eigenvalue \lambda. Let u be a corresponding right eigenvector for A, v be a corresponding left eigenvector for -B, and X=u{v}^*. Then X\neq 0, and
 AX+XB = A(uv^*)-(uv^*)(-B) = \lambda uv^*-\lambda uv^* = 0.
+</math>。因此，''X'' 是上述齐次方程的非平凡解。这证明了定理的“仅当”部分得证。
-Hence X is a nontrivial solution to the aforesaid homogeneous equation, justifying the "only if" part of the theorem. Q.E.D.
+'''Q.E.D.'''
-(ii)现在假设 a 和-b 共享一个特征值 λ。设 u 是 a 的对应右特征向量，v 是-b 的对应左特征向量，x = u { v } ^ * 。然后 x neq 0，ax + xb = a (uv ^ *)-(uv ^ *)(- b) = lambda uv ^ *-lambda uv ^ * = 0。因此，x 是上述齐次方程的非平凡解，证明了定理的“只有当”部分。Q.e.d.
-As an alternative to the [[spectral mapping theorem]], the nonsigularity of <math>p(-B)</math> in part (i) of the proof can also be demonstrated by the [[Bézout's identity]] for coprime polynomials.
-Let <math>q</math> be the characteristic polynomial of <math>-B</math>. Since <math>A</math> and <math>-B</math> do not share any eigenvalue, <math>p</math> and <math>q</math> are coprime. Hence there exist polynomials <math>f</math> and <math>g</math> such that <math>p(z)f(z)+q(z)g(z)\equiv 1</math>. By the [[Cayley–Hamilton theorem]], <math>q(-B)=0</math>. Thus <math>p(-B)f(-B)=I</math>, implying that <math>p(-B)</math> is nonsigular.
-As an alternative to the spectral mapping theorem, the nonsigularity of p(-B) in part (i) of the proof can also be demonstrated by the Bézout's identity for coprime polynomials.
-Let q be the characteristic polynomial of -B. Since A and -B do not share any eigenvalue, p and q are coprime. Hence there exist polynomials f and g such that p(z)f(z)+q(z)g(z)\equiv 1. By the Cayley–Hamilton theorem, q(-B)=0. Thus p(-B)f(-B)=I, implying that p(-B) is nonsigular.
-作为谱映射定理的另一种形式，证明的第(i)部分 p (- b)的非图形性也可以用贝祖特的互素多项式恒等式来证明。设 q 是-b 的特征多项式。由于 a 和-b 不共享任何特征值，所以 p 和 q 是互素。因此存在多项式 f 和 g，使得 p (z) f (z) + q (z) g (z)等于1。利用 Cayley-Hamilton 定理，q (- b) = 0。因此 p (- b) f (- b) = i，意味着 p (- b)是非奇异的。
-The theorem remains true for real matrices with the caveat that one considers their complex eigenvalues. The proof for the "if" part is still applicable; for the "only if" part, note that both <math>\mathrm{Re}(uv^*)</math> and <math>\mathrm{Im}(uv^*)</math> satisfy the homogenous equation <math>AX+XB=0</math>, and they cannot be zero simultaneously.
-The theorem remains true for real matrices with the caveat that one considers their complex eigenvalues. The proof for the "if" part is still applicable; for the "only if" part, note that both \mathrm{Re}(uv^*) and \mathrm{Im}(uv^*) satisfy the homogenous equation AX+XB=0, and they cannot be zero simultaneously.
-这个定理对实矩阵仍然成立，但需要注意的是考虑它们的复特征值。“如果”部分的证明仍然适用; 对于“只有如果”部分，请注意 mathrum { Re }(uv ^ *)和 mathrum { Im }(uv ^ *)都满足齐次方程 ax + xb = 0，它们不能同时为零。
-==Roth's removal rule==
-==Roth's removal rule==
-= = Roth’s removal rule = =
-Given two square complex matrices ''A'' and ''B'', of size ''n'' and ''m'', and a matrix ''C'' of size ''n'' by ''m'', then one can ask when the following two square matrices of size ''n''&nbsp;+&nbsp;''m'' are [[matrix similarity|similar]] to each other: <math> \begin{bmatrix} A & C \\ 0 & B \end{bmatrix}</math> and <math>\begin{bmatrix} A & 0 \\0&B \end{bmatrix}</math>. The answer is that these two matrices are similar exactly when there exists a matrix ''X'' such that ''AX''&nbsp;−&nbsp;''XB''&nbsp;=&nbsp;''C''. In other words, ''X'' is a solution to a Sylvester equation. This is known as '''Roth's removal rule'''.<ref>{{cite journal|last1=Gerrish|first1=F|last2=Ward|first2=A.G.B|title=Sylvester's matrix equation and Roth's removal rule|journal=The Mathematical Gazette|date=Nov 1998|volume=82|issue=495|pages=423–430|doi=10.2307/3619888|jstor=3619888}}</ref>
-Given two square complex matrices A and B, of size n and m, and a matrix C of size n by m, then one can ask when the following two square matrices of size n + m are similar to each other:  \begin{bmatrix} A & C \\ 0 & B \end{bmatrix} and \begin{bmatrix} A & 0 \\0&B \end{bmatrix}. The answer is that these two matrices are similar exactly when there exists a matrix X such that AX − XB = C. In other words, X is a solution to a Sylvester equation. This is known as Roth's removal rule.
-给定两个 n 和 m 的方阵复矩阵 a 和 b，以及一个 n × m 的矩阵 c，我们可以问下列两个 n + m 的方阵什么时候相似: 开始{ bmatrix } a & c0 & b 结束{ bmatrix } ，开始{ bmatrix } a & 0 & b 结束{ bmatrix }。答案是当存在一个矩阵 x 使得 AX-XB = c 时，这两个矩阵是完全相似的。换句话说，x 是 Sylvester 方程的解。这就是众所周知的罗斯免职规则。
-One easily checks one direction: If ''AX''&nbsp;−&nbsp;''XB''&nbsp;=&nbsp;''C'' then
-:<math>\begin{bmatrix}I_n & X \\ 0 & I_m \end{bmatrix} \begin{bmatrix} A&C\\0&B \end{bmatrix} \begin{bmatrix} I_n & -X \\ 0& I_m \end{bmatrix} = \begin{bmatrix} A&0\\0&B \end{bmatrix}.</math>
-Roth's removal rule does not generalize to infinite-dimensional bounded operators on a Banach space.<ref>Bhatia and Rosenthal, p.3</ref>
-One easily checks one direction: If AX − XB = C then
+作为谱映射定理的另一种形式，证明的第(i)部分<math>\mathrm{Im}(uv^*)</math>的非奇异性也可以用贝祖特的互素多项式恒等式来证明。设 q 是-B 的特征多项式。由于 A 和-B 不共享任何特征值，所以 p 和 q 互素。因此存在多项式 f 和 g，使得 <math>p(z)f(z)+q(z)g(z)\equiv 1</math>。利用 Cayley-Hamilton 定理，可得<math>q(-B)=0</math>，这意味着 <math>p(-B)</math>是非奇异的。
-:\begin{bmatrix}I_n & X \\ 0 & I_m \end{bmatrix} \begin{bmatrix} A&C\\0&B \end{bmatrix} \begin{bmatrix} I_n & -X \\ 0& I_m \end{bmatrix} = \begin{bmatrix} A&0\\0&B \end{bmatrix}.
-Roth's removal rule does not generalize to infinite-dimensional bounded operators on a Banach space.Bhatia and Rosenthal, p.3
-一个很容易检查的方向: 如果 AX-XB = c，那么开始{ bmatrix } i _ n & x0 & i _ m end { bmatrix } begin { bmatrix } a & c0 & b end { bmatrix } begin { bmatrix } i _ n &-x0 & i _ m end { bmatrix } = begin { bmatrix } a & 0 & b end { bmatrix }。Roth 的移除规则不能推广到 Banach 空间上的无限维有界算子
+这个定理对实矩阵仍然成立，但需要注意的是考虑它们的复特征值。充分条件的证明仍然适用; 对于必要条件的证明，请注意 <math>\mathrm{Re}(uv^*)</math>和 <math>\mathrm{Im}(uv^*)</math> 都满足齐次方程 ''AX + XB = 0''，而它们不能同时为零。
-==Numerical solutions==
+= Roth消去法则 =
-A classical algorithm for the numerical solution of the Sylvester equation is the [[Bartels–Stewart algorithm]], which consists of transforming <math>A</math> and <math>B</math> into [[Schur decomposition|Schur form]] by a [[QR algorithm]], and then solving the resulting triangular system via [[Triangular matrix|back-substitution]]. This algorithm, whose computational cost is [[Big O notation|<math>\mathcal{O}(n^3)</math>]] arithmetical operations,{{Citation needed|date=November 2015}} is used, among others, by [[LAPACK]] and the <code>lyap</code> function in [[GNU Octave]].<ref>{{Cite web | url=https://octave.sourceforge.io/control/function/lyap.html | title=Function Reference: Lyap}}</ref> See also the <code>sylvester</code> function in that language.<ref>{{Cite web | url=https://www.gnu.org/software/octave/doc/interpreter/Functions-of-a-Matrix.html | title=Functions of a Matrix (GNU Octave (version 4.4.1))}}</ref><ref>The <code>syl</code> command is deprecated since GNU Octave Version 4.0</ref> In some specific image processing application, the derived Sylvester equation has a closed form solution.<ref>{{cite journal |first=Q. |last=Wei |first2=N.  |last2=Dobigeon |first3=J.-Y.  |last3=Tourneret |title=Fast Fusion of Multi-Band Images Based on Solving a Sylvester Equation|journal=[[IEEE Transactions on Image Processing|IEEE]] |volume=24 |year=2015 |issue=11 |pages=4109–4121 |doi=10.1109/TIP.2015.2458572|pmid=26208345 |arxiv=1502.03121|bibcode=2015ITIP...24.4109W}}</ref>
-A classical algorithm for the numerical solution of the Sylvester equation is the Bartels–Stewart algorithm, which consists of transforming A and B into Schur form by a QR algorithm, and then solving the resulting triangular system via back-substitution. This algorithm, whose computational cost is \mathcal{O}(n^3) arithmetical operations, is used, among others, by LAPACK and the lyap function in GNU Octave. See also the sylvester function in that language.The <code>syl</code> command is deprecated since GNU Octave Version 4.0 In some specific image processing application, the derived Sylvester equation has a closed form solution.
+给定两个大小分别为''n'' 和 ''m'' 的复方阵 ''A'' 和 ''B''，以及大小为 n × m 的矩阵 ''C''，我们可以确认下列两个大小为 ''n + m'' 的方阵\begin{bmatrix} A & C \\ 0 & B \end{bmatrix}和\begin{bmatrix} A & 0 \\0&B \end{bmatrix}是否相似。当存在矩阵 ''X'' 使得 ''AX-XB'' ''='' ''C'' ，这两个矩阵就是相似的。换句话说，''X'' 是维斯特方程的解。这就是众所周知的'''Roth消去法则'''<ref>{{cite journal|last1=Gerrish|first1=F|last2=Ward|first2=A.G.B|title=Sylvester's matrix equation and Roth's removal rule|journal=The Mathematical Gazette|date=Nov 1998|volume=82|issue=495|pages=423–430|doi=10.2307/3619888|jstor=3619888}}</ref>。
-Sylvester 方程数值解的一个经典算法是 Bartels-Stewart 算法，该算法通过 QR 算法将 a 和 b 转化为 Schur 形式，然后通过反代换求解得到三角形方程组。该算法的计算代价是数学{ o }(n ^ 3)算术运算，其中包括 LAPACK ack 和 GNU Octave 中的 lyap 函数。请参阅该语言中的 sylvester 函数。自 GNU Octave Version 4.0以来，不推荐使用 < code > syl  命令。在某些特定的图像处理应用程序中，导出的 Sylvester 方程有一个封闭的解。
+一种简便的检查方式如下：如果''AX-XB = C''，那么：
-==See also==
+<math>\begin{bmatrix}I_n & X \\ 0 & I_m \end{bmatrix} \begin{bmatrix} A&C\\0&B \end{bmatrix} \begin{bmatrix} I_n & -X \\ 0& I_m \end{bmatrix} = \begin{bmatrix} A&0\\0&B \end{bmatrix}.</math>
-* [[Lyapunov equation]]
-* [[Algebraic Riccati equation]]
-* Lyapunov equation
+Roth消去法则无法一般化到巴拿赫空间中的无穷维有界算子中。<ref>Bhatia and Rosenthal, p.3</ref>
-* Algebraic Riccati equation
+==数值解==
+韦斯特方程数值解的一个经典算法是 Bartels-Stewart 算法，该算法通过 QR 算法将 矩阵''A'' 和 ''B'' 转化为舒尔形式，然后通过逆向取代法求解三角矩阵。在LAPACK，或是GNU Octave的lyap函数中，该算法的计算代价是[[Big O notation|<math>\mathcal{O}(n^3)</math>]]<ref>{{Cite web | url=https://octave.sourceforge.io/control/function/lyap.html | title=Function Reference: Lyap}}</ref>。也可以参阅该语言中的 sylvester 函数。自 GNU Octave Version 4.0以来，不推荐使用syl  命令。<ref>{{Cite web | url=https://www.gnu.org/software/octave/doc/interpreter/Functions-of-a-Matrix.html | title=Functions of a Matrix (GNU Octave (version 4.4.1))}}</ref><ref>The <code>syl</code> command is deprecated since GNU Octave Version 4.0</ref>在某些特定的图像处理应用程序中，导出的韦斯特方程有会有解析解<ref>{{cite journal |first=Q. |last=Wei |first2=N.  |last2=Dobigeon |first3=J.-Y.  |last3=Tourneret |title=Fast Fusion of Multi-Band Images Based on Solving a Sylvester Equation|journal=[[IEEE Transactions on Image Processing|IEEE]] |volume=24 |year=2015 |issue=11 |pages=4109–4121 |doi=10.1109/TIP.2015.2458572|pmid=26208345 |arxiv=1502.03121|bibcode=2015ITIP...24.4109W}}</ref>。
-方程
+==相关条目==
-* 代数 Riccati方程
+* 李雅普诺夫方程
+* 代数Riccati方程
 ==Notes==
@@ 第178行： / 第92行： @@
 *
-==External links==
+==外部链接==
 * [http://calculator-fx.com/calculator/linear-algebra/solve-sylvester-equation Online solver for arbitrary sized matrices.]{{deadlink|date=May 2021}}
 * [http://reference.wolfram.com/mathematica/ref/LyapunovSolve.html Mathematica function to solve the Sylvester equation]

“韦斯特方程”的版本间的差异