Matrix decomposition (or factorization) is important in many research areas, especially data analysis; PCA, for example, can be computed with either the SVD or the EVD (eigenvalue decomposition). There are more than 10 kinds of matrix decomposition methods. In general, researchers divide them into 4 types: diagonal factorization (like SVD), triangular factorization (like LU), triangular-diagonal decomposition (like the Schur decomposition) and tridiagonal decomposition. Triangular factorization is discussed first.

1 Cholesky factorization
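
As a minimal sketch of the factorization named in this heading, the code below assumes the standard definition: a symmetric positive definite matrix $A$ is written as $A=LL^T$ with $L$ lower triangular. The function name `cholesky` and the example matrix are illustrative choices, not taken from the text, and the routine uses plain Python lists rather than any particular library.

```python
import math

def cholesky(a):
    """Return lower-triangular L (list of lists) with a = L L^T.

    Assumes `a` is a symmetric positive definite matrix given as a
    list of rows; raises ValueError if a non-positive pivot appears.
    """
    n = len(a)
    L = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1):
            # Dot product of the already computed parts of rows i and j.
            s = sum(L[i][k] * L[j][k] for k in range(j))
            if i == j:
                d = a[i][i] - s
                if d <= 0:
                    raise ValueError("matrix is not positive definite")
                L[i][j] = math.sqrt(d)
            else:
                L[i][j] = (a[i][j] - s) / L[j][j]
    return L

# Illustrative symmetric positive definite matrix.
A = [[4.0, 12.0, -16.0],
     [12.0, 37.0, -43.0],
     [-16.0, -43.0, 98.0]]
L = cholesky(A)

# Rebuild A from L to check the factorization.
n = len(A)
rebuilt = [[sum(L[i][k] * L[j][k] for k in range(n)) for j in range(n)]
           for i in range(n)]
```

Only the lower triangle of the input is read, so the sketch relies on the caller passing a symmetric matrix; it does not check symmetry itself.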

Most theories and methods in machine learning depend on or derive from mathematics and probability, so it is worth reviewing probability before starting to study machine learning.

2 Review of Probability (just an outline)

2.1 Probability

Some Probability Formulas:

(1)Sum rule: $Pr[A\cup B]=Pr[A]+Pr[B]-Pr[A\cap B]$

(2)Union bound: $Pr[\bigcup\limits_{i=1}^n A_i]\le\sum\limits_{i=1}^nPr[A_i]$, with equality when the events $A_i$ are pairwise disjoint
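
The sum rule and the union bound can be checked numerically on a small finite sample space. The sketch below assumes one roll of a fair six-sided die; the events `A`, `B`, `C` and the helper `pr` are hypothetical names chosen for illustration. Exact fractions are used so the comparisons are not affected by floating-point rounding.

```python
from fractions import Fraction

# Assumed sample space: one roll of a fair six-sided die.
OMEGA = frozenset(range(1, 7))

def pr(event):
    """Probability of an event (a subset of OMEGA) under the uniform distribution."""
    return Fraction(len(event & OMEGA), len(OMEGA))

A = {2, 4, 6}   # the roll is even
B = {4, 5, 6}   # the roll is greater than 3
C = {1, 2}      # the roll is at most 2

# Sum rule: Pr[A u B] = Pr[A] + Pr[B] - Pr[A n B]
sum_rule_lhs = pr(A | B)
sum_rule_rhs = pr(A) + pr(B) - pr(A & B)

# Union bound: Pr[A u B u C] <= Pr[A] + Pr[B] + Pr[C]
union_lhs = pr(A | B | C)
union_rhs = pr(A) + pr(B) + pr(C)

print(sum_rule_lhs, sum_rule_rhs)  # both sides of the sum rule
print(union_lhs, union_rhs)        # left side never exceeds right side
```

Because `A`, `B` and `C` overlap, the union bound is strict here; choosing pairwise disjoint events would make both sides equal.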

(3)Conditional probability: