Markov Chains
data:image/s3,"s3://crabby-images/f04ed/f04edb0ab24730059b8a8a4284359d071a5ad085" alt=""
In a previous page, we studied the movement between the city and suburbs. Indeed, if I are S are the initial population of the inner city and the suburban area, and if we assume that every year 40% of the inner city population moves to the suburbs, while 30% of the suburb population moves to the inner part of the city, then after one year the populations are given by
data:image/s3,"s3://crabby-images/98182/98182aedb3fdd9332a7a853c43034c1848388fae" alt="\begin{displaymath}\left(\begin{array}{c} 0.6 I + 0.3 S\\ 0.4 I + 0.7 S\\ \end... ...ay}\right) \left(\begin{array}{c} I\\ S\\ \end{array}\right).\end{displaymath}"
The matrix
data:image/s3,"s3://crabby-images/cc930/cc9300ae0098e609d2edd8abb755e4abd3da7bb4" alt="\begin{displaymath}P = \left(\begin{array}{cc} 0.6&0.3\\ 0.4&0.7\\ \end{array}\right) \end{displaymath}"
is very special. Indeed, the entries of each column vectors are positive and their sum is 1. Such vectors are called probability vectors. A matrix for which all the column vectors are probability vectors is called transition orstochastic matrix. Andrei Markov, a russian mathematician, was the first one to study these matrices. At the beginning of this century he developed the fundamentals of the Markov Chain theory.
A Markov chain is a process that consists of a finite number of states and some known probabilities pij, where pij is the probability of moving from state j to state i. In the example above, we have two states: living in the city and living in the suburbs. The number pij represents the probability of moving from state i to state j in one year. We may have more than two states. For example, political affiliation: Democrat, Republican, and Independent. For example, pij represents the probability of a son belonging to party i if his father belonged to party j.
Of particular interest is a probability vector p such that
data:image/s3,"s3://crabby-images/c4d66/c4d661c5f63db014d6a60a9d73628579aefb7934" alt="$A \mbox{\bf p} = \mbox{\bf p}$"
data:image/s3,"s3://crabby-images/c0a19/c0a19f3c79d602ddf22c2f064d7d2777d2d0a8a9" alt="\begin{displaymath}\left(\begin{array}{cc} 0.6&0.3\\ 0.4&0.7\\ \end{array}\rig... ...} -0.4&0.3\\ 0.4&-0.3\\ \end{array}\right)\cdot X = {\cal O}.\end{displaymath}"
This system reduces to the equation -0.4 x + 0.3 y = 0. It is easy to see that, if we set
data:image/s3,"s3://crabby-images/b8db5/b8db5eb85fbfdfacd9651d476739cf80f82b4efe" alt="$x = 0.3 \alpha$"
data:image/s3,"s3://crabby-images/5092b/5092bf6ff1d6ea36221da0587f70a2c9dd43296f" alt="\begin{displaymath}X = \left(\begin{array}{c} x\\ y\\ \end{array}\right) = \alpha \left(\begin{array}{c} 0.3\\ 0.4\\ \end{array}\right).\end{displaymath}"
So the vector
data:image/s3,"s3://crabby-images/3bcef/3bcefed3b4259c7e86df7110e3476af78164b4bf" alt="$\mbox{\bf p}_1 = \displaystyle \left(\begin{array}{c} 0.3\\ 0.4\\ \end{array}\right)$"
data:image/s3,"s3://crabby-images/9d360/9d360c0d292b70ce667c42cd799a3012d6410862" alt="$\mbox{\bf p}_1$"
Let us discuss another example on population dynamics.
Example: Age Distribution of Trees in a Forest
Trees in a forest are assumed in this simple model to fall into four age groups: b(k) denotes the number of baby trees in the forest (age group 0-15 years) at a given time period k; similarly y(k),m(k) and o(k) denote the number of young trees (16-30 years of age), middle-aged trees (age 31-45), and old trees (older than 45 years of age), respectively. The length of one time period is 15 years.
How does the age distribution change from one time period to the next? The model makes the following three assumptions:
- A certain percentage of trees in each age group dies.
- Surviving trees enter into the next age group; old trees remain old.
- Lost trees are replaced by baby trees.
Note that the total tree population does not change over time.
We obtain the following difference equations:
b(k+1) | = | ![]() | (1) |
y(k+1) | = | (1-db) b(k) | (2) |
m(k+1) | = | (1-dy) y(k) | (3) |
o(k+1) | = | (1-dm) m(k) + (1-do) o(k) | (4) |
Here 0 < db,dy,dm,do <1 denote the loss rates in each age group in percent.
Let
data:image/s3,"s3://crabby-images/1d51b/1d51bd6e9bb0cae3d824a0e6dd6ef02669201ad6" alt="\begin{displaymath}{\bf x}(k)=\left(\begin{array}{c}b(k)\\ y(k)\\ m(k)\\ o(k)\end{array}\right)\end{displaymath}"
be the ``age distribution vector". Consider the matrix
data:image/s3,"s3://crabby-images/0e45f/0e45f87545de482d1814ca2279f1aec8fa196e13" alt="\begin{displaymath}A = \left(\begin{array}{cccc} d_b&d_y&d_m&d_o\\ 1-d_b&0&0&0\\ 0&1-d_y&0&0\\ 0&0&1-d_m&1-d_o\\ \end{array}\right).\end{displaymath}"
Then we have
data:image/s3,"s3://crabby-images/3e597/3e597d5e0436406b3b28a2f2a954f57d7669d540" alt="\begin{displaymath}{\bf x}(k+1)=A\cdot {\bf x}(k).\end{displaymath}"
Note that the matrix A is a stochastic matrix!
If, db=.1,dy=.2,dm=.3 and do=.4, then
data:image/s3,"s3://crabby-images/324ca/324ca7ee24fb3a9875b4df72ef7f6df8e42e4cab" alt="\begin{displaymath}A = \left(\begin{array}{cccc} 0.1&0.2&0.3&0.4\\ 0.9&0&0&0\\ 0&0.8&0&0\\ 0&0&0.7&0.6\\ \end{array}\right).\end{displaymath}"
After easy calculations, we find the steady state vector for the age distribution in the forest:
data:image/s3,"s3://crabby-images/81f92/81f9278b0e4d2c472ab46abcdb3cbc43ad7997f4" alt="\begin{displaymath}\mbox{\bf p} = \left(\begin{array}{cccc} 1/3.88\\ 0.9/3.88\\... ...n{array}{cccc} 1\\ 0.9\\ 0.72\\ 1.26\\ \end{array}\right) .\end{displaymath}"
Assume a total tree population of 50,000 trees. Suppose the forest is newly planted, i.e.
data:image/s3,"s3://crabby-images/b19f6/b19f6ccb5be06c8427f05a389ff0eda6de490d6f" alt="\begin{displaymath}{\bf x}(0)=\left(\begin{array}{c}50,000\\ 0\\ 0\\ 0\end{array}\right)\end{displaymath}"
After 15 years, the age distribution in the forest is given by
data:image/s3,"s3://crabby-images/363f6/363f689253a17489bc1737468bb14bf6079d00d8" alt="\begin{displaymath}{\bf x}(1) = \left(\begin{array}{cccc} 0.1&0.2&0.3&0.4\\ 0.9... ...\begin{array}{cccc} 0.1\\ 0.9\\ 0\\ 0\\ \end{array}\right).\end{displaymath}"
After 30 years, we have
data:image/s3,"s3://crabby-images/2e9b3/2e9b3b1bbdf30000c8ef11b983be384cb965ee73" alt="\begin{displaymath}{\bf x}(2) = \left(\begin{array}{cccc} 0.1&0.2&0.3&0.4\\ 0.9... ...in{array}{cccc} 0.19\\ 0.09\\ 0.72\\ 0\\ \end{array}\right)\end{displaymath}"
and after 45 years
data:image/s3,"s3://crabby-images/e4b98/e4b987ae24361fe25a8cc2510dba46d6868b92ad" alt="\begin{displaymath}{\bf x}(3) = 50,000\left(\begin{array}{cccc} 0.19\\ 0.09\\ 0.72\\ 0\\ \end{array}\right)\end{displaymath}"
After 15n years, where
data:image/s3,"s3://crabby-images/fb372/fb372e186246d074ad1975df41985c8290c791c0" alt="$n=1,2,\cdots$"
data:image/s3,"s3://crabby-images/6985e/6985eb577bff756e4b9c02fec16e1e2e09318a85" alt="\begin{displaymath}{\bf x}(n) = \left(\begin{array}{cccc} 0.1&0.2&0.3&0.4\\ 0.9... ...eft(\begin{array}{cccc} 1\\ 0\\ 0\\ 0\\ \end{array}\right).\end{displaymath}"
So the problem is to find the nth-power of the matrix A. We have seen that diagonalization technique may be helpful to solve this problem. Another problem deals with the long term behavior of the sequence x(n) when ngets large.
The calculations on the example above becomes tedious. Let us illustrate the problem on a small matrix.
Example. Consider the stochastic matrix
data:image/s3,"s3://crabby-images/8c8e1/8c8e18c7448eed5e3f933c61623846d2fec26ed7" alt="\begin{displaymath}A = \left(\begin{array}{cccc} 0.8&0.2\\ 0.2&0.8\\ \end{array}\right).\end{displaymath}"
Note this is a symmetric matrix. The characteristic polynomial of A is
data:image/s3,"s3://crabby-images/47e1b/47e1bfd6f42b473d7f37adced7527ba79bbc7e7c" alt="\begin{displaymath}p(\lambda) = (0.8 - \lambda)^2-0.2^2 = (1-\lambda)(0.6 - \lambda)\end{displaymath}"
An eigenvector associated to 1 is
data:image/s3,"s3://crabby-images/383eb/383ebfba8a57fcd615a11e2432a7558e89bdd5ed" alt="\begin{displaymath}\left(\begin{array}{cccc} 1\\ 1\\ \end{array}\right)\end{displaymath}"
and an eigenvector associated to 0.6 is
data:image/s3,"s3://crabby-images/e3294/e3294f0f3590056d6b0f598c20d3f1bb2181c626" alt="\begin{displaymath}\left(\begin{array}{cccc} 1\\ -1\\ \end{array}\right).\end{displaymath}"
If we set
data:image/s3,"s3://crabby-images/23b14/23b14b6bbc76e60166cade960464ad72663661d2" alt="\begin{displaymath}P = \left(\begin{array}{rr} 1&1\\ 1&-1\\ \end{array}\right),\end{displaymath}"
then we have
data:image/s3,"s3://crabby-images/e657a/e657a8f7847d9c7b428d5860529339430465b62e" alt="\begin{displaymath}P^{-1}AP = D = \left(\begin{array}{cc} 1&0\\ 0&0.6\\ \end{array}\right).\end{displaymath}"
So, we have
data:image/s3,"s3://crabby-images/c9789/c97890e912b7538b31ecaaa613125082408a67b8" alt="\begin{displaymath}A^n = P D^nP^{-1} = P\left(\begin{array}{cc} 1&0\\ 0&(0.6)^n\\ \end{array}\right)P^{-1}.\end{displaymath}"
When n gets large, the matrices An get closer to the matrix
data:image/s3,"s3://crabby-images/3781a/3781a33cf18960bbab9b4415f1ea2bfb3afb2882" alt="\begin{displaymath}P\left(\begin{array}{cc} 1&0\\ 0&0\\ \end{array}\right)P^{-1}.\end{displaymath}"
So the sequence of vectors defined by
data:image/s3,"s3://crabby-images/c4d5d/c4d5de198d41037235579dd6b94982ab11ce3126" alt="\begin{displaymath}X(n+1) = A X(n),\;\;\mbox{given $X(0)$}\end{displaymath}"
will get closer to
data:image/s3,"s3://crabby-images/83635/8363592c5349a67bd2705d49f00243c303cffa7a" alt="\begin{displaymath}X(\infty) = P\left(\begin{array}{cc} 1&0\\ 0&0\\ \end{array}\right)P^{-1}X(0)\end{displaymath}"
when n gets large. If
data:image/s3,"s3://crabby-images/e8cf0/e8cf0f33b712c5d0b33d8c3b376f8ccab38d8fb3" alt="$X(0) = \displaystyle \left(\begin{array}{cc} a\\ b\\ \end{array}\right)$"
data:image/s3,"s3://crabby-images/63338/63338a239bc6409482bfbb53ebaf2c9c1937d50f" alt="\begin{displaymath}X(\infty) = \left(\begin{array}{rr} 1&1\\ 1&-1\\ \end{array... ...ac{a+b}{2}\left(\begin{array}{cc} 1\\ 1\\ \end{array}\right).\end{displaymath}"
Note that the vector
data:image/s3,"s3://crabby-images/d21f2/d21f29c8e5ab459ff10a2847b5e0ce88d1d86473" alt="$X(\infty)$"
data:image/s3,"s3://crabby-images/e9a8a/e9a8ae6ccfbe798cc7136a2ab03dca35dd1c8ba5" alt="\begin{displaymath}{\bf p} = \frac{1}{2}\left(\begin{array}{cc} 1\\ 1\\ \end{array}\right).\end{displaymath}"
This is not surprising. In fact there is a general result similar to the one above for any stochastic matrix.
data:image/s3,"s3://crabby-images/f04ed/f04edb0ab24730059b8a8a4284359d071a5ad085" alt=""
Do you need more help? Please post your question on our S.O.S. Mathematics CyberBoard.
data:image/s3,"s3://crabby-images/118c3/118c3a3675bdf432bffc4f5eb86f1cdb4b7922bf" alt=""
Author: M.A. Khamsi
Contact us
Math Medics, LLC. - P.O. Box 12395 - El Paso TX 79913 - USA
800 users online during the last hour
No comments:
Post a Comment