wikipedia Dynamic programming

Dynamic programming is both a mathematical optimization method and a computer programming method. The method was developed by Richard Bellman in the 1950s and has found applications in numerous fields, from aerospace engineering（航天工程） to economics. In both contexts it refers to simplifying a complicated problem by breaking it down into simpler sub-problems in a recursive manner. While some decision problems（决策问题） cannot be taken apart this way, decisions that span several points in time do often break apart recursively. Likewise, in computer science, if a problem can be solved optimally by breaking it into sub-problems and then recursively finding the optimal solutions to the sub-problems, then it is said to have optimal substructure（最优子结构）.

If sub-problems can be nested recursively inside larger problems, so that dynamic programming methods are applicable, then there is a relation between the value of the larger problem and the values of the sub-problems.[1]In the optimization literature this relationship is called the Bellman equation.

Overview

Mathematical optimization

In terms of mathematical optimization, dynamic programming usually refers to simplifying a decision by breaking it down into a sequence of decision steps over time. This is done by defining a sequence of value functions V*1, *V*2, ..., *V**n taking y as an argument representing the state of the system at times i from 1 to n. The definition of V**n(y) is the value obtained in state y at the last time n. The values V**i at earlier times i = n −1, n − 2, ..., 2, 1 can be found by working backwards, using a recursive relationship called the Bellman equation. For i = 2, ..., n, V**i−1 at any state y is calculated from V**i by maximizing a simple function (usually the sum) of the gain from a decision at time i − 1 and the function V**i at the new state of the system if this decision is made. Since V**i has already been calculated for the needed states, the above operation yields V**i−1 for those states. Finally, *V*1 at the initial state of the system is the value of the optimal solution. The optimal values of the decision variables can be recovered, one by one, by tracking back the calculations already performed.

Computer programming

There are two key attributes that a problem must have in order for dynamic programming to be applicable: optimal substructure and overlapping sub-problems. If a problem can be solved by combining optimal solutions to non-overlapping sub-problems, the strategy is called "divide and conquer" instead.[1] This is why merge sort and quick sort are not classified as dynamic programming problems.

SUMMARY : sorting并不具备 overlapping sub-problems的特性；

Optimal substructure means that the solution to a given optimization problem can be obtained by the combination of optimal solutions to its sub-problems. Such optimal substructures are usually described by means of recursion. For example, given a graph G=(V,E), the shortest path p from a vertex u to a vertex v exhibits optimal substructure: take any intermediate vertex w on this shortest path p. If p is truly the shortest path, then it can be split into sub-paths p1 from u to w and p2 from w to v such that these, in turn, are indeed the shortest paths between the corresponding vertices (by the simple cut-and-paste argument described in Introduction to Algorithms). Hence, one can easily formulate the solution for finding shortest paths in a recursive manner, which is what the Bellman–Ford algorithm or the Floyd–Warshall algorithm does.

Overlapping sub-problems means that the space of sub-problems must be small, that is, any recursive algorithm solving the problem should solve the same sub-problems over and over, rather than generating new sub-problems. For example, consider the recursive formulation for generating the Fibonacci series: $F_i = F_{i−1} + F_{i−2}$ , with base case $F_1 = F_2 = 1$ . Then $F_{43} = F_{42} + F_{41}$ , and $F_{42} = F_{41} + F_{40}$ . Now $F_{41}$ is being solved in the recursive sub-trees of both $F_{43}$ as well as $F_{42}$ . Even though the total number of sub-problems is actually small (only 43 of them), we end up solving the same problems over and over if we adopt a naive recursive solution such as this. Dynamic programming takes account of this fact and solves each sub-problem only once.

Figure 2.

This can be achieved in either of two ways:[citation needed]

Top-down approach: This is the direct fall-out of the recursive formulation of any problem. If the solution to any problem can be formulated recursively using the solution to its sub-problems, and if its sub-problems are overlapping, then one can easily memoize or store the solutions to the sub-problems in a table. Whenever we attempt to solve a new sub-problem, we first check the table to see if it is already solved. If a solution has been recorded, we can use it directly, otherwise we solve the sub-problem and add its solution to the table.
Bottom-up approach: Once we formulate the solution to a problem recursively as in terms of its sub-problems, we can try reformulating the problem in a bottom-up fashion: try solving the sub-problems first and use their solutions to build-on and arrive at solutions to bigger sub-problems. This is also usually done in a tabular form by iteratively generating solutions to bigger and bigger sub-problems by using the solutions to small sub-problems. For example, if we already know the values of *F*41 and *F*40, we can directly calculate the value of *F*42.

Some programming languages can automatically memoize the result of a function call with a particular set of arguments, in order to speed up call-by-name evaluation (this mechanism is referred to as call-by-need). Some languages make it possible portably (e.g. Scheme, Common Lisp, Perl or D). Some languages have automatic memoization built in, such as tabled Prolog and J, which supports memoization with the M. adverb.[4] In any case, this is only possible for a referentially transparent function. Memoization is also encountered as an easily accessible design pattern within term-rewrite based languages such as Wolfram Language.

Examples: Computer algorithms

Dijkstra's algorithm for the shortest path problem

Fibonacci sequence

A type of balanced 0–1 matrix

Checkerboard

Sequence alignment

Tower of Hanoi puzzle

Egg dropping puzzle

Matrix chain multiplication

Main article: Matrix chain multiplication

Algorithms that use dynamic programming

Recurrent solutions to lattice models for protein-DNA binding
Backward induction as a solution method for finite-horizon discrete-time dynamic optimization problems
Method of undetermined coefficients can be used to solve the Bellman equation in infinite-horizon, discrete-time, discounted, time-invariant dynamic optimization problems
Many string algorithms including longest common subsequence, longest increasing subsequence, longest common substring, Levenshtein distance (edit distance)
Many algorithmic problems on graphs can be solved efficiently for graphs of bounded treewidth or bounded clique-width by using dynamic programming on a tree decomposition of the graph.
The Cocke–Younger–Kasami (CYK) algorithm which determines whether and how a given string can be generated by a given context-free grammar
Knuth's word wrapping algorithm that minimizes raggedness when word wrapping text
The use of transposition tables and refutation tables in computer chess
The Viterbi algorithm (used for hidden Markov models, and particularly in part of speech tagging)
The Earley algorithm (a type of chart parser)
The Needleman–Wunsch algorithm and other algorithms used in bioinformatics, including sequence alignment, structural alignment, RNA structure prediction
Floyd's all-pairs shortest path algorithm
Optimizing the order for chain matrix multiplication
Pseudo-polynomial time algorithms for the subset sum, knapsack and partition problems
The dynamic time warping algorithm for computing the global distance between two time series
The Selinger (a.k.a. System R) algorithm for relational database query optimization
De Boor algorithm for evaluating B-spline curves
Duckworth–Lewis method for resolving the problem when games of cricket are interrupted
The value iteration method for solving Markov decision processes
Some graphic image edge following selection methods such as the "magnet" selection tool in Photoshop
Some methods for solving interval scheduling problems
Some methods for solving the travelling salesman problem, either exactly (in exponential time) or approximately (e.g. via the bitonic tour)
Recursive least squares method
Beat tracking in music information retrieval
Adaptive-critic training strategy for artificial neural networks
Stereo algorithms for solving the correspondence problem used in stereo vision
Seam carving (content-aware image resizing)
The Bellman–Ford algorithm for finding the shortest distance in a graph
Some approximate solution methods for the linear search problem
Kadane's algorithm for the maximum subarray problem
Optimization of electric generation expansion plans in the Wein Automatic System Planning (WASP) package