lbfgs_jpl_version/lsopt.2/lsopt_doc.txt


Description of large scale optimization package, Version 3.0
##############################################################
Patrick Heimbach, MIT/EAPS, 02-Mar-2000
Benny Cheng, NASA/JPL, 13-Apr-2010 (parallel version)

reference:
#########

J.C. Gilbert & C. Lemarechal
Some numerical experiments with variable-storage quasi-Newton algorithms
Mathematical Programming 45 (1989), pp. 407-435

flow chart
##########

      lsopt_top
          |
          |---- check arguments
          |---- CALL INSTORE
          |       |
          |       |---- determine whether OPWARMI available:
          |                * if no:  cold start: create OPWARMI
          |                * if yes: warm start: read from OPWARMI
          |             create or open OPWARMD
          |
          |---- check consistency between OPWARMI and model parameters
          | 
          |---- >>> if COLD start: <<<
          |      |  first simulation with f.g. xx_0; output: first ff_0, gg_0
          |      |  set first preconditioner value xdiff_0 to 1
          |      |  store xx(0), gg(0), xdiff(0) to OPWARMD (first 3 entries)
          |      |
          |     >>> else: WARM start: <<<
          |         read xx(i), gg(i) from OPWARMD (first 2 entries)
          |         for first warm start after cold start, i=0
          |
          |
          |
          |---- /// if ITMAX > 0: perform optimization (increment loop index i)
          |      (
          |      )---- save current values of gg(i-1) -> gold(i-1), ff -> fold(i-1)
          |      (---- CALL LSUPDXX
          |      )       |
          |      (       |---- >>> if jmax=0 <<<
          |      )       |      |  first optimization after cold start:
          |      (       |      |  preconditioner estimated via .01*ff_0 (first guess)
          |      )       |      |  dd(i-1) = -gg(i-1)*preco
          |      (       |      |  
          |      )       |     >>> if jmax > 0 <<<
          |      (       |         dd(i-1) = -gg(i-1)
          |      )       |         CALL HESSUPD
          |      (       |           |
          |      )       |           |---- dd(i-1) modified via Hessian approx.
          |      (       |
          |      )       |---- >>> if <dd,gg> >= 0 <<<
          |      (       |         ifail = 4
          |      )       |
          |      (       |---- compute step size: tact(i-1)
          |      )       |---- compute update: xdiff(i) = xx(i-1) + tact(i-1)*dd(i-1)
          |      (
          |      )---- >>> if ifail = 4 <<<
          |      (         goto 1000
          |      )
          |      (---- if (warm start)
          |      (---- CALL OPTLINE / LSLINE
          |      )       |
          |      (       |
          |      )       |
          |      (       |---- /// loop over simulations
          |      )              (  
          |      (              if (linesearch ) 
          |      (              )---- CALL SIMUL
          |      )              (       |
          |      (              )       |---- input: xdiff(i)
          |      )              (       |---- output: ff(i)
          |      )              (                 reads those values from file
          |      (              )
          |      (              (       |---- perform quadratic line search
          |      )              )
          |      (              else
          |      )              )---- CALL SIMUL
          |      )              (       |
          |      (              )       |---- input: xdiff(i)
          |      )              (       |---- output: ff(i),gg(i)
          |      )              (                 reads those values from file
          |      (              )
          |      )              (---- 1st Wolfe test:
          |      (              )     ff(i) <= tact*xpara1*<gg(i-1),dd(i-1)>
          |      )              (
          |      (              )---- 2nd Wolfe test:
          |      )              (     <gg(i),dd(i-1)> >= xpara2*<gg(i-1),dd(i-1)>
          |      (              )
          |      )              (---- >>> if 1st and 2nd Wolfe tests ok <<<
          |      (              )      |  320: update xx: xx(i) = xdiff(i)
          |      )              (      |
          |      (              )     >>> else 
          |      )              )      |  ifail = 7
          |      )              (      |  set stepsize equal half of previous
          |      (              )      |  value 
          |      (              )      |
          |      )              (         update xdiff for new simulation
          |      (              )
          |      (        
          |      )        
          |      (---- store new values xx(i), gg(i) to OPWARMD (first 2 entries)
          |      )---- >>> if ifail = 7,8,9 <<<
          |      (         goto 1000
          |      )
          |      (---- compute new pointers jmin, jmax to include latest values
          |      )     gg(i)-gg(i-1), xx(i)-xx(i-1) to Hessian matrix estimate
          |      (---- store gg(i)-gg(i-1), xx(i)-xx(i-1) to OPWARMD
          |      )     (entries 2*jmax+2, 2*jmax+3)
          |      (
          |      )---- CALL DGSCALE
          |      (       |
          |      )       |---- call dostore
          |      (       |       |
          |      )       |       |---- read preconditioner of previous iteration diag(i-1)
          |      (       |             from OPWARMD (3rd entry)
          |      )       |
          |      (       |---- compute new preconditioner diag(i), based upon diag(i-1),
          |      )       |     gg(i)-gg(i-1), xx(i)-xx(i-1)
          |      (       |
          |      )       |---- call dostore
          |      (               |
          |      )               |---- write new preconditioner diag(i) to OPWARMD (3rd entry)
          |      (
          |---- \\\ end of optimization iteration loop
          |
          |
          |
          |---- CALL OUTSTORE
          |       |
          |       |---- toggle linesearch flag
          |       |---- store gnorm0, ff(i), current pointers jmin, jmax, iterabs, 
          |       |---- searchflags to OPWARMI
          |
          |         xx(i+1) needs to be computed as input for offline optimization
          |          |
          |          |---- CALL LSUPDXX
          |          |       |
          |          |       |---- compute dd(i), tact(i) -> xdiff(i+1) = x(i) + tact(i)*dd(i)
          |          |
          |          |---- CALL WRITE_CONTROL
          |          |       |
          |          |       |---- write xdiff(i+1) to special file for offline optim.
          |
          |---- print final information
          |
          O


Remarks:
#######

-  Every call to simul refers to a read procedure which
   reads the result of an offline forward run and/or the adjoint run
   itmax = 0, for cold start
   itmax = 1, for warm start
   Also, at the end, x(i+1) needs to be computed and saved
   to be available for the offline model and adjoint run

In order to achieve minimum difference between the online and offline code
xdiff(i+1) is stored to file at the end of an (offline) iteration,
but recomputed identically at the beginning of the next iteration.

2. Number of simulations
-------------------------------------------------

- nfunc: controls the maximum number of safeguarded simulations 

Summary: From one iteration to the next the descent direction changes.
         The updated control used as input for these safeguarded simulations uses the same
         descent direction, but different step sizes.

In detail:
From one iteration to the next the descent direction dd changes using
the result for the adjoint vector gg of the previous iteration.
In lsline the updated control xdiff(i,1) = xx(i-1) + tact(i-1,1)*dd(i-1) serves as input for
a forward and adjoint model run yielding a new gg(i,1).
In general, the new solution passes the 1st and 2nd Wolfe tests 
so xdiff(i,1) represents the solution sought: xx(i) = xdiff(i,1).
If one of the two tests fails, stepsize halving is invoked to determine
a new trial step aize tact(i-1,2).
If more than one function call is permitted, the new step size is used together
with the "old" descent direction dd(i-1) (i.e. dd is not updated using the new gg(i)),
to compute a new xdiff(i,2) = xx(i-1) + tact(i-1,2)*dd(i-1) that serves as input
in a new forward and adjoint run, yielding gg(i,2).
If now, both Wolfe tests are successfull, the updated solution is given by
xx(i) = xdiff(i,2) = xx(i-1) + tact(i-1,2)*dd(i-1).

3. Double-usage of fields dd and xdiff
--------------------------------------

In order to save memory both the fields dd and xdiff have a double usage.

- xdiff: in lsopt_top: used as x(i) - x(i-1) for Hessian update
         in lsline:    intermediate result for control update x = x + tact*dd

- dd   : in lsopt_top, lsline: descent vector, dd = -gg   & hessupd
         in dgscale:           intermediate result to compute new preconditioner

4. Notice for user of old code
------------------------------

Three relevant changes needed to switch to new version:
  
  (i): OPWARMI file: two variables added:
               gnorm0  : norm of first (cold start) gradient
               iabsiter: total number of iterations with respect to cold start

 (ii): routine names that are referenced by main_lsopt.f
       lsoptv1 -> lsopt_top
       lsline1 -> lsline

(iii): parameter list of lsopt_top
       logical loffline included

parameter file data.optim
########################

The optimization is controlled by a set of parameters
provided through the standard input file data.optim,
which is generated within the job script.

  NUPDATE  : max. no. of update pairs (gg(i)-gg(i-1), xx(i)-xx(i-1))
             to be stored in OPWARMD to estimate Hessian
             [pair of current iter. is stored in (2*jmax+2, 2*jmax+3)
             jmax must be > 0 to access these entries]
             Presently NUPDATE must be > 0 
             (i.e. iteration without reference to previous
              iterations through OPWARMD has not been tested)
  EPSX     : relative precision on xx bellow which xx should not be improved
  EPSG     : relative precision on gg below which optimization is considered successful
  IPRINT   : controls verbose (>=1) or non-verbose output
  NUMITER  : always 1
  ITER_NUM : index of new restart file to be created (not necessarily = NUMITER!)
  NFUNC    : max. no. of safeguarded simulations 
             (must be > 0)
             is used if step size tact is interpolated;
             in this case, if NFUNC > 1, a new simulation is performed with
             same gradient but "improved" step size
  FC       : first guess cost function value
  FMIN     : not used

OPWARMI, OPWARMD files
######################

Two files retain values of previous iterations which are
used in latest iteration to update Hessian.
OPWARMI: contains index settings and scalar variables
OPWARMD: contains vectors

Structure of OPWARMI file:
-------------------------
    n, fc, gnorm, m, jmin, jmax, sflag, tflag, safe_iter, stepsize 

    n  = nn      : no. of control variables per processor
    fc = ff      : cost value of last iteration
    m = nupdate  : max. no. of updates for Hessian
    jmin, jmax   : pointer indices for OPWARMD file (cf. below)
    gnorm        : norm of gradient gg
    sflag        : true if linesearch is applied to next iteration
    tflag        : true if next iteration is a safeguarded simulation 
    safe_iter    : number of safeguarded simulations made so far
    stepsize     : value of tact stepsize

Structure of OPWARMD file:
-------------------------
   entry
     1    : xx(i)         : control vector of latest iteration
     2    : gg(i)         : gradient of latest iteration
     3    : xdiff(i), diag: preconditioning vector; (1,...,1) for cold start
    ---
 2*jmax+2 : gold = g(i) - g(i-1) for last update (jmax)
 2*jmax+3 : xdiff = tact * d = xx(i) - xx(i-1) for last update (jmax)

if jmax = 0: cold start; no Hessian update used to compute dd
if jmax > nupdate, old positions are overwritten, starting
                with position pair (4,5)

Example 1: jmin = 1, jmax = 3, mupd = 5

  1   2   3   |   4   5     6   7     8   9     empty     empty
|___|___|___| | |___|___| |___|___| |___|___| |___|___| |___|___|
      0       |     1         2         3

Example 2: jmin = 3, jmax = 7, mupd = 5   ---> jmax = 2

  1   2   3   |  
|___|___|___| | |___|___| |___|___| |___|___| |___|___| |___|___|
              |     6         7         3         4         5


Error handling
##############

  ifail |   description
--------+----------------------------------------------------------
   < 0  | should not appear (flag indic in simul.F not used)
     0  | normal mode during execution
     1  | an input argument is wrong
     2  | warm start file is corrupted
     3  | the initial gradient is too small
     4  | the search direction is not a descent one
     5  | maximal number of iterations reached
     6  | maximal number of simulations reached (handled passively)
     7  | the linesearch failed
     8  | the function could not be improved
     9  | optline parameters wrong
    10  | cold start, no optimization done
    11  | convergence achieved within precision


1
2	Description of large scale optimization package, Version 3.0
3	##############################################################
4	Patrick Heimbach, MIT/EAPS, 02-Mar-2000
5	Benny Cheng, NASA/JPL, 13-Apr-2010 (parallel version)
6
7	reference:
8	#########
9
10	J.C. Gilbert & C. Lemarechal
11	Some numerical experiments with variable-storage quasi-Newton algorithms
12	Mathematical Programming 45 (1989), pp. 407-435
13
14	flow chart
15	##########
16
17	lsopt_top
18	\|
19	\|---- check arguments
20	\|---- CALL INSTORE
21	\| \|
22	\| \|---- determine whether OPWARMI available:
23	\| * if no: cold start: create OPWARMI
24	\| * if yes: warm start: read from OPWARMI
25	\| create or open OPWARMD
26	\|
27	\|---- check consistency between OPWARMI and model parameters
28	\|
29	\|---- >>> if COLD start: <<<
30	\| \| first simulation with f.g. xx_0; output: first ff_0, gg_0
31	\| \| set first preconditioner value xdiff_0 to 1
32	\| \| store xx(0), gg(0), xdiff(0) to OPWARMD (first 3 entries)
33	\| \|
34	\| >>> else: WARM start: <<<
35	\| read xx(i), gg(i) from OPWARMD (first 2 entries)
36	\| for first warm start after cold start, i=0
37	\|
38	\|
39	\|
40	\|---- /// if ITMAX > 0: perform optimization (increment loop index i)
41	\| (
42	\| )---- save current values of gg(i-1) -> gold(i-1), ff -> fold(i-1)
43	\| (---- CALL LSUPDXX
44	\| ) \|
45	\| ( \|---- >>> if jmax=0 <<<
46	\| ) \| \| first optimization after cold start:
47	\| ( \| \| preconditioner estimated via .01*ff_0 (first guess)
48	\| ) \| \| dd(i-1) = -gg(i-1)*preco
49	\| ( \| \|
50	\| ) \| >>> if jmax > 0 <<<
51	\| ( \| dd(i-1) = -gg(i-1)
52	\| ) \| CALL HESSUPD
53	\| ( \| \|
54	\| ) \| \|---- dd(i-1) modified via Hessian approx.
55	\| ( \|
56	\| ) \|---- >>> if <dd,gg> >= 0 <<<
57	\| ( \| ifail = 4
58	\| ) \|
59	\| ( \|---- compute step size: tact(i-1)
60	\| ) \|---- compute update: xdiff(i) = xx(i-1) + tact(i-1)*dd(i-1)
61	\| (
62	\| )---- >>> if ifail = 4 <<<
63	\| ( goto 1000
64	\| )
65	\| (---- if (warm start)
66	\| (---- CALL OPTLINE / LSLINE
67	\| ) \|
68	\| ( \|
69	\| ) \|
70	\| ( \|---- /// loop over simulations
71	\| ) (
72	\| ( if (linesearch )
73	\| ( )---- CALL SIMUL
74	\| ) ( \|
75	\| ( ) \|---- input: xdiff(i)
76	\| ) ( \|---- output: ff(i)
77	\| ) ( reads those values from file
78	\| ( )
79	\| ( ( \|---- perform quadratic line search
80	\| ) )
81	\| ( else
82	\| ) )---- CALL SIMUL
83	\| ) ( \|
84	\| ( ) \|---- input: xdiff(i)
85	\| ) ( \|---- output: ff(i),gg(i)
86	\| ) ( reads those values from file
87	\| ( )
88	\| ) (---- 1st Wolfe test:
89	\| ( ) ff(i) <= tactxpara1<gg(i-1),dd(i-1)>
90	\| ) (
91	\| ( )---- 2nd Wolfe test:
92	\| ) ( <gg(i),dd(i-1)> >= xpara2*<gg(i-1),dd(i-1)>
93	\| ( )
94	\| ) (---- >>> if 1st and 2nd Wolfe tests ok <<<
95	\| ( ) \| 320: update xx: xx(i) = xdiff(i)
96	\| ) ( \|
97	\| ( ) >>> else
98	\| ) ) \| ifail = 7
99	\| ) ( \| set stepsize equal half of previous
100	\| ( ) \| value
101	\| ( ) \|
102	\| ) ( update xdiff for new simulation
103	\| ( )
104	\| (
105	\| )
106	\| (---- store new values xx(i), gg(i) to OPWARMD (first 2 entries)
107	\| )---- >>> if ifail = 7,8,9 <<<
108	\| ( goto 1000
109	\| )
110	\| (---- compute new pointers jmin, jmax to include latest values
111	\| ) gg(i)-gg(i-1), xx(i)-xx(i-1) to Hessian matrix estimate
112	\| (---- store gg(i)-gg(i-1), xx(i)-xx(i-1) to OPWARMD
113	\| ) (entries 2jmax+2, 2jmax+3)
114	\| (
115	\| )---- CALL DGSCALE
116	\| ( \|
117	\| ) \|---- call dostore
118	\| ( \| \|
119	\| ) \| \|---- read preconditioner of previous iteration diag(i-1)
120	\| ( \| from OPWARMD (3rd entry)
121	\| ) \|
122	\| ( \|---- compute new preconditioner diag(i), based upon diag(i-1),
123	\| ) \| gg(i)-gg(i-1), xx(i)-xx(i-1)
124	\| ( \|
125	\| ) \|---- call dostore
126	\| ( \|
127	\| ) \|---- write new preconditioner diag(i) to OPWARMD (3rd entry)
128	\| (
129	\|---- \\\ end of optimization iteration loop
130	\|
131	\|
132	\|
133	\|---- CALL OUTSTORE
134	\| \|
135	\| \|---- toggle linesearch flag
136	\| \|---- store gnorm0, ff(i), current pointers jmin, jmax, iterabs,
137	\| \|---- searchflags to OPWARMI
138	\|
139	\| xx(i+1) needs to be computed as input for offline optimization
140	\| \|
141	\| \|---- CALL LSUPDXX
142	\| \| \|
143	\| \| \|---- compute dd(i), tact(i) -> xdiff(i+1) = x(i) + tact(i)*dd(i)
144	\| \|
145	\| \|---- CALL WRITE_CONTROL
146	\| \| \|
147	\| \| \|---- write xdiff(i+1) to special file for offline optim.
148	\|
149	\|---- print final information
150	\|
151	O
152
153
154
155	Remarks:
156	#######
157
158	- Every call to simul refers to a read procedure which
159	reads the result of an offline forward run and/or the adjoint run
160	itmax = 0, for cold start
161	itmax = 1, for warm start
162	Also, at the end, x(i+1) needs to be computed and saved
163	to be available for the offline model and adjoint run
164
165	In order to achieve minimum difference between the online and offline code
166	xdiff(i+1) is stored to file at the end of an (offline) iteration,
167	but recomputed identically at the beginning of the next iteration.
168
169	2. Number of simulations
170	-------------------------------------------------
171
172	- nfunc: controls the maximum number of safeguarded simulations
173
174	Summary: From one iteration to the next the descent direction changes.
175	The updated control used as input for these safeguarded simulations uses the same
176	descent direction, but different step sizes.
177
178	In detail:
179	From one iteration to the next the descent direction dd changes using
180	the result for the adjoint vector gg of the previous iteration.
181	In lsline the updated control xdiff(i,1) = xx(i-1) + tact(i-1,1)*dd(i-1) serves as input for
182	a forward and adjoint model run yielding a new gg(i,1).
183	In general, the new solution passes the 1st and 2nd Wolfe tests
184	so xdiff(i,1) represents the solution sought: xx(i) = xdiff(i,1).
185	If one of the two tests fails, stepsize halving is invoked to determine
186	a new trial step aize tact(i-1,2).
187	If more than one function call is permitted, the new step size is used together
188	with the "old" descent direction dd(i-1) (i.e. dd is not updated using the new gg(i)),
189	to compute a new xdiff(i,2) = xx(i-1) + tact(i-1,2)*dd(i-1) that serves as input
190	in a new forward and adjoint run, yielding gg(i,2).
191	If now, both Wolfe tests are successfull, the updated solution is given by
192	xx(i) = xdiff(i,2) = xx(i-1) + tact(i-1,2)*dd(i-1).
193
194	3. Double-usage of fields dd and xdiff
195	--------------------------------------
196
197	In order to save memory both the fields dd and xdiff have a double usage.
198
199	- xdiff: in lsopt_top: used as x(i) - x(i-1) for Hessian update
200	in lsline: intermediate result for control update x = x + tact*dd
201
202	- dd : in lsopt_top, lsline: descent vector, dd = -gg & hessupd
203	in dgscale: intermediate result to compute new preconditioner
204
205	4. Notice for user of old code
206	------------------------------
207
208	Three relevant changes needed to switch to new version:
209
210	(i): OPWARMI file: two variables added:
211	gnorm0 : norm of first (cold start) gradient
212	iabsiter: total number of iterations with respect to cold start
213
214	(ii): routine names that are referenced by main_lsopt.f
215	lsoptv1 -> lsopt_top
216	lsline1 -> lsline
217
218	(iii): parameter list of lsopt_top
219	logical loffline included
220
221	parameter file data.optim
222	########################
223
224	The optimization is controlled by a set of parameters
225	provided through the standard input file data.optim,
226	which is generated within the job script.
227
228	NUPDATE : max. no. of update pairs (gg(i)-gg(i-1), xx(i)-xx(i-1))
229	to be stored in OPWARMD to estimate Hessian
230	[pair of current iter. is stored in (2jmax+2, 2jmax+3)
231	jmax must be > 0 to access these entries]
232	Presently NUPDATE must be > 0
233	(i.e. iteration without reference to previous
234	iterations through OPWARMD has not been tested)
235	EPSX : relative precision on xx bellow which xx should not be improved
236	EPSG : relative precision on gg below which optimization is considered successful
237	IPRINT : controls verbose (>=1) or non-verbose output
238	NUMITER : always 1
239	ITER_NUM : index of new restart file to be created (not necessarily = NUMITER!)
240	NFUNC : max. no. of safeguarded simulations
241	(must be > 0)
242	is used if step size tact is interpolated;
243	in this case, if NFUNC > 1, a new simulation is performed with
244	same gradient but "improved" step size
245	FC : first guess cost function value
246	FMIN : not used
247
248	OPWARMI, OPWARMD files
249	######################
250
251	Two files retain values of previous iterations which are
252	used in latest iteration to update Hessian.
253	OPWARMI: contains index settings and scalar variables
254	OPWARMD: contains vectors
255
256	Structure of OPWARMI file:
257	-------------------------
258	n, fc, gnorm, m, jmin, jmax, sflag, tflag, safe_iter, stepsize
259
260	n = nn : no. of control variables per processor
261	fc = ff : cost value of last iteration
262	m = nupdate : max. no. of updates for Hessian
263	jmin, jmax : pointer indices for OPWARMD file (cf. below)
264	gnorm : norm of gradient gg
265	sflag : true if linesearch is applied to next iteration
266	tflag : true if next iteration is a safeguarded simulation
267	safe_iter : number of safeguarded simulations made so far
268	stepsize : value of tact stepsize
269
270	Structure of OPWARMD file:
271	-------------------------
272	entry
273	1 : xx(i) : control vector of latest iteration
274	2 : gg(i) : gradient of latest iteration
275	3 : xdiff(i), diag: preconditioning vector; (1,...,1) for cold start
276	---
277	2*jmax+2 : gold = g(i) - g(i-1) for last update (jmax)
278	2jmax+3 : xdiff = tact d = xx(i) - xx(i-1) for last update (jmax)
279
280	if jmax = 0: cold start; no Hessian update used to compute dd
281	if jmax > nupdate, old positions are overwritten, starting
282	with position pair (4,5)
283
284	Example 1: jmin = 1, jmax = 3, mupd = 5
285
286	1 2 3 \| 4 5 6 7 8 9 empty empty
287	\|___\|___\|___\| \| \|___\|___\| \|___\|___\| \|___\|___\| \|___\|___\| \|___\|___\|
288	0 \| 1 2 3
289
290	Example 2: jmin = 3, jmax = 7, mupd = 5 ---> jmax = 2
291
292	1 2 3 \|
293	\|___\|___\|___\| \| \|___\|___\| \|___\|___\| \|___\|___\| \|___\|___\| \|___\|___\|
294	\| 6 7 3 4 5
295
296
297
298	Error handling
299	##############
300
301	ifail \| description
302	--------+----------------------------------------------------------
303	< 0 \| should not appear (flag indic in simul.F not used)
304	0 \| normal mode during execution
305	1 \| an input argument is wrong
306	2 \| warm start file is corrupted
307	3 \| the initial gradient is too small
308	4 \| the search direction is not a descent one
309	5 \| maximal number of iterations reached
310	6 \| maximal number of simulations reached (handled passively)
311	7 \| the linesearch failed
312	8 \| the function could not be improved
313	9 \| optline parameters wrong
314	10 \| cold start, no optimization done
315	11 \| convergence achieved within precision
316
317