Initial Estimates for Parameter Estimation
Vivek Dua
Centre for Process Systems Engineering, Department of Chemical Engineering, University College London, London WC1E 7JE
Objectives: Parameter estimation for models involving differential equations corresponds to a dynamic optimization problem, which in general has multiple locally optimal solutions. The solution techniques for global optimization can be broadly classified as deterministic and non-deterministic methods. Deterministic methods can guarantee that solution obtained is globally optimal within a certain pre-specified tolerance whereas the non-determinstic methods do not provide any such guarantee. However these methods are computationally demanding and most of the commercially available softwares for parameter estimation are based upon local optimization techniques. The main objective of this work is to develop algorithms for obtaining good quality initial estimates, which can speed-up the solution times, reduce solver failures, and increase possibility of obtaining the globally optimal solution.
Methods: The parameter estimation problem is decomposed into two sub-problems. The first sub-problem corresponds to developing an Artificial Neural Network (ANN) model for the given data. The ANN model represents a reduced data set [1]. The second subproblem is formulated as a parameter estimation problem where the differential terms in the model are obtained by analytically differentiating the ANN model. The second subproblem corresponds to a Linear Program (LP) or a Nonlinear Program (NLP), for which reliable solvers are available. The solution of the second subproblem provides initial estimates for parameters. These estimates are then used to solve the original parameter estimation problem.
Results: This methodology was tested on several parameter estimation problems and the solution was obtained reliably and in a few iterations only.
Conclusions: The main advantage of using the proposed decomposition approach is that the error between the data and model predictions is carried out in the first step and the parameter estimation for a reduced data set is carried out in the second step. ANN is well known for their ability to handle large data sets and characterize highly nonlinear functions very effectively. This ability together with the differentiability properties of ANN makes the proposed approach a very useful tool for parameter estimation of problems involving differential equations and large data sets.References:
[1] Dua, V. (2006) Optimal configuration of artificial neural networks, Proc. of 16th ESCAPE and 9th Int Symp on PSE, W. Marquardt and C. Pantelides (Eds), p.1599-1604, Elsevier, Amsterdam.