Access the full text.
Sign up today, get DeepDyve free for 14 days.
R. Nelson (1987)
Stochastic catastrophe theory in computer performance modelingJ. ACM, 34
D. Bertsekas (2009)
Neuro-Dynamic Programming
Vijay Konda, J. Tsitsiklis (2003)
OnActor-Critic AlgorithmsSIAM J. Control. Optim., 42
S. Primak, V. Kontorovitch, V. Lyandres (2004)
Stochastic Methods and their Applications to Communications: Stochastic Differential Equations Approach
S. Bhatnagar, Shishir Kumar (2004)
A simultaneous perturbation stochastic approximation-based actor-critic algorithm for Markov decision processesIEEE Trans. Autom. Control., 49
F. Vázquez-Abad, H. Kushner (1992)
Estimation of the derivative of a stationary measure with respect to a control parameterJournal of Applied Probability, 29
R. Singer (1970)
Estimating Optimal Tracking Filter Performance for Manned Maneuvering TargetsIEEE Transactions on Aerospace and Electronic Systems, AES-6
(2009)
Optimal parameter trajectory estimation in parameterized SDEs : An algorithmic procedure
S. Bhatnagar (2007)
Adaptive Newton-based multivariate smoothed functional algorithms for simulation optimizationACM Trans. Model. Comput. Simul., 18
C. Charalambous, S. Djouadi, S. Denic (2005)
Stochastic power control for wireless networks via SDEs: probabilistic QoS measuresIEEE Transactions on Information Theory, 51
F. Campillo, A. Traore (1995)
A stabilization algorithm for linear controlled SDE'sProceedings of 1995 34th IEEE Conference on Decision and Control, 2
M. Styblinski, Tian-Shen Tang (1990)
Experiments in nonconvex optimization: Stochastic approximation with function smoothing and simulated annealingNeural Networks, 3
(2009)
ACM Transactions on Modeling and Computer Simulation Optimal Parameter Trajectory Estimation in Parameterized SDEs @BULLET
H. Kushner, D. Clark (1978)
wchastic. approximation methods for constrained and unconstrained systems
P. Marbach, J. Tsitsiklis (1998)
Simulation-based optimization of Markov reward processesProceedings of the 37th IEEE Conference on Decision and Control (Cat. No.98CH36171), 3
(2007)
Received May
H. Kushner (2000)
Numerical Methods for Stochastic Control Problems in Continuous Time
S. Bhatnagar (2005)
Adaptive multivariate three-timescale stochastic approximation algorithms for simulation based optimizationACM Trans. Model. Comput. Simul., 15
Andrew Lim, Yu Zhou, J. Moore
Multiple-objective Risk-sensitive Control and Its Small Noise Limit
R. Rubinstein (1981)
Simulation and the Monte Carlo method
S. Bhatnagar, Karmeshu (2011)
Monte-Carlo estimation of time-dependent statistical characteristics of random dynamical systemsApplied Mathematical Modelling, 35
J. Spall (1992)
Multivariate stochastic approximation using a simultaneous perturbation gradient approximationIEEE Transactions on Automatic Control, 37
P. Glasserman (2003)
Monte Carlo Methods in Financial Engineering
R. Moose, Hugh Vanlandingham, D. McCabe (1979)
Modeling and Estimation for Tracking Maneuvering TargetsIEEE Transactions on Aerospace and Electronic Systems, AES-15
(1998)
BAHL, P., AND CHLAMTAC, I
Y. Ho, Xi-Ren Cao (1991)
Perturbation analysis of discrete event dynamic systems
F. Campillo, A. Traore (1994)
Lyapunov exponents of controlled SDE's and stabilizability property : Some examples
R. Korn, H. Kraft (2001)
A Stochastic Control Approach to Portfolio Problems with Stochastic Interest RatesSIAM J. Control. Optim., 40
M. Hirsch (1989)
Convergent activation dynamics in continuous time networksNeural Networks, 2
Vijay Konda, J. Tsitsiklis (1999)
Actor-Critic Algorithms
S. Bhatnagar, M. Fu, S. Marcus, Shashank Bhatnagar (2001)
Two-timescale algorithms for simulation optimization of hidden Markov modelsIIE Transactions, 33
Mohammed Abdulla, S. Bhatnagar (2007)
Reinforcement Learning Based Algorithms for Average Cost Markov Decision ProcessesDiscrete Event Dynamic Systems, 17
S. Bhatnagar, V. Borkar (1998)
A two Timescale Stochastic Approximation Scheme for Simulation-Based Parametric OptimizationProbability in the Engineering and Informational Sciences, 12
S. Bhatnagar, V. Borkar (2003)
Multiscale Chaotic SPSA and Smoothed Functional Algorithms for Simulation OptimizationSIMULATION, 79
Tong Liu, P. Bahl, I. Chlamtac (1998)
Mobility modeling, location tracking, and trajectory prediction in wireless ATM networksIEEE J. Sel. Areas Commun., 16
D. Bertsekas (1995)
Dynamic programming and optimal control, 3rd Edition
P. Glynn (1990)
Likelihood ratio gradient estimation for stochastic systemsCommun. ACM, 33
We consider the problem of estimating the optimal parameter trajectory over a finite time interval in a parameterized stochastic differential equation (SDE), and propose a simulation-based algorithm for this purpose. Towards this end, we consider a discretization of the SDE over finite time instants and reformulate the problem as one of finding an optimal parameter at each of these instants. A stochastic approximation algorithm based on the smoothed functional technique is adapted to this setting for finding the optimal parameter trajectory. A proof of convergence of the algorithm is presented and results of numerical experiments over two different settings are shown. The algorithm is seen to exhibit good performance. We also present extensions of our framework to the case of finding optimal parameterized feedback policies for controlled SDE and present numerical results in this scenario as well.
ACM Transactions on Modeling and Computer Simulation (TOMACS) – Association for Computing Machinery
Published: Mar 1, 2009
Read and print from thousands of top scholarly journals.
Already have an account? Log in
Bookmark this article. You can see your Bookmarks on your DeepDyve Library.
To save an article, log in first, or sign up for a DeepDyve account if you don’t already have one.
Copy and paste the desired citation format or use the link below to download a file formatted for EndNote
Access the full text.
Sign up today, get DeepDyve free for 14 days.
All DeepDyve websites use cookies to improve your online experience. They were placed on your computer when you launched this website. You can change your cookie settings through your browser.