Abstract
This paper proposes a novel sensor scheduling scheme based on adaptive dynamic programming, which makes the sensor energy consumption and tracking error optimal over the system operational horizon for wireless sensor networks with solar energy harvesting. Neural network is used to model the solar energy harvesting. Kalman filter estimation technology is employed to predict the target location. A performance index function is established based on the energy consumption and tracking error. Critic network is developed to approximate the performance index function. The presented method is proven to be convergent. Numerical example shows the effectiveness of the proposed approach.






Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.References
Rout RR, Ghosh SK (2013) Enhancement of lifetime using duty cycle and network coding in wireless sensor networks. IEEE Trans Wirel Commun 12(2):656–667
Li Y, Chen CS, Song Y, Wang Z, Sun Y (2009) Enhancing real-time delivery in wireless sensor networks with two-hop information. IEEE Trans Ind Inform 5(2):113–122
Shi L, Jia QS, Mo YL, Sinopoli B (2011) Sensor scheduling over a packet-delaying network. Automatica 47:1089–1092
Mo YL, Ambrosino R, Sinopoli B (2011) Sensor selection strategies for state estimation in energy constrained wireless sensor networks. Automatica 47:1330–1338
Koutsopoulos I, Stańczak S (2012) The impact of transmit rate control on energy-efficient estimation in wireless sensor networks. IEEE Trans Wirel Commun 11(9):3261–3271
Wei D, Jin Y, Vural S, Moessner K, Tafazolli R (2011) An energy-efficient clustering solution for wireless sensor networks. IEEE Trans Wirel Commun 10(11):3973–3983
Li B, Li H, Wang W, Yin Q, Liu H (2013) Performance analysis and optimization for energy-efficient cooperative transmission in random wireless sensor network. IEEE Trans Wirel Commun 12(9):4647–4657
Wu Y, Liu W (2013) Routing protocol based on genetic algorithm for energy harvesting-wireless sensor networks. IET Wirel Sens Syst 3(2):112–118
Alippi C, Galperti C (2008) An adaptive system for optimal solar energy harvesting in wireless sensor network nodes. IEEE Trans Circuits Syst I 55(6):1742–1750
Werbos PJ (1977) Advanced forecasting methods for global crisis warning and models of intelligence. Gen Syst Yearb 22:25–38
Werbos PJ (1991) A menu of designs for reinforcement learning over time. In: Miller WT, Sutton RS, Werbos PJ (eds) Neural networks for control. MIT Press, Cambridge, pp 67–95
Wei Q, Wang D, Zhang D (2013) Dual iterative adaptive dynamic programming for a class of discrete-time nonlinear systems with time-delays. Neural Comput Appl 23(7–8):1851–1863
Wang Z, Liu D (2013) A data-based state feedback control method for a class of nonlinear systems. IEEE Trans Ind Inform 9(4):2284–2292
Wei Q, Liu D (2014) A novel iterative \(\theta\)-adaptive dynamic programming for discrete-time nonlinear systems. IEEE Trans Autom Sci Eng 11(4):1176–1190
Heydari A, Balakrishnan SN (2013) Finite-horizon control-constrained nonlinear optimal control using single network adaptive critics. IEEE Trans Neural Netw Learn Syst 24(1):145–157
Liu D, Wei Q (2013) Finite-approximation-error-based optimal control approach for discrete-time nonlinear systems. IEEE Trans Cybern 43(2):779–789
Xu X, Hou Z, Lian C, He H (2013) Online learning control using adaptive critic designs with sparse kernel machines. IEEE Trans Neural Netw Learn Syst 24(5):762–775
Wei Q, Liu D (2012) An iterative \(\epsilon\)-optimal control scheme for a class of discrete-time nonlinear systems with unfixed initial state. Neural Netw 32(6):236–244
Liu D, Javaherian H, Kovalenko O, Huang T (2008) Adaptive critic learning techniques for engine torque and air–fuel ratio control. IEEE Trans Syst Man Cybern Part B Cybern 38(4):988–993
Wei Q, Liu D (2014) Stable iterative adaptive dynamic programming algorithm with approximation errors for discrete-time nonlinear systems. Neural Comput Appl 24(6):1355–1367
Prokhorov DV, Wunsch DC (1997) Adaptive critic designs. IEEE Trans Neural Netw 8(5):997–1007
Liang J, Molina DD, Venayagamoorthy GK, Harley RG (2013) Two-level dynamic stochastic optimal power flow control for power systems with intermittent renewable generation. IEEE Trans Power Syst 28(3):2670–2678
Ni Z, He H, Wen J, Xu X (2013) Goal representation heuristic dynamic programming on maze navigation. IEEE Trans Neural Netw Learn Syst 24(12):2038–2050
Ni Z, He H, Wen J (2013) Adaptive learning in tracking control based on the dual critic network design. IEEE Trans Neural Netw Learn Syst 24(6):913–928
Wei Q, Liu D (2014) Adaptive dynamic programming for optimal tracking control of unknown nonlinear systems with application to coal gasification. IEEE Trans Autom Sci Eng 11(4):1020–1036
Song R, Zhang H (2013) The finite horizon optimal control for a class of time-delay affine nonlinear system. Neural Comput Appl 22(2):229–235
Jiang Y, Jiang ZP (2014) Robust adaptive dynamic programming and feedback stabilization of nonlinear systems. IEEE Trans Neural Netw Learn Syst 25(5):882–893
Bian T, Jiang Y, Jiang ZP (2014) Adaptive dynamic programming and optimal control of nonlinear nonaffine systems. Automatica 50(10):2624–2632
Jiang ZP, Jiang Y (2013) Robust adaptive dynamic programming for linear and nonlinear systems: an overview. Eur J Control 19(5):417–425
Xu X, Lian C, Zuo L, He H (2014) Kernel-based approximate dynamic programming for real-time online learning control: an experimental study. IEEE Trans Control Syst Technol 22(1):146–156
Molina D, Venayagamoorthy GK, Liang J, Harley RG (2013) Intelligent local area signals based damping of power system oscillations using virtual generators and approximate dynamic programming. IEEE Trans Smart Grid 4(1):498–508
Squartini S, Lu J, Wei Q (2013) The neural paradigm for complex systems: new algorithms and applications. Neural Comput Appl 22(2):203–204
Xu H, Jagannathan S (2013) Stochastic optimal controller design for uncertain nonlinear networked control system via neuro dynamic programming. IEEE Trans Neural Netw Learn Syst 24(3):471–484
Modares H, Lewis FL (2014) Optimal tracking control of nonlinear partially-unknown constrained-input systems using integral reinforcement learning. Automatica 50(7):1780–1792
Kiumarsi B, Lewis FL, Modares H, Karimpur A, Naghibi-Sistani MB (2014) Reinforcement Q-learning for optimal tracking control of linear discrete-time systems with unknown dynamics. Automatica 50(4):1167–1175
Modares H, Lewis FL, Naghibi-Sistani MB (2014) Integral reinforcement learning and experience replay for adaptive optimal control of partially-unknown constrained-input continuous-time systems. Automatica 50:193–202
Zhang H, Wei Q, Liu D (2011) An iterative adaptive dynamic programming method for solving a class of nonlinear zero-sum differential games. Automatica 47(1):207–214
Vamvoudakis KG, Lewis FL (2011) Multi-player non-zero-sum games: online adaptive learning solution of coupled Hamilton–Jacobi equations. Automatica 47(8):1556–1569
Zhang H, Wei Q, Luo Y (2008) A novel infinite-time optimal tracking control scheme for a class of discrete-time nonlinear systems via the greedy HDP iteration algorithm. IEEE Trans Syst Man Cybern Part B Cybern 38(4):937–942
Huang Y, Liu D (2014) Neural-network-based optimal tracking control scheme for a class of unknown discrete-time nonlinear systems using iterative ADP algorithm. Neurocomputing 125:46–56
Wei Q, Liu D (2015) Neural-network-based adaptive optimal tracking control scheme for discrete-time nonlinear systems with approximation errors. Neurocomput Part A 149(3):106–115
Modares H, Lewis FL (2014) Linear quadratic tracking control of partially-unknown continuous-time systems using reinforcement learning. IEEE Trans Autom Control 59:3051–3056
Kiumarsi B, Lewis FL, Naghibi-Sistani MB, Karimpour A (2015) Approximate dynamic programming for optimal tracking control of unknown linear systems using measured data. IEEE Trans Cybern. doi:10.1109/TCYB.2014.2384016
Hengster-Movric K, You K, Lewis FL, Xie L (2013) Synchronization of discrete-time multi-agent systems on graphs using Riccati design. Automatica 49(2):414–423
Song R, Xiao W, Zhang H, Sun C (2014) Adaptive dynamic programming for a class of complex-valued nonlinear systems. IEEE Trans Neural Netw Learn Syst 25(9):1733–1739
Bian T, Jiang Y, Jiang ZP (2014) Decentralized adaptive optimal control of large-scale systems with application to power systems. IEEE Trans Ind Electron. doi:10.1109/TIE.2014.2345343
Jiang Y, Jiang ZP (2013) Robust adaptive dynamic programming with an application to power systems. IEEE Trans Neural Netw Learn Syst 24:1150–1156
Wei Q, Liu D, Shi G (2015) A novel dual iterative Q-learning method for optimal battery management in smart residential environments. IEEE Trans Ind Electron 62(4):2509–2518
Wei Q, Wang F-Y, Liu D, Yang X (2014) Finite-approximation-error-based discrete-time iterative adaptive dynamic programming. IEEE Trans Cybern 44(12):2820–2833
Wei Q, Liu D (2014) A novel iterative \(\theta\)-adaptive dynamic programming for discrete-time nonlinear systems. IEEE Trans Autom Sci Eng 11(4):1176–1190
Wei Q, Liu D (2014) Adaptive dynamic programming for optimal tracking control of unknown nonlinear systems with application to coal gasification. IEEE Trans Autom Sci Eng 11(4):1020–1036
Kiumarsi B, Lewis FL (2015) Actor-critic based optimal tracking for partially unknown nonlinear discrete-time systems. IEEE Trans Neural Netw Learn Syst 26(1):140–151
Modares H, Lewis FL, Naghibi-Sistani MB (2013) Adaptive optimal control of unknown constrained-input systems using policy iteration and neural networks. IEEE Trans Neural Netw Learn Syst 24(10):1513–1525
Xiao W, Song, R (2012) Self-learning sensor scheduling for target tracking in wireless sensor networks based on adaptive dynamic programming. In: 10th world congress on intelligent control and automation, pp 1056–1061
Xiao W, Song R (2012) Adaptive dynamic programming for sensor scheduling in energy-constrained wireless sensor networks. In: 15th international conference on information fusion, pp 991–996
Fadare DA (2009) Modelling of solar energy potential in Nigeria using an artificial neural network model. Appl Energy 86:1410–1422
Maheswararajah S, Halgamuge SK, Premaratne M (2009) Sensor scheduling for target tracking by suboptimal algorithms. IEEE Trans Veh Technol 58(3):1467–1479
Acknowledgments
This work was supported in part by the National Natural Science Foundation of China under Grants 61304079, 61374105, in part by the Beijing Natural Science Foundation under Grants 4132078, 4143065, in part by the China Postdoctoral Science Foundation under Grant 2013M530527, and in part by Fundamental Research Funds for the Central Universities under Grant FRF-TP-14-119A2, and in part by the Open Research Project from SKLMCCS under Grants 20150104, 20120106.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Song, R., Wei, Q. & Xiao, W. ADP-based optimal sensor scheduling for target tracking in energy harvesting wireless sensor networks. Neural Comput & Applic 27, 1543–1551 (2016). https://doi.org/10.1007/s00521-015-1954-4
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00521-015-1954-4