ÎŰÎ۲ÝÝ®ĘÓƵ
Prof. Aditya Mahajan belongs to the Department of Electrical and Computer Engineering and leads the Systems and Control group at CIM.
2023
J. Subramanian, A. Sinha, and A. Mahajan, “Robustness and sample complexity of model-based MARL for general-sum Markov games,” Dynamic Games and Applications, pp. 56–88, March 2023.
S. Sudhakara, A. Mahajan, A. Nayyar, and Y. Ouyang, “Scalable regret for learning to control network-coupled subsystems with unknown dynamics,” IEEE Transactions on Control of Networked Systems, vol. 10, no. 1, pp. 2–14, March 2023.
M. Afshari and A. Mahajan, “Decentralized linear quadratic systems with major and minor agents and non-Gaussian noise,” IEEE Transactions on Automatic Control, vol. 68, no. 8, pp. 4666–4681, Aug 2023
H. Nekoei, A. Badrinaaraayanan, A. Sinha, M. Amini, J. Rajendran, A. Mahajan, and S. Chandar, “Dealing With Non-stationarity in Decentralized Cooperative Multi-Agent Deep Reinforcement Learning via Multi-Timescale Learning,” Conference on Lifelong Learning Agents, Montreal, Aug 2023.
B. Sayedana, M. Afshari, P.E. Caines, and A. Mahajan, “Almost Sure Regret Bounds for Certainty Equivalence Control of Markov Jump Systems,” IEEE Conference of Decision and Control, Singapore, Dec 2023.
A. Sinha and A. Mahajan, “Asymmetric Actor Critic with Approximate Information State,” IEEE Conference on Decision and Control, Singapore, Dec 2023.
B. Bozkurt, A. Mahajan, A. Nayyar, and Y. Ouyang, “Weighted norm bounds in MDPs with unbounded per-step cost,” IEEE Conference on Decision and Control, Singapore, Dec 2023.
J. Subramanian, A. Sinha, and A. Mahajan, “Robustness and sample complexity of model-based reinforcement learning for general-sum Markov games,” Dynamics Games and Applications Workshop, Paris, France, Oct 2023.
J. Sumbramanian, A. Kumar, and A. Mahajan, "Mean-field games among teams," PGMODays, Paris, France, Nov 2023.
T. Ni, B. Eysenbach, E. SeyedSalehi , M. Ma, C. Gehring, A. Mahajan, and P.L. Bacon, “Bridging State and History Representations: Understanding self-predictive RL,” Neurips workshop on Self-Supervised Learning - Theory and Practice, New Orleans, USA, Dec 2023.
G. Patil , A. Mahajan, and D. Precup, “On learning history-based policies for controlling Markov decision processes,” ICML Workshop on New Frontiers in Learning, Control, and Dynamical Systems, Hawaii, USA, July 2023.
A. Sinha and A. Mahajan, “Asymmetric Actor Critic with Approximate Information State,” Meeting on Systems and Control Theory, Waterloo, ON, May 2023.
A. Sinha and A. Mahajan, “Asymmetric Actor Critic with Approximate Information State,” Les Cahiers du GERAD, no. G-2023-57, Nov 2023.
B. Sayedana, M. Afshari, P.E. Caines, and A. Mahajan, “Strong consistency and rate of convergence of switched least squares system identification for autonomous switched Markov jump linear systems,” Les Cahiers du GERAD, no. G-2023-05, Mar 2023.
N. Akbarzadeh and A. Mahajan, “On learning Whittle index policy for restless bandits with scalable regret”, Workshop on restless bandits, index policies and applications in reinforcement learning, University of Grenoble-Alpes, Nov 2023.
2022
Jayakumar Subramanian et al. “Approximate information state for approximate planning and reinforcement learning in partially observed systems”. In: The Journal of Machine Learning Research 23.1 (2022), pp. 483–565.
Shuang Gao and Aditya Mahajan. “Optimal control of network-coupled subsystems: Spectral decomposition and low-dimensional solutions”. In: IEEE Transactions on Control of Network Systems 9.2 (2021), pp. 657–669.
Abhinav Dahiya et al. “Scalable operator allocation for multirobot assistance: A restless bandit approach”. In: IEEE Transactions on Control of Network Systems 9.3 (2022), pp. 1397–1408.
Nima Akbarzadeh and Aditya Mahajan. “Conditions for indexability of restless bandits and an O (Kˆ 3) algorithm to compute Whittle index”. In: Annals of Applied Probability 54.4 (2022), pp. 1164–1192.
Anirudha Jitani et al. “Structure-aware reinforcement learning for node-overload protection in mobile edge computing”. In: IEEE Transactions on Cognitive Communications and Networking 8.4 (2022), pp. 1881–1897.
Gandharv Patil, Aditya Mahajan, and Doina Precup. “On learning history based policies for controlling Markov decision processes”. In: Conference on Reinforcement Learning and Decision Making (RLDM). 2022.
Nima Akbarzadeh and Aditya Mahajan. “Partially observable restless bandits with restarts: Indexability and computation of Whittle index”. In: 2022 IEEE 61st Conference on Decision and Control (CDC). IEEE. 2022, pp. 4898–4904.
Mukul Gagrani et al. “A modified Thompson sampling-based learning algorithm for unknown linear systems”. In: 2022 IEEE 61st Conference on Decision and Control (CDC). IEEE. 2022, pp. 6658–6665.
Hadi Nekoei et al. “Dealing With Nonstationarity in Decentralized Cooperative Multi-Agent Deep Reinforcement Learning via Multi-Timescale Learning”. In: 2023.
Hadi Nekoei et al. “Staged independent learning: Towards decentralized cooperative multiagent Reinforcement Learning”. In: ICLR 2022 Workshop on Gamification and Multiagent Solutions. 2022.
2021
M. Afshari and A. Mahajan. “Multi-agent estimation and filtering for minimizing team mean-squared error,” IEEE Transactions on Signal Processing, vol. 69, pp. 5206–5221, Aug 2021.
A. Jitani, A. Mahajan, Z. Zhu, H. Abou-zeid, E.T. Fapi, and H. Purmehdi. “Structure-aware reinforcement learning for node overload protection in mobile edge computing,” IEEE International Conference on Communications, Montreal, Canada, June 2021.
K. Kaza, A. Mahajan, and J. Le Ny. “Decision referrals in human-automation teams,” IEEE Conference on Decision and Control, Austin, TX, Dec 2021.
R. Seraj, A. Mahajan, and J. Le Ny. “Mean-field approximation for large-population beauty-contest games,” IEEE Conference on Decision and Control, Austin, TX, Dec 2021.
M. Gagrani, S. Sudhakara, A. Mahajan, A. Nayyar, and Y. Ouyang. “Thompson sampling for linear quadratic mean-field teams,” IEEE Conference on Decision and Control, Austin, TX, Dec 2021. (invited talk)
J. Subramanian, A. Sinha, and A. Mahajan. “Robustness of Markov perfect equilibrium to model approximations in general-sum dynamic games,” IEEE Indian Control Conference, Mumbai, India, Dec 2021. (invited talk)
N. Akbarzadeh and A. Mahajan. “Maintenance of a collection of machines under partial observability: Indexability and computation of Whittle index,” Les Cahiers du GERAD, no. G-2021-26, April 2021.
2020
M. Afshari and A. Mahajan, "Optimal local and remote controllers with unreliable uplink channels: An elementary proof." IEEE Transactions on Automatic Control, vol 65. no 8 pp. 3606-3622, Aug 2020
J. Subramanian and A. Mahajan, "Renewal Monte Carlo: Renewal theory based reinforcement learning," IEEE Transactions on Automatic Control, vol. 65, no. 8, pp. 3663-3670. Aug 2020.
B. Sayedana and A. Mahajan, "Counterexamples on the monotonicity of delay optimal strategies for energy harvesting transmitters," IEEE Wireless Communication Letters, vol. 9, no. 7. pp. 1070-1074, Jul 2020
J. Chakravorty and A. Mahajan, "Remote estimation over packet-drop channel with Markovian state," IEEE Transactions on Automatic Control, vol. 65, nno. 5, pp. 2016-2031, May 2020.
B. Sayedana, A. Mahajan, and E. Yeh, "Cross-layer communication over fading channels with adaptive decision feedback," International Symposium on Modeling and Optimization in Mobile, Ad Hov, and Wireless Networks (WiOpt), Jun 2020.
N. Akbarzadeh and A. Mahajan, "Restless bandits, indexability, and computation of Whittle Index," Les Cahiers du GERAD, no G-2020-34, June 2020.
Z. Zhu, H. Abou-zeid, A. Mahajan, and A. Jitani*, "Overload protection for edge cluster using two-tier reinforcement learning models", submitted US patent 4015-11327/P081852WO01
2019
J. Subramanian and A. Mahajan, “Reinforcement learning in stationary mean-field games,” International Conference on Autonomous Agents and Multiagent Systems (AAMAS), Montreal, Canada, 13–17 May, 2019.
J. Subramanian and A. Mahajan, “Approximate information state for partially observed systems,” IEEE Conference on Decision and Control, Nice, France, 11–13 Dec, 2019. (invited talk)
N. Akbarzadeh and A. Mahajan, “Restless bandits with controlled restarts: Indexability and computation of Whittle index,” IEEE Conference on Decision and Control, Nice, France, 11–13 Dec, 2019.
S. Gao and A. Mahajan, “Networked control of coupled subsystems: Spectral decomposition and low-dimensional solutions,” IEEE Conference on Decision and Control, Nice, France, 11–13 Dec, 2019.
N. Akbarzadeh and A. Mahajan, “Dynamic spectrum access under partial observations: A restless bandit approach,” Canadian Workshop on Information Theory (CWIT), Hamilton, Ontario, June 2–5, 2019.
J. Subramanian and A. Mahajan, “Approximate information state for partially observed systems,” Conference on Reinforcement Learning and Decision Making (RLDM), Montreal, Canada, 7–10 July, 2019.
J. Subramanian, R. Seraj, and A. Mahajan, “Reinforcement learning for mean-field teams,” Conference on Reinforcement Learning and Decision Making (RLDM), Montreal, Canada, 7–10 July, 2019.
J. Subramanian and A. Mahajan, “Approximate information state for partially observed systems,” Neural Information Processing Systems (NeurIPS) Workshop on Optimization Foundations of Machine Learning, Vancouver, Canada, 14 Dec, 2019.
A. Mahajan and J. Subramanian, “Representation Learning via state aggregation: A perspective of control over communication channels,” Neural Information Processing Systems (NeurIPS) Workshop on Information Theory and Machine Learning, Vancouver, Canada, 7 Dec, 2019.
J. Subramanian, R. Seraj, and A. Mahajan, “Reinforcement learning for mean-field teams,” AAMAS Workshop on Adaptive and Learning Agents (ALA), Montreal, Canada, 13–17 May, 2019.
J. Subramanian, A. Kumar, and A. Mahajan, “Mean-field games between teams,” 11th Workshop on Dynamic Games in Management Science, Montreal, Canada, 24–25 Oct, 2019.
J. Subramanian and A. Mahajan, “Approximate dynamic programming and reinforcement learning for partially observed systems,” Montreal AI Symposium, Montreal, Canada, 6 Sep, 2019.
J. Subramanian and A. Mahajan, “Reinforcement learning in stationary mean-field games,” Information Theory and Applications (ITA) Workshop, San Diego, CA, 11–15 Feb, 2019.
J. Subramanian and A. Mahajan, “Reinforcement learning in stationary mean-field games,” Les Cahiers du GERAD, no. G-2019-18, March 2019.
2018
J. Chakravorty and A. Mahajan, “Sufficient conditions for the value function and optimal strategy to be even and quasi-convex,” IEEE Transactions on Automatic Control, pp. 3858–3864, Nov 2018.
S. Li, A. Khisti, and A. Mahajan, “Information-theoretic privacy for smart metering systems with a rechargeable battery,” IEEE Transactions on Information Theory, pp. 3679–3695, May 2018.
M. Afshari and A. Mahajan, “Team optimal decentralized state estimation,” IEEE Conference on Decision and Control, Miami, Florida, Dec 17–19, 2018.
S. Mathew, K.H. Johannson, and A. Mahajan, “Optimal sampling of multiple linear processes over a shared medium,” IEEE Conference on Decision and Control, Miami, Florida, Dec 17–19, 2018.
J. Subramanian, A. Mahajan, and A.A. Paranjape, “On Controllability of Leader-Follower Dynamics over a Directed Graph,” IEEE Conference on Decision and Control, Miami, Florida, Dec 17–19, 2018.
J. Subramanian and A. Mahajan, “Renewal Monte Carlo: Renewal theory based reinforcement learning,” IEEE Conference on Decision and Control, Miami, Florida, Dec 17–19, 2018.
M. Afshari and A. Mahajan, “Optimal decentralized control of two agent linear system with partial output feedback: certainty equivalence and optimality of linear strategies,” IFAC Workshop on Distributed Estimation and Control in Networked Systems, Groningen, Netherlands, August 27-28, 2018.
J. Subramanian and A. Mahajan, “A policy gradient algorithm to compute boundedly rational stationary mean field equilibria,” ICML/IJCAI/AAMAS Workshop on Planning and
Learning (PAL-18), Stockholm, Sweden, July 13–15, 2018
2017
J. Chakravorty and A. Mahajan, “Fundamental limits of remote estimation of autoregressive Markov processes under communication constraints,” IEEE Transactions on Automatic Control, pp. 1109–1124, March 2017.
C. Ma*, A. Mahajan, and B. Meyer, “Multi-armed bandits for efficient lifetime estimation in MPSoC design,” Design, Automation and Test in Europe (DATE), Laussane, Switzerland, Mar 23–27, 2017.
J. Chakravorty, J. Subramanian, and A. Mahajan, “Stochastic approximation based methods for computing the optimal thresholds in remote-state estimation with packet drops,” American Control Conference, Seattle, WA, May 24–26, 2017.
J. Chakravorty and A. Mahajan, “Structure of optimal strategies for remote estimation over Gilbert-Elliott channel with feedback,” IEEE International Symposium of Information Theory (ISIT), Aachen, Germany, Jun 25–30, 2017.
M. Afshari and A. Mahajan, “Static teams with common information,” IFAC World Congress, Toulouse, France, Jul 9–14, 2017.
A. Mahajan, “Remote estimation over control area networks,” IEEE Vehicular Technology Conference (VTC), Networked Vehicles for Intelligent Transportation and Smart Grids (NetV) Workshop, Toronto, Canada, Sep 24–27, 2017.
Y. Liu, A. Khisti, and A. Mahajan, “On Privacy in Smart Metering Systems with Periodically Time-Varying Input Distribution,” GlobalSIP Symposium on Control and Information Theoretic Approaches to Security and Privacy, Montreal, Canada, Nov 14–16, 2017.
J.Subramanian, J.Chakravorty, and A.Mahajan,“Renewal theory based reinforcement learning for Markov processes,” Optimization Days, Montreal, QC, May 10–11, 2017.
Mahajan, “When to observe a Markov process,” INFORMS Applied Probability Society Conference, Evanston, IL, July 10–12, 2017.
M. Afshari and A. Mahajan, “Decentralized Kalman Filtering,” Fields Institute Workshop on Stochastic Processes and their Applications, Carleton University, Ottawa, ON, Aug 9–11, 2017.
J. Subramanian and A. Mahajan, “A new policy based RL algorithm with reduced bias and variance,” Montreal AI Symposium, Montreal, QC, Sep 26, 2017.
M. Afshari and A. Mahajan, “Team optimal decentralized filtering with coupled cost,” Ninth Workshop on Dynamic Games in Management Science, Montreal, QC, Oct 12–13, 2017.
M. Afshari and A. Mahajan, “Static teams with common information,” Les Cahiers du GERAD, no. G-2017-29, April 2017.