skip to main content
10.1145/2528282.2528296acmotherconferencesArticle/Chapter ViewAbstractPublication PagesbuildsysConference Proceedingsconference-collections
research-article

Online Learning for Personalized Room-Level Thermal Control: A Multi-Armed Bandit Framework

Published: 11 November 2013 Publication History

Abstract

We consider the problem of automatically learning the optimal thermal control in a room in order to maximize the expected average satisfaction among occupants providing stochastic feedback on their comfort through a participatory sensing application. Not assuming any prior knowledge or modeling of user comfort, we first apply the classic UCB1 online learning policy for multi-armed bandits (MAB), that combines exploration (testing out certain temperatures to understand better the user preferences) with exploitation (spending more time setting temperatures that maximize average-satisfaction) for the case when the total occupancy is constant. When occupancy is time-varying, the number of possible scenarios (i.e., which particular set of occupants are present in the room) becomes exponentially large, posing a combinatorial challenge. However, we show that LLR, a recently-developed combinatorial MAB online learning algorithm that requires recording and computation of only a polynomial number of quantities can be applied to this setting, yielding a regret (cumulative gap in average satisfaction with respect to a distribution aware genie) that grows only polynomially in the number of users, and logarithmically with time. This in turn indicates that difference in unit-time satisfaction obtained by the learning policy compared to the optimal tends to 0. We quantify the performance of these online learning algorithms using real data collected from users of a participatory sensing iPhone app in a multi-occupancy room in an office building in Southern California.

References

[1]
Peter Auer, Nicolò Cesa-Bianchi, and Paul Fischer. "Finite-time analysis of the multiarmed bandit problem". In: Machine learning 47.2-3 (2002), pp. 235--256.
[2]
Pablo Bermejo et al. "Design and simulation of a thermal comfort adaptive system based on fuzzy logic and on-line learning". In: Energy and Buildings 49 (2012), pp. 367--379.
[3]
Francesco Calvino et al. "The control of indoor thermal comfort conditions: introducing a fuzzy adaptive controller". In: Energy and Buildings 36.2 (2004), pp. 97--102.
[4]
K Dalamagkidis et al. "Reinforcement learning for energy conservation and comfort in buildings". In: Building and environment 42.7 (2007), pp. 2686--2698.
[5]
David Daum, Frédéric Haldi, and Nicolas Morel. "A personalized measure of thermal comfort for building controls". In: Building and Environment 46.1 (2011), pp. 3--11.
[6]
Poul O Fanger et al. "Thermal comfort. Analysis and applications in environmental engineering." In: Thermal comfort. Analysis and applications in environmental engineering. (1970).
[7]
Yi Gai, Bhaskar Krishnamachari, and Rahul Jain. "Combinatorial network optimization with unknown variables: Multi-armed bandits with linear rewards and individual observations". In: IEEE/ACM Transactions on Networking (TON) 20.5 (2012), pp. 1466--1478.
[8]
A Guillemin and N Morel. "Experimental results of a self-adaptive integrated control system in buildings: a pilot study". In: Solar Energy 72.5 (2002), pp. 397--403.
[9]
Farrokh Jazizadeh, Franco Moiso Marin, and Burcin Becerik-Gerber. "A thermal preference scale for personalized comfort profile identification via participatory sensing". In: Building and Environment 68.0 (2013), pp. 140 --149.
[10]
Farrokh Jazizadeh et al. "A Human-Building Interaction Framework for Personalized Thermal Comfort Driven Systems in Office Buildings". In: Journal of Computing in Civil Engineering (2013).
[11]
D Kolokotsa et al. "Design and installation of an advanced EIB fuzzy indoor comfort controller using Matlab". In: Energy and buildings 38.9 (2006), pp. 1084--1092.
[12]
Yoshifumi Murakami et al. "Field experiments on energy consumption and thermal comfort in the office environment controlled by occupants' requirements from PC terminal". In: Building and Environment 42.12 (2007), pp. 4022--4027.
[13]
Yi Wang et al. "A framework of energy efficient mobile sensing for automatic user state recognition". In: Proceedings of the 7th international conference on Mobile systems, applications, and services. ACM. 2009, pp. 179--192.

Cited By

View all
  • (2024)Smart Crystal Ball on a Budget: Reinforcement Learning and Prediction for Budget-Friendly Comfort2024 IEEE International Conference on Agents (ICA)10.1109/ICA63002.2024.00039(146-151)Online publication date: 4-Dec-2024
  • (2022)Indoor occupancy estimation for smart utilities: A novel approach based on depth sensorsBuilding and Environment10.1016/j.buildenv.2022.109406222(109406)Online publication date: Aug-2022
  • (2021)Control, intervention, and behavioral economics over human social networks against COVID-19Advanced Robotics10.1080/01691864.2021.192855335:11(733-739)Online publication date: 19-May-2021
  • Show More Cited By

Index Terms

  1. Online Learning for Personalized Room-Level Thermal Control: A Multi-Armed Bandit Framework

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    BuildSys '13: Proceedings of the 5th ACM Workshop on Embedded Systems For Energy-Efficient Buildings
    November 2013
    221 pages
    ISBN:9781450324311
    DOI:10.1145/2528282
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    In-Cooperation

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 11 November 2013

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. Online Learning
    2. Personalized Control
    3. Thermal Comfort

    Qualifiers

    • Research-article
    • Research
    • Refereed limited

    Conference

    SenSys '13

    Acceptance Rates

    BuildSys '13 Paper Acceptance Rate 22 of 57 submissions, 39%;
    Overall Acceptance Rate 148 of 500 submissions, 30%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)3
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 07 Mar 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Smart Crystal Ball on a Budget: Reinforcement Learning and Prediction for Budget-Friendly Comfort2024 IEEE International Conference on Agents (ICA)10.1109/ICA63002.2024.00039(146-151)Online publication date: 4-Dec-2024
    • (2022)Indoor occupancy estimation for smart utilities: A novel approach based on depth sensorsBuilding and Environment10.1016/j.buildenv.2022.109406222(109406)Online publication date: Aug-2022
    • (2021)Control, intervention, and behavioral economics over human social networks against COVID-19Advanced Robotics10.1080/01691864.2021.192855335:11(733-739)Online publication date: 19-May-2021
    • (2020)Leveraging Fine-Grained Occupancy Estimation Patterns for Effective HVAC Control2020 IEEE/ACM Fifth International Conference on Internet-of-Things Design and Implementation (IoTDI)10.1109/IoTDI49375.2020.00016(92-103)Online publication date: Apr-2020
    • (2019)Occupancy detection systems for indoor environments: A survey of approaches and methodsIndoor and Built Environment10.1177/1420326X1987562129:8(1053-1069)Online publication date: 16-Sep-2019
    • (2019)OccuThermProceedings of the 6th ACM International Conference on Systems for Energy-Efficient Buildings, Cities, and Transportation10.1145/3360322.3360858(81-90)Online publication date: 13-Nov-2019
    • (2018)Smart HVAC Systems — Adjustable Airflow DirectionAdvanced Computing Strategies for Engineering10.1007/978-3-319-91638-5_10(193-209)Online publication date: 19-May-2018
    • (2016)FORCESProceedings of the 2016 ACM International Joint Conference on Pervasive and Ubiquitous Computing10.1145/2971648.2971700(1188-1199)Online publication date: 12-Sep-2016
    • (2014)An energy-harvesting sensor architecture and toolkit for building monitoring and event detectionProceedings of the 1st ACM Conference on Embedded Systems for Energy-Efficient Buildings10.1145/2674061.2674083(100-109)Online publication date: 3-Nov-2014

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media