Abstract
This paper asks: can cameras in public areas, say ceiling cameras in a museum, send personalized messages to people without knowing the addresses of their phones? We define this problem as Private Human Addressing and develop a real-time end-to-end system called PHADE to solve it. Unlike traditional data transmission protocols, which must first learn the destination's address, our cameras observe users' motion patterns and use the uniqueness of these patterns as the address for communication. On receiving the wireless broadcast from the cameras, a user's phone locally compares the "motion address" of the packet against its own motion sensor data and accepts the packet upon a "good" match.
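The phone-side acceptance step described above can be sketched as follows. This is a minimal illustration, not PHADE's actual matching procedure: the abstract does not specify the similarity metric or the packet layout, so the cosine similarity, the `threshold` value, and the dictionary keys `motion_address` and `payload` are all assumptions made for the example.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length feature vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # a zero vector matches nothing
    return dot / (norm_a * norm_b)


def accept_packet(packet, own_motion, threshold=0.9):
    """Return the payload if the packet's motion address matches the
    phone's own motion features, otherwise None.

    `packet` is a hypothetical dict with keys "motion_address" and
    "payload"; `threshold` is an assumed cut-off for a "good" match.
    """
    address = packet["motion_address"]
    if len(address) != len(own_motion):
        return None
    score = cosine_similarity(address, own_motion)
    return packet["payload"] if score >= threshold else None


# Every phone hears the same broadcast; only the phone whose own
# sensor-derived features resemble the address keeps the message.
broadcast = {"motion_address": [1.0, 0.0, 1.0, 1.0], "payload": "Exhibit 12 audio"}
phone_a = [0.9, 0.1, 1.0, 1.1]   # moved roughly as the camera observed
phone_b = [0.0, 1.0, 0.0, 0.0]   # moved differently

print(accept_packet(broadcast, phone_a))
print(accept_packet(broadcast, phone_b))
```

The comparison runs entirely on the phone, which is why the camera never needs to learn a device address and the phone never needs to upload its sensor data.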
In addition to requiring no data from users, our system transforms the motion patterns into low-dimensional codes to prevent leakage of users' walking behaviors. Thus, a hacker who collects all the broadcast messages would still not be able to infer the motion patterns of users. Real-world experiments show that PHADE discriminates 2, 4, 6, 8, and 10 people with 98%, 95%, 90%, 90%, and 87% correctness, respectively, with a constant delay of about 3 seconds. Since abundant and accurate information can be extracted from videos, PHADE would find applications in various contexts. Extended to a localization system and an audio guide, PHADE achieves a median error of 0.19 m and 99.7% matching correctness, respectively. PHADE can also deliver messages based on human gestures. There is no need to deploy any extra infrastructure or to require users to rent any dedicated device.
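To make the idea of a low-dimensional code concrete, the sketch below projects a motion feature vector onto a small fixed basis. It is illustrative only: the abstract does not say how PHADE computes its codes, so the linear projection, the `BASIS` matrix, and the function name `to_code` are assumptions, and the example is not a privacy proof.

```python
# Hypothetical 2 x 4 basis: maps 4 motion features to a 2-value code.
BASIS = [
    [1.0, 0.0, 0.0, 0.0],
    [0.0, 0.5, 0.5, 0.0],
]


def to_code(features, basis=BASIS):
    """Project a feature vector onto each basis row (a dot product per row)."""
    return [sum(f * b for f, b in zip(features, row)) for row in basis]


# Two different motion feature vectors can share the same code, so an
# eavesdropper who sees only the code cannot uniquely recover the motion.
walk_1 = [1.0, 2.0, 3.0, 4.0]
walk_2 = [1.0, 3.0, 2.0, 9.0]

print(to_code(walk_1))
print(to_code(walk_2))
```

Because the projection discards dimensions, many distinct motion patterns map to one code. That many-to-one property is the intuition behind broadcasting codes rather than raw motion features.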
Index Terms
- Enabling Public Cameras to Talk to the Public