ABSTRACT
Data generators have been heavily used in creating massive trajectory datasets to address common challenges of real-world datasets, including privacy, cost of data collection, and data quality. However, such generators often overlook social and physiological characteristics of individuals and as such their results are often limited to simple movement patterns. To address these shortcomings, we propose an agent-based simulation framework that facilitates the development of behavioral models in which agents correspond to individuals that act based on personal preferences, goals, and needs within a realistic geographical environment. Researchers can use a drag-and-drop interface to design and control their own world including the geospatial and social (i.e. geo-social) properties. The framework is capable of generating and streaming very large data that captures the basic patterns of life in urban areas. Streaming data from the simulation can be accessed in real time through a dedicated API.
- Apple privacy policy. https://www.apple.com/legal/privacy/, Accessed Mar 15, 2019.Google Scholar
- JGraphX. https://github.com/jgraph/jgraphx, Accessed Mar 15, 2019.Google Scholar
- New york city taxi and limousine commission, trip record data. http://www.nyc.gov/html/tlc/html/about/trip_record_data.shtml, Accessed Mar 15, 2019.Google Scholar
- C. P. Alderfer. An empirical test of a new theory of human needs. Organizational behavior and human performance, 4(2):142--175, 1969.Google Scholar
- T. Brinkhoff. A Framework for Generating Network-Based Moving Objects. GeoInformatica, 6(2):153--180, 2002.Google ScholarDigital Library
- A. Crooks, N. Malleson, E. Manley, and A. Heppenstall. Agent-Based Modelling and Geographical Information Systems: A Practical Primer. SAGE Publications Limited, 2018.Google Scholar
- Y.-A. De Montjoye, C. A. Hidalgo, M. Verleysen, and V. D. Blondel. Unique in the crowd: The privacy bounds of human mobility. Scientific reports, 3:1376, 2013.Google ScholarCross Ref
- E. Deci and R. M. Ryan. Intrinsic motivation and self-determination in human behavior. Springer Science & Business Media, 1985.Google ScholarCross Ref
- R. J. Gore, C. J. Lynch, and H. Kavak. Applying statistical debugging for enhanced trace validation of agent-based models. SIMULATION, 93(4):273--284, 2017.Google ScholarDigital Library
- H. Kavak, J.-S. Kim, A. Crooks, D. Pfoser, C. Wenk, and A. Züfle. Location-based social simulation. In SSTD, pages 218--221. ACM, 2019.Google ScholarDigital Library
- J.-S. Kim, H. Kavak, and A. Crooks. Procedural city generation beyond game development. SIGSPATIAL Special, 10(2):34--41, 2018.Google ScholarDigital Library
- W. Liu, Y. Zheng, S. Chawla, J. Yuan, and X. Xing. Discovering spatio-temporal causal interactions in traffic data streams. In ACM SIGKDD, pages 1010--1018. ACM, 2011.Google ScholarDigital Library
- S. Luke, C. Cioffi-Revilla, L. Panait, K. Sullivan, and G. Balan. Mason: A multiagent simulation environment. Simulation, 81(7):517--527, 2005.Google ScholarDigital Library
- K. Lynch. The image of the city, volume 11. MIT press, 1960.Google Scholar
- A. H. Maslow. A theory of human motivation. Psychological review, 50(4):370, 1943.Google ScholarCross Ref
- D. C. McClelland, J. W. Atkinson, R. A. Clark, and E. L. Lowell. The achievement motive. 1953.Google Scholar
- M. F. Mokbel, L. Alarabi, J. Bao, A. Eldawy, A. Magdy, M. Sarwat, E. Waytas, and S. Yackel. Mntg: an extensible web-based traffic generator. In SSTD, pages 38--55. Springer, 2013.Google ScholarCross Ref
- J. D. Park, E. Seglem, E. Lin, and A. Züfle. Protecting user privacy: Obfuscating discriminative spatio-temporal footprints. In ACM SIGSPATIAL LocalRec Workshop, article 2. ACM, 2017.Google ScholarDigital Library
- D. Pfoser and Y. Theodoridis. Generating semantics-based trajectories of moving objects. Computers, Environment and Urban Systems, 27(3):243--263, 2003.Google ScholarCross Ref
- K. Ramamohanarao, H. Xie, L. Kulik, S. Karunasekera, E. Tanin, R. Zhang, and E. B. Khunayn. Smarts: Scalable microscopic adaptive road traffic simulator. ACM TIST, 8(2):26, 2017.Google ScholarDigital Library
- A. Sadilek, H. A. Kautz, and V. Silenzio. Predicting disease transmission from geo-tagged micro-blog data. In AAAI, pages 136--142, 2012.Google Scholar
- K. Sullivan, M. Coletti, and S. Luke. Geomason: Geospatial support for mason. Technical report, George Mason University, 2010.Google Scholar
- L.-Y. Wei, Y. Zheng, and W.-C. Peng. Constructing popular routes from uncertain trajectories. In ACM SIGKDD, pages 195--203. ACM, 2012.Google ScholarDigital Library
- A. Y. Xue, R. Zhang, Y. Zheng, X. Xie, J. Huang, and Z. Xu. Destination prediction by sub-trajectory synthesis and privacy protection against such prediction. In ICDE, pages 254--265. IEEE, 2013.Google ScholarDigital Library
- J. Yuan, Y. Zheng, C. Zhang, W. Xie, X. Xie, G. Sun, and Y. Huang. T-drive: driving directions based on taxi trajectories. In ACM SIGSPATIAL, pages 99--108. ACM, 2010.Google ScholarDigital Library
- J. Yuan, Y. Zheng, L. Zhang, X. Xie, and G. Sun. Where to find my next passenger. In Ubiquitous Computing, pages 109--118. ACM, 2011.Google Scholar
Index Terms
- Simulating Urban Patterns of Life: A Geo-Social Data Generation Framework
Recommendations
Location-Based Social Simulation
SSTD '19: Proceedings of the 16th International Symposium on Spatial and Temporal DatabasesLocation-based social networks (LBSNs) have been studied extensively in recent years. However, utilizing real-world LBSN datasets in such studies has severe weaknesses: sparse and small datasets, privacy concerns, and a lack of authoritative ground-...
A Framework to Generate Synthetic Multi-label Datasets
A controlled environment based on known properties of the dataset used by a learning algorithm is useful to empirically evaluate machine learning algorithms. Synthetic (artificial) datasets are used for this purpose. Although there are publicly ...
A tool for generating synthetic data
DATA '18: Proceedings of the First International Conference on Data Science, E-learning and Information SystemsIt is popular to use real-world data to evaluate data mining techniques. However, there are some disadvantages to use real-world data for such purposes. Firstly, real-world data in most domains is difficult to obtain for several reasons, such as budget, ...
Comments