skip to main content
10.1145/2339530.2339533acmconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
keynote

Mining heterogeneous information networks: the next frontier

Published: 12 August 2012 Publication History

Abstract

Real world physical and abstract data objects are interconnected, forming gigantic, interconnected networks. By structuring these data objects into multiple types, such networks become semi-structured heterogeneous information networks. Most real world applications that handle big data, including interconnected social media and social networks, scientific, engineering, or medical information systems, online e-commerce systems, and most database systems, can be structured into heterogeneous information networks. For example, in a medical care network, objects of multiple types, such as patients, doctors, diseases, medication, and links such as visits, diagnosis, and treatments are intertwined together, providing rich information and forming heterogeneous information networks. Effective analysis of large-scale heterogeneous information networks poses an interesting but critical challenge.
In this talk, we present a set of data mining scenarios in heterogeneous information networks and show that mining heterogeneous information networks is a new and promising research frontier in data mining research. Departing from many existing network models that view data as homogeneous graphs or networks, the semi-structured heterogeneous information network model leverages the rich semantics of typed nodes and links in a network and can uncover surprisingly rich knowledge from interconnected data. This heterogeneous network modeling will lead to the discovery of a set of new principles and methodologies for mining interconnected data. The examples to be used in this discussion include (1) meta path-based similarity search, (2) rank-based clustering, (3) rank-based classification, (4) meta path-based link/relationship prediction, (5) relation strength-aware mining, as well as a few other recent developments. We will also point out some promising research directions and provide convincing arguments on that mining heterogeneous information networks is the next frontier in data mining.

Supplementary Material

JPG File (keynote_2.jpg)
MP4 File (keynote_2.mp4)

References

[1]
M. Ji, J. Han, and M. Danilevsky. Ranking-based classification of heterogeneous information networks. In KDD'11, San Diego, CA, Aug. 2011.
[2]
M. Ji, Y. Sun, M. Danilevsky, J. Han, and J. Gao. Graph regularized transductive classification on heterogeneous information networks. In ECMLPKDD'10, Barcelona, Spain, Sept. 2010.
[3]
Y. Sun, C. C. Aggarwal, and J. Han. Relation strength-aware clustering of heterogeneous information networks with incomplete attributes. PVLDB, 5:394--405, 2012.
[4]
Y. Sun, R. Barber, M. Gupta, C. Aggarwal, and J. Han. Co-author relationship prediction in heterogeneous bibliographic networks. In ASONAM'11, Kaohsiung, Taiwan, July 2011.
[5]
Y. Sun and J. Han. Mining Heterogeneous Information Networks: Principles and Methodologies. Morgan & Claypool Publishers, 2012.
[6]
Y. Sun, J. Han, X. Yan, P. S. Yu, and T. Wu. PathSim: Meta path-based top-k similarity search in heterogeneous information networks. In VLDB'11, Seattle, WA, Aug. 2011.
[7]
Y. Sun, J. Han, P. Zhao, Z. Yin, H. Cheng, and T. Wu. RankClus: Integrating clustering with ranking for heterogeneous information network analysis. In EDBT'09, Saint-Petersburg, Russia, Mar. 2009.
[8]
Y. Sun, Y. Yu, and J. Han. Ranking-based clustering of heterogeneous information networks with star network schema. In KDD'09, Paris, France, June 2009.
[9]
C. Wang, J. Han, Y. Jia, J. Tang, D. Zhang, Y. Yu, and J. Guo. Mining advisor-advisee relationships from research publication networks. In KDD'10, Washington D.C., July 2010.

Cited By

View all
  • (2024)Graph Representation Learning Based on Cognitive Spreading ActivationsIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2024.343778136:12(8408-8420)Online publication date: Dec-2024
  • (2023)Detecting Medical Insurance Fraud Using a Heterogeneous Information Network with a Multi-behavior PatternComputer Science and Education10.1007/978-981-99-2443-1_60(704-720)Online publication date: 14-May-2023
  • (2021)A survey on : Online Social Networking Attacks Detection TechniquesInternational Journal of Scientific Research in Computer Science, Engineering and Information Technology10.32628/CSEIT21732(44-50)Online publication date: 1-May-2021
  • Show More Cited By

Index Terms

  1. Mining heterogeneous information networks: the next frontier

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    KDD '12: Proceedings of the 18th ACM SIGKDD international conference on Knowledge discovery and data mining
    August 2012
    1616 pages
    ISBN:9781450314626
    DOI:10.1145/2339530

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 12 August 2012

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. data mining
    2. heterogeneous information networks

    Qualifiers

    • Keynote

    Conference

    KDD '12
    Sponsor:

    Acceptance Rates

    Overall Acceptance Rate 1,133 of 8,635 submissions, 13%

    Upcoming Conference

    KDD '25

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)9
    • Downloads (Last 6 weeks)2
    Reflects downloads up to 02 Mar 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Graph Representation Learning Based on Cognitive Spreading ActivationsIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2024.343778136:12(8408-8420)Online publication date: Dec-2024
    • (2023)Detecting Medical Insurance Fraud Using a Heterogeneous Information Network with a Multi-behavior PatternComputer Science and Education10.1007/978-981-99-2443-1_60(704-720)Online publication date: 14-May-2023
    • (2021)A survey on : Online Social Networking Attacks Detection TechniquesInternational Journal of Scientific Research in Computer Science, Engineering and Information Technology10.32628/CSEIT21732(44-50)Online publication date: 1-May-2021
    • (2021)A Meta-Path-Based Prediction Method for Disease Comorbidities2021 IEEE 34th International Symposium on Computer-Based Medical Systems (CBMS)10.1109/CBMS52027.2021.00022(219-224)Online publication date: Jun-2021
    • (2019)Coupled Tensor Decomposition for User Clustering in Mobile Internet Traffic Interaction PatternIEEE Access10.1109/ACCESS.2019.28942677(18113-18124)Online publication date: 2019
    • (2019)SemRecWorld Wide Web10.1007/s11280-018-0553-622:1(153-184)Online publication date: 1-Jan-2019
    • (2018)Supervised ranking framework for relationship prediction in heterogeneous information networksApplied Intelligence10.1007/s10489-017-1044-748:5(1111-1127)Online publication date: 1-May-2018
    • (2018)A k-NN-Based Approach Using MapReduce for Meta-path Classification in Heterogeneous Information NetworksSoft Computing in Data Analytics10.1007/978-981-13-0514-6_28(277-284)Online publication date: 22-Aug-2018
    • (2016)Deep graphs—A general framework to represent and analyze heterogeneous complex systems across scalesChaos: An Interdisciplinary Journal of Nonlinear Science10.1063/1.495296326:6(065303)Online publication date: Jun-2016
    • (2016)S-Rank: A Supervised Ranking Framework for Relationship Prediction in Heterogeneous Information NetworksTrends in Applied Knowledge-Based Systems and Data Science10.1007/978-3-319-42007-3_26(305-319)Online publication date: 14-Jul-2016
    • Show More Cited By

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media