skip to main content
10.1145/3209542.3212476acmconferencesArticle/Chapter ViewAbstractPublication PageshtConference Proceedingsconference-collections
tutorial

Efficient Auto-Generation of Taxonomies for Structured Knowledge Discovery and Organization

Published: 03 July 2018 Publication History

Abstract

This tutorial introduces the audience to the latest breakthroughs in the area of interpreting unstructured content through an analysis of the key enabling scientific results along with their real-world applications. With technical presentations of problems like named-entity disambiguation and dynamically updating the knowledge hierarchy with domain-specific vocabulary, it would provide the fundamentals to the building-blocks of various applications in Artificial Intelligence, Natural Language Processing, Machine Learning, and Data Mining.

References

[1]
Luca Maria Aiello, Rossano Schifanella, Daniele Quercia, and Francesco Aletta. 2016. Chatty maps: constructing sound maps of urban areas from social media data. Royal Society open science Vol. 3, 3 (2016), 150690.
[2]
P. Bille. 2005. A survey on tree edit distance and related problems. Theoretical Computer Science Vol. 337, 1 (2005), 217--239.
[3]
David M. Blei. 2012. Probabilistic Topic Models. Commun. ACM Vol. 55, 4 (April. 2012), 77--84.
[4]
Paolo Ferragina and Ugo Scaiella. 2010. TAGME: on-the-fly annotation of short text fragments (by wikipedia entities) Proceedings of the 19th ACM Conference on Information and Knowledge Management, CIKM 2010, Toronto, Ontario, Canada, October 26--30, 2010. ACM, 1625--1628.
[5]
E. B. Fowlkes and C. L. Mallows. 1983. A Method for Comparing Two Hierarchical Clusterings. J. Amer. Statist. Assoc. Vol. 78, 383 (1983), 553--569.
[6]
S. R. Gunn. 1998. Support Vector Machines for Classification and Regression. Technical Report. Univ. of Southampton, USA.
[7]
Victoria Henshaw. 2013. Urban smellscapes: Understanding and designing city smell environments. Routledge.
[8]
Ioana Hulpus, Conor Hayes, Marcel Karnstedt, and Derek Greene. 2013. Unsupervised Graph-based Topic Labelling Using Dbpedia WSDM.
[9]
Leo Katz. 1953. A new status index derived from sociometric analysis. Psychometrika Vol. 18, 1 (1953), 39--43.
[10]
Danai Koutra, Neil Shah, Joshua T. Vogelstein, Brian Gallagher, and Christos Faloutsos. 2016. DeltaCon: A principled massive-graph similarity function with attribution. ACM Transactions on Knowledge Discovery from Data (TKDD) Vol. 10, 3 (2016).
[11]
Tiep Mai, Bichen Shi, Patrick K. Nicholson, Deepak Ajwani, and Alessandra Sala. 2017. Scalable Disambiguation System Capturing Individualities of Mentions Language, Data, and Knowledge - First International Conference, LDK 2017, Galway, Ireland, June 19--20, 2017, Proceedings (Lecture Notes in Computer Science), Vol. Vol. 10318. Springer, 365--379.
[12]
Brian McFee and Gert Lanckriet. 2010. Metric Learning to Rank. In Proceedings of the 27th International Conference on International Conference on Machine Learning. 775--782.
[13]
Tomas Mikolov, Ilya Sutskever, Kai Chen, Greg Corrado, and Jeffrey Dean. 2013. Distributed Representations of Words and Phrases and Their Compositionality Proceedings of the 26th International Conference on Neural Information Processing Systems - Volume 2. 3111--3119.
[14]
Daniele Quercia, Rossano Schifanella, Luca Maria Aiello, and Kate McLean. 2015. Smelly maps: the digital life of urban smellscapes. arXiv preprint arXiv:1505.06851 (2015).
[15]
Jiankai Sun, Deepak Ajwani, Patrick K. Nicholson, Alessandra Sala, and Srinivasan Parthasarathy. 2017. Breaking cycles in noisy hierarchies. In ACM Conference on Web Science. 151--160.

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
HT '18: Proceedings of the 29th on Hypertext and Social Media
July 2018
266 pages
ISBN:9781450354271
DOI:10.1145/3209542
Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 03 July 2018

Check for updates

Author Tags

  1. Katz centrality
  2. entity linking
  3. graph measures
  4. linked knowledge hierarchies
  5. topic labeling
  6. word embeddings

Qualifiers

  • Tutorial

Conference

HT '18
Sponsor:

Acceptance Rates

HT '18 Paper Acceptance Rate 19 of 69 submissions, 28%;
Overall Acceptance Rate 378 of 1,158 submissions, 33%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 84
    Total Downloads
  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 05 Mar 2025

Other Metrics

Citations

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media