Skip to main content

Graph Identification

(Extended Abstract)

  • Conference paper
Advances in Intelligent Data Analysis IX (IDA 2010)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 6065))

Included in the following conference series:

Abstract

There is a growing amount of observational data describing networks– examples include social networks, communication networks, and biological networks. As the amount of available data increases, so has our interest in analyzing these networks in order to uncover (1) general laws that govern their structure and evolution, and (2) patterns and predictive models to develop better policies and practices. However, a fundamental challenge in dealing with this newly available observational data describing networks is that the data is often of dubious quality–it is noisy and incomplete–and before any analysis method can be applied, the data must be cleaned, missing information inferred and mistakes corrected. Skipping this cleaning step can lead to flawed conclusions for things as simple as degree distribution and centrality measures; for more complex analytic queries, the results are even more likely to be inaccurate and misleading.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Benjelloun, O., Garcia-Molina, H., Su, Q., Widom, J.: Swoosh: A generic approach to entity resolution. Technical report, Stanford University (2005)

    Google Scholar 

  2. Bhattacharya, I., Getoor, L.: Collective entity resolution in relational data. ACM Transactions on Knowledge Discovery from Data 1(1) (2007)

    Google Scholar 

  3. Bhattacharya, I., Godbole, S., Joshi, S.: Structured entity identification and document categorization: two tasks with one joint model. In: KDD (2008)

    Google Scholar 

  4. Liben-Nowell, D., Kleinberg, J.: The link prediction problem for social networks. In: CIKM (2003)

    Google Scholar 

  5. Lu, Q., Getoor, L.: Link-based classification. In: ICML (2003)

    Google Scholar 

  6. McDowell, L., Gupta, K.M., Aha, D.W.: Cautious inference in collective classification. In: AAAI (2007)

    Google Scholar 

  7. Neville, J., Jensen, D.: Iterative classification in relational data. In: AAAI Workshop on Learning Statistical Models from Relational Data (2000)

    Google Scholar 

  8. O’Madadhain, J., Hutchins, J., Smyth, P.: Prediction and ranking algorithms for event-based network data. SIGKDD Explorations Newsletter 7(2), 23–30 (2005)

    Article  Google Scholar 

  9. Poon, H., Domingos, P.: Joint unsupervised coreference resolution with Markov logic. In: EMNLP (2008)

    Google Scholar 

  10. Popescul, A., Ungar, L.H.: Statistical relational learning for link prediction. In: IJCAI 2003 Workshop on Learning Statistical Models from Relational Data (2003)

    Google Scholar 

  11. Roth, D., Yih, W.-T.: A linear programming formulation for global inference in natural language tasks. In: CoNLL (2004)

    Google Scholar 

  12. Sarawagi, S.: Information extraction. Foundations and Trends in Databases 1(3) (2008)

    Google Scholar 

  13. Sen, P., Namata, G.M., Bilgic, M., Getoor, L., Gallagher, B., Eliassi-Rad, T.: Collective classification in network data. AI Magazine 29(3), 93–106 (2008)

    Google Scholar 

  14. Singla, P., Domingos, P.: Entity resolution with Markov logic. In: ICDM (2006)

    Google Scholar 

  15. Taskar, B., Wong, M.-F., Abbeel, P., Koller, D.: Link prediction in relational data. In: NIPS (December 2003)

    Google Scholar 

  16. Wick, M.L., Rohanimanesh, K., Schultz, K., McCallum, A.: A unified approach for schema matching, coreference and canonicalization. In: KDD (2008)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2010 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Getoor, L. (2010). Graph Identification. In: Cohen, P.R., Adams, N.M., Berthold, M.R. (eds) Advances in Intelligent Data Analysis IX. IDA 2010. Lecture Notes in Computer Science, vol 6065. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-13062-5_2

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-13062-5_2

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-13061-8

  • Online ISBN: 978-3-642-13062-5

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics