Das Fachgebiet „Informationssysteme“ am Hasso-Plattner-Institut

Naumann, Felix; Krestel, Ralf

doi:10.1007/s13222-016-0239-0

Das Fachgebiet „Informationssysteme“ am Hasso-Plattner-Institut

Datenbankgruppen vorgestellt
Published: 03 January 2017

Volume 17, pages 69–76, (2017)
Cite this article

Datenbank-Spektrum Aims and scope Submit manuscript

Felix Naumann¹ &
Ralf Krestel¹

393 Accesses
1 Citation
Explore all metrics

Zusammenfassung

Das Hasso-Plattner-Institut (HPI) ist ein privat finanziertes Institut an der Universität Potsdam. Stifter ist Professor Hasso Plattner, Mitgründer und Aufsichtsratsvorsitzender des Software-Konzerns SAP. Das Fachgebiet Informationssysteme, das von Prof. Dr. Felix Naumann geleitet wird, beschäftigt sich mit dem effizienten und effektiven Umgang mit heterogenen Daten und Texten. Gegründet wurde das Fachgebiet 2006 und bietet derzeit 12 Doktoranden und circa 15 Masterstudenten eine Forschungsumgebung.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Literatur

Abedjan Z, Gruetze T, Jentzsch A, Naumann F (2014) Profiling and mining RDF data with ProLOD. In: Proceedings of the International Conference on Data Engineering (ICDE). IEEE Computer Society, Washington DC, S 1198–1201 (Demo)
Google Scholar
Abedjan Z, Golab L, Naumann F (2015) Profiling relational data: a survey. VLDB J 24(4):557–581
Article Google Scholar
Agichtein E, Gravano L (2000) Snowball: extracting relations from large plain-text collections. In: Proceedings of the ACM Conference on Digital Libraries. ACM, New York, S 85–94
Chapter Google Scholar
Agrawal D, Bernstein P, Bertino E, Davidson S, Dayal U, Franklin M, Widom J et al (2012) Challenges and opportunities with Big Data. Technical report, Computing Community Consortium. http://cra.org/ccc/docs/init/bigdatawhitepaper.pdf. Zugegriffen: 1.1.2017
Google Scholar
Draisbach U, Naumann F, Szott S, Wonneberg O (2012) Adaptive windows for duplicate detection. In: Proceedings of the International Conference on Data Engineering (ICDE). IEEE Computer Society, Washington DC, S 1073–1083
Google Scholar
Godde C, Lazaridou K, Krestel R (2016) Classification of German newspaper comments. In: Proceedings of the Conference Lernen, Wissen, Daten, Analysen (LWDA). Hasso Plattner Institut, Potsdam, S 299–310
Google Scholar
Gruetze T, Yao G, Krestel R (2015) Learning temporal tagging behaviour. In: Proceedings of the Temporal Web Analytics Workshop (TempWeb) at the International World Wide Web Conference (WWW). ACM, New York, S 1333–1338
Gruetze T, Krestel R, Naumann F (2016) Topic shifts in StackOverflow: ask it like Socrates. In: Proceedings of the 21st International Conference on Applications of Natural Language to Information Systems (NLDB), Bd. 9612. Springer, Heidelberg, S 213–221
Google Scholar
Hernández MA, Stolfo SJ (1998) Real-world data is dirty: data cleansing and the merge/purge problem. Data Min Knowl Discov 2(1):9–37
Article Google Scholar
Huhtala Y, Kärkkäinen J, Porkka P, Toivonen H (1999) TANE: an efficient algorithm for discovering functional and approximate dependencies. Comput J 42(2):100–111
Article MATH Google Scholar
Köhler H, Link S, Zhou X (2015) Possible and certain SQL keys. Proceedings VLDB Endowment 8(11):1118–1129
Article Google Scholar
Krestel R, Wall A, Nejdl W (2012) Treehugger or Petrolhead? Identifying bias by comparing online news articles with political speeches. In: Proceedings of the International World Wide Web Conference (WWW). ACM, New York, S 547–548
Google Scholar
Krestel R, Werkmeister T, Wiradarma TP, Kasneci G (2015) Tweet-recommender: finding relevant tweets for news articles. In: Proceedings of the International World Wide Web Conference (WWW). ACM, New York, S 53–54
Chapter Google Scholar
Kruse S, Jentzsch A, Papenbrock T, Kaoudi Z, Quiane-Ruiz JA, Naumann F (2016) RDfind: scalable conditional inclusion dependency discovery in RDF datasets. In: Proceedings of the International Conference on Management of Data (SIGMOD). ACM, New York, S 953–967
Lange D, Naumann F (2011) Efficient similarity search: arbitrary similarity measures, arbitrary composition. In: Proceedings of the International Conference on Information and Knowledge Management (CIKM). ACM, New York, S 1679–1688
Google Scholar
Papenbrock T, Bergmann T, Finke M, Zwiener J, Naumann F (2015) Data profiling with Metanome (demo). Proceedings VLDB Endowment 8(12):1860–1871
Article Google Scholar
Papenbrock T, Ehrlich J, Marten J, Neubert T, Rudolph JP, Schönberg M, Zwiener J, Naumann F (2015) Functional dependency discovery: an experimental evaluation of seven algorithms. Proceedings VLDB Endowment 8(10):1082–1093
Article Google Scholar
Papenbrock T, Naumann F (2017) A hybrid approach for efficient unique column combination discovery. In: Proc. der Fachtagung Business, Technologie und Web (BTW). GI, Bonn, Deutschland (accepted)
Google Scholar
Park J, Blume-Kohout M, Krestel R, Nalisnick E, Smyth P (2016) Analyzing NIH funding patterns over time with statistical text analysis. In: Scholarly Big Data: AI Perspectives, Challenges, and Ideas, Workshop at AAAI. AAAI Press, Palo Alto, CA, S 698–704
Google Scholar
Weis M, Naumann F, Jehle U, Lufter J, Schuster H (2008) Industry-scale duplicate detection. Proceedings VLDB Endowment 1(2):1253–1264
Article Google Scholar
Zuo Z, Kasneci G, Gruetze T, Naumann F (2014) BEL: bagging for entity linking. In: Proceedings of the International Conference on Computational Linguistics (COLING). ACL, Stroudsburg, PA, 2075–2086
Google Scholar

Download references

Danksagung

Unserer Forschung genoss die Unterstützung verschiedener Partner wie der DFG und Unternehmen, die sich für das Verständnis und die Verbesserung ihrer Daten interessieren. Die hier vorgestellten Arbeiten beruhen – natürlich – auf der Forschung unserer hervorragenden Doktoranden: Tobias Bleifuß, Toni Grütze, Hazar Harmouch, Maximilian Jenders, Anja Jentzsch, John Koumarelas, Sebastian Kruse, Konstantina Lazaridou, Michael Loster, Thorsten Papenbrock, Julian Risch, Ahmad Samiei und Zhe Zuo.

Zwei weitere HPI-Fachgebiete, mit denen wir kooperieren, arbeiten ebenfalls in der Datenbank-Community: Das Fachgebiet „Enterprise Platforms and Integration Concepts“ (EPIC) unter der Leitung von Hasso Plattner und Matthias Uflacker sowie das Fachgebiet „Knowledge Discovery und Data Mining“ (KDD) unter der Leitung von Emmanuel Müller.

Author information

Authors and Affiliations

Hasso-Plattner-Institut, 14482, Potsdam, Deutschland
Felix Naumann & Ralf Krestel

Authors

Felix Naumann
View author publications
You can also search for this author in PubMed Google Scholar
Ralf Krestel
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Felix Naumann.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Naumann, F., Krestel, R. Das Fachgebiet „Informationssysteme“ am Hasso-Plattner-Institut. Datenbank Spektrum 17, 69–76 (2017). https://doi.org/10.1007/s13222-016-0239-0

Download citation

Published: 03 January 2017
Issue Date: March 2017
DOI: https://doi.org/10.1007/s13222-016-0239-0

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Das Fachgebiet „Informationssysteme“ am Hasso-Plattner-Institut

Zusammenfassung

Access this article

Literatur

Danksagung

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Search

Navigation