No abstract available.
Proceeding Downloads
The Microsoft academic search dataset and KDD Cup 2013
- Senjuti Basu Roy,
- Martine De Cock,
- Vani Mandava,
- Swapna Savanna,
- Brian Dalessandro,
- Claudia Perlich,
- William Cukierski,
- Ben Hamner
KDD Cup 2013 challenged participants to tackle the problem of author name ambiguity in a digital library of scientific publications. The competition consisted of two tracks, which were based on large-scale datasets from a snapshot of Microsoft Academic ...
Combination of feature engineering and ranking models for paper-author identification in KDD Cup 2013
- Chun-Liang Li,
- Yu-Chuan Su,
- Ting-Wei Lin,
- Cheng-Hao Tsai,
- Wei-Cheng Chang,
- Kuan-Hao Huang,
- Tzu-Ming Kuo,
- Shan-Wei Lin,
- Young-San Lin,
- Yu-Chen Lu,
- Chun-Pai Yang,
- Cheng-Xia Chang,
- Wei-Sheng Chin,
- Yu-Chin Juan,
- Hsiao-Yu Tung,
- Jui-Pin Wang,
- Cheng-Kuang Wei,
- Felix Wu,
- Tu-Chun Yin,
- Tong Yu,
- Yong Zhuang,
- Shou-de Lin,
- Hsuan-Tien Lin,
- Chih-Jen Lin
The track 1 problem in KDD Cup 2013 is to discriminate between papers confirmed by the given authors from the other deleted papers. This paper describes the winning solution of team National Taiwan University for track 1 of KDD Cup 2013. First, we ...
KDD Cup 2013 - author-paper identification challenge: second place team
This paper describes our submission to the KDD Cup 2013 Track 1 Challenge: Author-Paper Indentification in the Microsoft Academic Search database. Our approach is based on Gradient Boosting Machine (GBM) of Friedman ([5]) and deep feature engineering. ...
The scorecard solution to the author-paper identification challenge
This paper describes team mb74's solution to Track 1 of KDD Cup 2013. The challenge is to determine whether an author has written a given paper in the Microsoft Academic Search database. The key part of our solution is the feature generation which is ...
Feature engineering and tree modeling for author-paper identification challenge
The ability to search literature and collect/aggregate metrics around publications is a central tool for modern research. Both academic and industry researchers across hundreds of scientific disciplines, from astronomy to zoology, increasingly rely on ...
Contextual rule-based feature engineering for author-paper identification
We present the ideas and methodologies that we used to address the KDD Cup 2013 challenge on author-paper identification. We firstly formulate the problem as a personalized ranking task and then propose to solve the task through a supervised learning ...
Effective string processing and matching for author disambiguation
- Wei-Sheng Chin,
- Yu-Chin Juan,
- Yong Zhuang,
- Felix Wu,
- Hsiao-Yu Tung,
- Tong Yu,
- Jui-Pin Wang,
- Cheng-Xia Chang,
- Chun-Pai Yang,
- Wei-Cheng Chang,
- Kuan-Hao Huang,
- Tzu-Ming Kuo,
- Shan-Wei Lin,
- Young-San Lin,
- Yu-Chen Lu,
- Yu-Chuan Su,
- Cheng-Kuang Wei,
- Tu-Chun Yin,
- Chun-Liang Li,
- Ting-Wei Lin,
- Cheng-Hao Tsai,
- Shou-De Lin,
- Hsuan-Tien Lin,
- Chih-Jen Lin
Track 2 in KDD Cup 2013 aims at determining duplicated authors in a data set from Microsoft Academic Search. This type of problems appears in many large-scale applications that compile information from different sources. This paper describes our ...
Ranking-based name matching for author disambiguation in bibliographic data
Author name ambiguity is a frequently encountered problem in digital publication libraries such as Microsoft Academic Search. The cause of this problem mostly is that different authors may publish under the same name, while the same author could publish ...
KDD Cup 2013: author disambiguation
This paper describes our team's (BS Man & Dmitry & Leustagos) approach to the KDD Cup 2013 track 2 challenge: Author Disambiguation in the Microsoft Academic Search database.
A semi-supervised approach for author disambiguation in KDD CUP 2013
Name disambiguation, which aims to identify multiple names which correspond to one person and same names which refer to different persons, is one of the most important basic problems in many areas such as natural language processing, information ...
Index Terms
- Proceedings of the 2013 KDD Cup 2013 Workshop