Monash University
Browse
DipteshK-Final thesis - Cherie Lau.pdf (6.19 MB)

Investigations into Distributional Semantics for Cognate Detection and Phylogenetics

Download (6.19 MB)
thesis
posted on 2021-06-29, 01:16 authored by Diptesh Kanojia
This thesis investigates distributional semantics for cognate detection, false friends' detection and computational phylogenetics to present the insights drawn from our research, for 14 Indian languages pairs. Shared vocabulary facilitates second language learning and enables the computational models to perform cross-lingual learning for natural language processing (NLP) tasks. Distributional semantics aids NLP as it allows these models to understand natural languages. Our investigations use cross-lingual features to help detect cognates and false friends across languages. We also generate typological trees for Indian languages and, additionally, propose the division of the text into meaningful functional units which aid phylogenetic tree generation.

History

Campus location

Australia

Principal supervisor

Gholamreza Haffari

Additional supervisor 1

Pushpak Bhattacharyya

Additional supervisor 2

Malhar Kulkarni

Year of Award

2021

Department, School or Centre

Information Technology (Monash University Clayton)

Additional Institution or Organisation

IITB-Monash

Course

Doctor of Philosophy

Degree Type

Doctorate

Faculty

Faculty of Information Technology