No abstract available.
Proceeding Downloads
Overview of the HASOC Subtrack at FIRE 2021: Hate Speech and Offensive Content Identification in English and Indo-Aryan Languages and Conversational Hate Speech
- Sandip Modha,
- Thomas Mandl,
- Gautam Kishore Shahi,
- Hiren Madhu,
- Shrey Satapara,
- Tharindu Ranasinghe,
- Marcos Zampieri
The HASOC track is dedicated to the evaluation of technology for finding Offensive Language and Hate Speech. HASOC is creating a multilingual data corpus mainly for English and under-resourced languages(Hindi and Marathi). This paper presents one HASOC ...
Overview of the DravidianCodeMix 2021 Shared Task on Sentiment Detection in Tamil, Malayalam, and Kannada
- Ruba Priyadharshini,
- Bharathi raja Chakravarthi,
- Sajeetha Thavareesan,
- Dhivya Chinnappa,
- Durairaj Thenmozhi,
- Rahul Ponnusamy
We present the results of the Dravidian-CodeMix shared task1 held at FIRE 2021, a track on sentiment analysis for Dravidian Languages in Code-Mixed Text. We describe the task, its organization, and the submitted systems. This shared task is the ...
Working Notes of the Workshop Arabic Misogyny Identification (ArMI-2021)
This paper provides an overview of the first shared task on misogyny identification in Arabic tweets. Arabic Misogyny Identification task (ArMI) is introduced within the Hate Speech and Offensive Content detection (HASOC) track at FIRE-2021. The ArMI ...
UrduThreat@ FIRE2021: Shared Track on Abusive Threat Identification in Urdu
- Maaz Amjad,
- Alisa Zhila,
- Grigori Sidorov,
- Andrey Labunets,
- Sabur Butt,
- Hamza Imam Amjad,
- Oxana Vitman,
- Alexander Gelbukh
With the growth of spread and importance of social media platforms, the effect of their misuse became more and more impactful. This shared task address the task of abusive and threatening language detection in Urdu language that has more than 230 ...
AILA 2021: Shared task on Artificial Intelligence for Legal Assistance
- Vedant Parikh,
- Upal Bhattacharya,
- Parth Mehta,
- Ayan Bandyopadhyay,
- Paheli Bhattacharya,
- Kripa Ghosh,
- Saptarshi Ghosh,
- Arindam Pal,
- Arnab Bhattacharya,
- Prasenjit Majumder
AILA 2021 was the third edition of the Shared task on Artificial Intelligence for Legal Assistance, that was organized with the FIRE 2021 conference. This year two tasks were offered. While the rhetorical role labelling task was continued from last ...
Findings of Shared Task on Offensive Language Identification in Tamil and Malayalam
- Prasanna Kumar Kumaresan,
- Premjith,
- Ratnasingam Sakuntharaj,
- Sajeetha Thavareesan,
- Subalalitha Navaneethakrishnan,
- Anand Kumar Madasamy,
- Bharathi Raja Chakravarthi,
- John P. McCrae
We present the results of HASOC-Dravidian-CodeMix shared task1 held at FIRE 2021, a track on offensive language identification for Dravidian languages in Code-Mixed Text in this paper. This paper will detail the task, its organisation, and the ...
UrduFake@FIRE2021: Shared Track on Fake News Identification in Urdu
This study reports the second shared task named as UrduFake@Fire2021 on identifying fake news detection in Urdu language. This is a binary classification problem in which the task is to classify a given news article into two classes: (i) real news, or (...
Overview of the FIRE 2021 track: Information Retrieval from Microblogs during Disasters (IRMiDis)
Microblogging sites such as Twitter play an important role in dealing with various mass emergencies including natural disasters and pandemics. The FIRE2021 track on Information Retrieval from Microblogs during Disasters (IRMiDis) focused on two ...
Overview of the Causality-driven Adhoc Information Retrieval (CAIR) task at FIRE-2021
The CAusality-based Information Retrieval (CAIR) track at FIRE 2021 focuses on the task of retrieving potentially relevant documents in response to a query indicating one or more events, where the notion of relevance is determined by whether a document ...
Towards Automatic Green Claim Detection
Companies frequently make claims about positive impacts on the environment, but these claims are not always valid. In this sense, “greenwashing” is used when a company states a false claim about its products and practices being environmentally ...
Approximate Nearest Neighbour Search on Privacy-aware Encoding of User Locations to Identify Susceptible Infections in Simulated Epidemics
Amidst an increasing number of infected cases during the Covid-19 pandemic, it is essential to trace, as early as possible, the susceptible people who might have been infected by the disease due to their close proximity with people who were tested ...
Query Change as a Contextual Markov Model for Simulating User Search Behaviour
Search engine users issue queries to formulate their information need and gain useful insights. However, it is challenging for search engines to understand different users’ search type intents and return appropriate results. Simulating user search ...
From Opinion Mining to Improvement Mining : Understanding Product Improvements from User Reviews
A valuable trove of information exists for product(s) or services online via user opinions like detailed reviews provided by customers on popular e-commerce websites. Users express their individual opinions in the form of overall product/service ...
Meta-Learning for Offensive Language Detection in Code-Mixed Texts
This research investigates the application of Model-Agnostic Meta-Learning (MAML) and ProtoMAML to identify offensive code-mixed text content on social media in Tamil-English and Malayalam-English code-mixed texts. We follow a two-step strategy: The ...
A Survey of Recent Neural Network Models on Code-Mixed Indian Hate Speech Data
In recent years, given the exponential increase in social media content also led to an increase in online hate speech. We need automatic hate speech detection methods due to the volume of data on the web. Various approaches have been proposed to ...
Leveraging Transfer learning techniques- BERT, RoBERTa, ALBERT and DistilBERT for Fake Review Detection
In this era of the internet, the online review system has grown tremendously, where customers share their first-hand experiences about the products or services. These reviews influence the purchasing decision of future customers and have a positive or ...
Smart City Umbrella Ontology :Context -Driven Framework For Traffic Planning
Presently, huge public and private data sets are available from different government sources regarding transportation services in modern cities.The heterogeneous data-sets address day- to- day operations in transportation research,and these challenges ...
Zero-shot reductive paraphrasing for digitally semi-literate
People in developing countries with restricted schooling, face hurdles with their digital enablement. Constrained education creates issues with comprehensibility of information on internet and other digital platforms. Most content on digital platforms ...
Data Augmentation for Layperson’s Medical Entity Linking Task
Due to the vast amount of health-related data on social media, it is beneficial to monitor health-related issues experienced by the users, such as monitoring adverse drug effects. This problem is known as the Medical Entity Linking (MEL) task, which ...
Attention based end to end Speech Recognition for Voice Search in Hindi and English
We describe here our work with automatic speech recognition (ASR) in the context of voice search functionality on the Flipkart e-Commerce platform. Starting with the deep learning architecture of Listen-Attend-Spell (LAS), we build upon and expand the ...
Index Terms
- Proceedings of the 13th Annual Meeting of the Forum for Information Retrieval Evaluation
Recommendations
The integration of french language processing in an information retrieval
RIAO '97: Computer-Assisted Information Searching on Internet - Volume 2Cet article décrit les approches que nous avons implantées dans le cadre d'une collaboration de recherche entre nos deux groupes. Ces approches visent à créer une représentation plus précise pour les documents et les requêtes dans un système de ...