Apurba Bangla Sentiment Analysis Corpus

Introduction

This corpus is built specifically for Bangla sentiment analysis and is made available to researchers under an open-source license. We collected and manually annotated over 10,000 sentences with sentiment polarity. We then moved to the word level and annotated over 15,000 words derived from these sentences with sentiment polarity. Each entry in the corpus has been cross-annotated by at least two, and sometimes three, annotators to ensure quality. As a prerequisite for creating a high-quality sentiment analysis corpus, we also built a secondary corpus for Bangla word stemming, which has likewise been cross-validated by at least two, and sometimes three, annotators.
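
Because every entry carries labels from at least two annotators, users of the corpus may want to check how closely two annotators agree. The following is a minimal sketch of Cohen's kappa for a pair of annotators. It is not the procedure used to build this corpus, which is not described here. The function name `cohen_kappa` and the label strings in the usage note are hypothetical and chosen only for illustration; the actual label set and file layout of the corpus may differ.

```python
from collections import Counter


def cohen_kappa(labels_a, labels_b):
    """Cohen's kappa for two annotators labelling the same items.

    labels_a and labels_b are equal-length sequences of polarity labels.
    The label values themselves are an assumption of this sketch.
    """
    if len(labels_a) != len(labels_b) or len(labels_a) == 0:
        raise ValueError("both annotators must label the same non-empty item list")

    n = len(labels_a)

    # Observed agreement: fraction of items given the same label by both.
    observed = sum(a == b for a, b in zip(labels_a, labels_b)) / n

    # Chance agreement: probability that both pick the same label
    # if each annotator labelled at random with their own label frequencies.
    count_a = Counter(labels_a)
    count_b = Counter(labels_b)
    expected = sum(count_a[label] * count_b[label] for label in count_a) / (n * n)

    if expected == 1.0:
        # Both annotators used one identical label throughout.
        return 1.0
    return (observed - expected) / (1.0 - expected)
```

For example, with the hypothetical labels `["pos", "neg", "pos", "neg"]` from one annotator and `["pos", "neg", "neg", "neg"]` from another, the observed agreement is 0.75, the chance agreement is 0.5, and the function returns a kappa of 0.5.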


Download Corpus

Request Download


Data Source

View Data Source


Corpus Statistics

View Corpus Statistics


Corpus Properties

View Corpus Properties


Release Date: 11 July 2019
Version No: 1.0