Skip to content

Social media comments in Bengali language with labels

Notifications You must be signed in to change notification settings

cypher-07/Bangla-Text-Dataset

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 

Repository files navigation

Bangla-Text-Dataset : Social media comments in Bengali language with labels.

The data has been gathered and marked from the remark association area under public posts by celebrities, government officials, athletes on the Facebook stage. The total amount of collected comments is 44001. The dataset is compiled with the aim of developing the ability of machines to differentiate whether a word is a bully expression or not with the help of Natural Language Processing and to what extent it is improper if it is an inappropriate comment. The comments are labelled with different category bullies with the help of experts and consensus. Due to the scarcity of data collection of categorised Bengali language comments, this dataset can have a significant role for research in detecting bully words, identifying inappropriate comments, detecting different categories of Bengali bullies, etc.

About

Social media comments in Bengali language with labels

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published