Bangla-Text-Dataset : Social media comments in Bengali language with labels.

The data has been gathered and marked from the remark association area under public posts by celebrities, government officials, athletes on the Facebook stage. The total amount of collected comments is 44001. The dataset is compiled with the aim of developing the ability of machines to differentiate whether a word is a bully expression or not with the help of Natural Language Processing and to what extent it is improper if it is an inappropriate comment. The comments are labelled with different category bullies with the help of experts and consensus. Due to the scarcity of data collection of categorised Bengali language comments, this dataset can have a significant role for research in detecting bully words, identifying inappropriate comments, detecting different categories of Bengali bullies, etc.

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
README.md		README.md
dataset.csv		dataset.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Bangla-Text-Dataset : Social media comments in Bengali language with labels.

About

Releases

Packages

cypher-07/Bangla-Text-Dataset

Folders and files

Latest commit

History

Repository files navigation

Bangla-Text-Dataset : Social media comments in Bengali language with labels.

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Packages