GlotWeb: Web Indexing for Low-Resource Languages -- under construction.
-
Updated
Nov 24, 2024 - Python
GlotWeb: Web Indexing for Low-Resource Languages -- under construction.
This dataset contains cyber security news articles from 'The Hacker News'.
This repository contains the dataset for news classification.
Bangla News Article Categorization Using Conv-LSTM Net. It is a multi-class classification problem.
This repository have codes that extracts meaningful information from News headline data-set.
Performing classification tasks with the LibSVM toolkit on four different datasets: Iris, News, Abalone, and Income.
Collection of 100 news articles in Marathi along with their extractive text summaries.
π° News: π Real or π Fake β ... using π€ machine learning to build a system that identifies unreliable π news articles.
GlotSparse: Building Corpora in Under-Resourced Languages
Text classification by neural network models uses a dataset collected from Uzbek news sites.
These news scrapers were created for the final bachelor's thesis, text datasets were collected (an article and its summary), and finally a neural network was trained to generate text summaries in Lithuanian
Add a description, image, and links to the news-dataset topic page so that developers can more easily learn about it.
To associate your repository with the news-dataset topic, visit your repo's landing page and select "manage topics."