TwiBot-22/src/Dehghan at master · LucaKro/TwiBot-22

Detecting Bots in Social-Networks Using Node and Structural Embeddings

authors: Ashkan Dehghan, Kinga Siuta, Agata Skorupka, Akshat Dubey, Andrei Betlen, David Miller, Wei Xu, Bogumił Kaminski, Paweł Prałat
link: https://www.researchsquare.com/article/rs-1428343/latest.pdf
file structure:

├── network.py # generate graph features
├── profile_features.py # generate category features
├── nlp_features.py # generate text features
├── all_fea.py # check that all required features have been generated
├── *_fea.py # features from a specific embedding model
└── main_xgboost.py # train model on every dataset
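
As an illustration of the kind of check all_fea.py is described as performing, the sketch below reports which expected feature files are missing or empty. It is not the repository's code: the function name missing_feature_files is hypothetical, and the real scripts may name and store their outputs differently.

```python
from pathlib import Path


def missing_feature_files(feature_dir, expected_names):
    """Return the expected file names that are absent or empty in feature_dir.

    Hypothetical helper for illustration; not taken from all_fea.py.
    """
    feature_dir = Path(feature_dir)
    missing = []
    for name in expected_names:
        path = feature_dir / name
        # A zero-byte file usually means a feature script was interrupted.
        if not path.is_file() or path.stat().st_size == 0:
            missing.append(name)
    return missing
```

Called with a dataset folder and a list of file names you expect the feature scripts to have written, it returns an empty list once every feature file is in place.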

Implementation details:
- We did not run the remaining algorithms on cresci-2015: with limited computational resources, they are too inefficient for a dataset of this size, which has millions of edges.

How to reproduce:

The data has been preprocessed and stored in one folder per dataset, e.g. /cresci-2015.

First, run all_fea.py to generate the full feature set used for training. Remember to change the file path to match the dataset name:

dataset='Twibot-20'

Then, with the features generated, change the dataset name and run main_xgboost.py. Check the results in results/dataset.log.
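
Both steps depend on the dataset name, so it can help to derive every path from that one value. The sketch below is a hypothetical helper, not part of the repository. It assumes that the log path above means results/<dataset>.log with the dataset name substituted, and that each dataset lives in a folder of the same name.

```python
from pathlib import Path

# Dataset names that appear in this README.
KNOWN_DATASETS = ("Twibot-20", "Twibot-22", "cresci-2015")


def run_paths(dataset, root=Path(".")):
    """Return the data folder and log file for one dataset.

    The layout is an assumption made for illustration: <root>/<dataset>/ for
    the preprocessed data and <root>/results/<dataset>.log for the results.
    """
    if dataset not in KNOWN_DATASETS:
        raise ValueError(f"unknown dataset: {dataset!r}")
    root = Path(root)
    return {
        "data_dir": root / dataset,
        "log_file": root / "results" / f"{dataset}.log",
    }
```

Rejecting unknown names early catches a misspelled dataset before a long feature-generation run starts.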

Result:

dataset		accuracy	precision	recall	f1
Twibot-22	mean	-	-	-	-
Twibot-20_all	mean	0.8604	0.9472	0.8219	0.8801
Twibot-20_Deepwalk	mean	0.8634	0.9400	0.8300	0.8816
Twibot-20_Node2vec	mean	0.8607	0.9425	0.8798	0.8718
Twibot-20_Role2Vec	mean	0.8607	0.9484	0.8261	0.8805
Twibot-20_RolX	mean	0.8653	0.9313	0.8378	0.8820
Twibot-20_Struc2Vec	mean	0.8617	0.9366	0.8298	0.8799
Twibot-20_GraphWave	mean	0.8668	0.9331	0.6311	0.7620
cresci-2015_GraphWave	mean	0.6206	0.9615	0.8388	0.8834
cresci-2015_Node2Vec	mean	0.6318	0.9615	0.8388	0.7743
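
For reference, the four reported metrics can be computed from predictions as sketched below, using their standard definitions. This is not the code in main_xgboost.py. It assumes that bots are the positive class (label 1) and that the "mean" rows are averages over repeated runs; both the function names and these readings are assumptions.

```python
from statistics import mean


def binary_metrics(y_true, y_pred, positive=1):
    """Accuracy, precision, recall and F1 for one run (standard definitions)."""
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("y_true and y_pred must be non-empty and equal in length")
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    tn = len(pairs) - tp - fp - fn
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = 2 * precision * recall / (precision + recall) if precision + recall else 0.0
    return {
        "acc": (tp + tn) / len(pairs),
        "precision": precision,
        "recall": recall,
        "f1": f1,
    }


def mean_over_runs(runs):
    """Average each metric across a list of per-run metric dicts."""
    return {key: mean(run[key] for run in runs) for key in runs[0]}
```

If the rows are averaged this way, the mean F1 generally differs from the F1 computed from the mean precision and mean recall, so the f1 column need not equal the harmonic mean of the other two columns in the same row.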
