Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

FAQ and content translations #8

Open
wants to merge 2 commits into
base: master
Choose a base branch
from
Open
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
9 changes: 9 additions & 0 deletions content/_index.en.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,9 @@
---
title: "Bot Spotter"
date: 2020-06-29T13:51:23-02:00
draft: false
---

The **Bot Spotter** verifies a social media network profile and scores based on the probability of that profile being a _bot_. The greater scores indicates a higher chance of being a _bot_.

[Know about..](/faq/)
22 changes: 22 additions & 0 deletions content/about.en.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,22 @@
---
title: "Sobre o Pegabot"
date: 2020-06-29T11:07:39-02:00
draft: false
menu:
- "main"
- "header"
slug: "sobre"
weight: 1
rootClass: "about-page"
---
{{< video src="/assets/video/pegabot--480x480.mp4">}}

# Dismantling structural botnets

_Bots_, short for robots, are computer programs created to perform specific tasks. The first robots did not have malicious intentions, and even today there are good _bots_, whose purpose is to demand accountability from politicians, viralize causes for gender equality or help organize the (many) daily tasks of their users. Good right?

But in the late 1990s, "bots" started to develop a negative reputation. Some _bots_ were used to send SPAMs by e-mail, steal users' personal data, credit card fraud and disinformation attacks to manipulate the public sphere. These _bots_ aim to spread lies to influence narratives, a phenomenon that has been gaining global scale since 2014. These _bots_ are out there and almost nobody knows how they work, who develops them and by whom they are financed. In oder to address these issues, we launched **Bot Spotter**, a tool that brings more transparency to the use of _bots_ in Brazil.

On this platform, the user can check the activity of a social network account to know the probability of the profile being a bot. The higher the score, the greater the chance of being a bot. For now, the platform is integrated with Twitter, but soon we will support other social media.

The **Bot Spotter** is a project in partnership amongst the [Institute for Technology and Society (ITS Rio)](https://itsrio.org/) and [Instituto Equidade &amp; Tecnologia](http://tecnologiaequidade.org.br/). The platform was launched in March 2018 and is in the testing phase.
11 changes: 11 additions & 0 deletions content/faq.en.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,11 @@
---
title: "FAQ"
date: 2018-03-13T18:17:00-03:00
draft: false
menu:
- "main"
- "header"
slug: "faq"
layout: faq
weight: 2
---
10 changes: 10 additions & 0 deletions content/faq/avaliate-result.en.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,10 @@
---
date: 2018-03-13T18:08:36-03:00
draft: false
id: "como-avaliar-o-resultado"
title: "How can I evaluate the given results?"
layout: none
type: faq
weight: 7
---
When having an indication that an account may or may not be a Bot, it is important that you make a subjective analysis of the profile you are trying to analyze. That is to say, check for the purpose that the given profile is trying to achieve. If you identify that the profile only serves to spread the message of another person, or that most of the time that profile exists to attack someone or a specific group, more the chances of this profile confirms to be a bot. In this case, it is worth questioning the possible beneficiaries of that bot's performance if the profile is really a bot and why the beneficiary is using this resource. If you are a politician, for example, ask the politician directly if he knows there is a bot working for him and if he believes that this behavior is correct.
10 changes: 10 additions & 0 deletions content/faq/how-to-intepret-the-value.en.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,10 @@
---
date: 2018-03-13T18:05:53-03:00
draft: false
id: "como-interpretar-as-notas"
title: "How should I interpret the notes?"
layout: none
type: faq
weight: 5
---
The higher the score, the more likely the account is to be a bot. Each profile surveyed occupies a zone of the measurer: purple, yellow or green. If the profile searched is in the purple zone, the chance of being a bot is high. Many results are classified in the yellow zone, which means that there are enough elements to suspect the use of bots, but at a lower level. It may also be a profile that is used infrequently, or a very repetitive human use. If it's green, it's probably human.
10 changes: 10 additions & 0 deletions content/faq/twitter-permissions.en.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,10 @@
---
date: 2018-03-13T18:01:06-03:00
draft: true
id: "twitter-permissões"
title: "Do I need to be registered on Twitter to grant permissions to Bot Spotter?"
layout: none
type: faq
weight: 3
---
No! Our algorithms access the public Twitter API, which is in no way linked to your account. We do not access or store private information, nor do we tweet on your behalf. Your private API keys are never transmitted to our servers. This means that we do not have the ability to place orders on your behalf at all.
10 changes: 10 additions & 0 deletions content/faq/what-is-a-bot.en.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,10 @@
---
date: 2018-03-13T17:51:20-03:00
draft: false
id: "o-que-e-um-twitter-bot"
title: "What is a Bot?"
layout: none
type: faq
weight: 2
---
A Twitter Bot is an account controlled by an algorithm or script, normally used to perform repetitive tasks, for example, retweeting content containing particular keywords, replying to new followers and sending direct messages to new followers. More complex Twitter Bots can participate in online conversations and, in some cases, behave very similar to human behavior. Bot accounts make up between 9 and 15% of all active Twitter accounts, but further studies indicate that this percentage may be even higher due to the difficulty in identifying complex bots. Twitter bots are generally not created with malicious intent; they are often used to improve online interaction or service delivery by companies, governments and other organizations, so it is important to separate good bots from bad bots.
23 changes: 23 additions & 0 deletions content/faq/what-is-the-bot-probability.en.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,23 @@
---
date: 2018-03-13T18:04:04-03:00
draft: false
id: "o-que-a-nota-significa"
title: "What do these results mean?"
layout: none
type: faq
weight: 4
---
**Bot Spotter** analyzes the posting history of the social network profile that you ask to be analyzed. At this time, **Bot Spotter** is based on behavior patterns to identify whether a human using that profile is more likely, or a robot. The criteria for making this assessment are the time interval between each post (a short interval between each post, 2 seconds for example, may indicate that the post was made by a robot); the frequency and randomness in the time the posts are made (posts always made at the same time, at 10 am, for example, may have been made by a robot); and the personality given to the posted texts (texts repeated or extracted from other publications, pre-formatted, are indicative of having been made by a robot). Based on the overall average of the profile posts, **Bot Spotter** gives a score, which indicates the probability of whether the profile is bot or not.

| | Features for bot identification |
|---|---|
| Content (posts) | It is based on linguistic cues through natural language processing. Include frequency of verbs, nouns and adverbs in tweets. The system analyzes the length and entropy of the content of the tweet. Misleading messages generally display informal language and short phrases. |
| Feeling (emotional) | The various emotional states are extracted from the sentiment analysis of the tweets. A human usually expresses a variety of emotional states, while a bot does not. |
| User | User characteristics are based on account-related metadata. This data includes the number of friends and followers, the number of tweets produced by users, the description of the profile and the account settings (language, location, account creation date). |
| Friends | It includes statistics related to social contacts, such as the mean, the moments and the entropy of the distribution of the number of followers, followers and posts. Follower-follower relationships, retweets, and mentions are discussed. For each field, the characteristics on the use of the language, local time, popularity are extracted. |
| Network | The structure of the network shows various dimensions of the patterns of information dissemination. The system rebuilds three types of networks: retweets, mentions and hashtags co-occurrence. All networks are weighed according to the frequency of interaction or co-occurrences. |
| Time | The time characteristic captures patterns related to user activity. Includes the average period in content generation (tweets) and consumption (retweets), the time between two posts. |

Ferrara, E., Varol, O., Davis, C., Menczer, F. & Flammini, A. (2016) The Rise of Social Bots. Communications of the ACM. DOI: 10.1145/2818717.

Varol, O., Ferrara, E., Davis, C., Menczer, F. & Flammini, A. (2017). Online Human-Bot Interactions: Detection, Estimation, and Characterization. arXiv preprint arXiv:1703.03107.
10 changes: 10 additions & 0 deletions content/faq/which-social-networks-are-available.en.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,10 @@
---
date: 2018-03-13T17:54:29-03:00
draft: false
id: "o-pegabot-funciona-com-que-redes-sociais"
title: "Does Bot Spotter work with all social networks?"
layout: none
type: faq
weight: 1
---
For now no. In this first release, we only have access to Twitter. Facebook, WhatsApp and other social networks impose more barriers to its API's, so we are still working so that the next versions of Bot Spotter are integrated with other platforms.
19 changes: 19 additions & 0 deletions content/faq/who-made-pegabot.en.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,19 @@
---
date: 2018-03-13T18:09:27-03:00
draft: false
id: "quem-desenvolveu-o-pegabot"
title: "Who developed Bot Spotter?"
layout: none
type: faq
weight: 8
---

PegaBot is developed by the Instituto Tecnologia & Equidade (IT&E) and Institute for Tecnology and Society (ITS Rio). The analysis engine is free software and can be freely used, studied, modified and redistributed. Its code is available on [Github] (https://github.com/AppCivico/spottingbot).

Why is it important for a bot detection system to be built on free software?

Free software systems give the user the freedom to modify a code. We understand that having an open code allows the participation of society in its construction and improvement as a way of producing a more democratic knowledge of free technologies. In addition to enabling people to exchange information horizontally, it strengthens actions with digital platforms, their usage policies, and especially how disinformation networks can work using these techniques.

This initiative is also in line with the White Paper ["Systemic Recommendations to combat misinformation in the Brazilian elections"] (https://tecnologiaequidade.org.br/projects/desinformacao-em-eleicoes/), carried out by Instituto Tecnologia e Equidade ( IT&E).

In topic 5 of the document, there is the information: CREATE NEW ADJUSTMENT FEEDBACK CYCLES - in Recommendations 2019-2020. in this item, the Institute believes that "Demanding Academies and Research Institutes: development of free technologies for detecting robots, false profiles, haters and disinformation" is a point of leverage to combat the imbalances presented in the electoral system.
10 changes: 10 additions & 0 deletions content/faq/who-sponsor-pegabot.en.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,10 @@
---
date: 2018-03-13T18:10:01-03:00
draft: false
id: "quem-financia-o-pegabot"
title: "Who funds Bot Spotter?"
layout: none
type: faq
weight: 9
---
Bot Spotter (Pegabot) is an initiative of the Institue for Technology and Society and the Institute of Technology & Equity. Both organizations have been using their own resources for the development of this project, since they understand that making a tool like this available is essential for Latin American democracy. If you also want to support the project, write to [email protected]. For its Spanish version, Enjambre Digital and the Openlabs project have taken care of supporting the empowerment.
10 changes: 10 additions & 0 deletions content/faq/wrong-result.en.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,10 @@
---
date: 2018-03-13T18:07:28-03:00
draft: false
id: "o-resultado-está-errado"
title: "The result is wrong! I'm sure an account is bot / human."
layout: none
type: faq
weight: 6
---
Bot Spotter is still in the testing phase, the accuracy of the results will improve over time. You will soon be able to submit this error to review and correct the error. That said, detecting bots is indeed a difficult task (if it were easy, social media companies would have solved the whole problem). Our platform works by recognizing behavioral patterns. Therefore, the results should serve to complement, not replace, your own opinion.
10 changes: 10 additions & 0 deletions content/results.en.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,10 @@
---
title: "Results"
date: 2018-01-24T14:30:36-02:00
draft: false
layout: results
menu:
slug: "results"
---

The final result is not the average of other results, but the calculation of our algorithm. For more information, check the documentation in the github repository.