🛑 Follow This Series Live: Jarvis 2.0 Series

🛑 Follow This Series Live: Jarvis 2.0 Series

JARVIS-AGI

Project Overview

JARVIS-AGI is an advanced AI project designed to integrate multiple AI capabilities, including speech recognition, text processing, and image analysis, into a cohesive system. Named after the iconic AI assistant from popular culture, Jarvis is built with cutting-edge natural language processing capabilities, allowing users to interact with it through voice commands. Whether it's checking the weather, setting reminders, managing calendars, or searching the web, Jarvis is equipped to handle a wide range of tasks efficiently and effectively. With its intuitive interface and robust functionality, Jarvis aims to revolutionize the way users engage with technology, making everyday tasks simpler and more convenient.

Features

Speech Recognition: Convert spoken language into text using various models.
Text Processing: Analyze and generate text with multiple AI tools.
Image Analysis: Perform image recognition and processing tasks.
Audio Tools: Detect hotwords and manage audio playback interruptions.
Interactive Prompts: Predefined prompts to guide AI interactions.

Directory Structure

The project is organized into several key directories:

JARVIS-AGI/
├── .env
├── .env.example
├── .gitattributes
├── .gitignore
├── ASSETS/
│   ├── CLAP_DETECTS/
│   │   └── MODELS/
│   │       └── Model.txt
│   ├── SOUNDS/
│   │   ├── activation_sound.wav
│   │   ├── audio_file.mp3
│   │   └── deactivation_sound.wav
│   ├── STREAM_AUDIOS/
│   │   ├── output_audio_6.mp3
│   │   ├── output_audio_7.mp3
│   │   ├── output_audio_8.mp3
│   │   ├── output_audio_9.mp3
│   │   ├── output_audio_10.mp3
│   │   ├── output_audio_11.mp3
│   │   ├── output_audio_12.mp3
│   │   ├── output_audio_13.mp3
│   │   ├── output_audio_14.mp3
│   │   ├── output_audio_15.mp3
│   │   ├── output_audio_16.mp3
│   │   ├── output_audio_17.mp3
│   │   ├── output_audio_18.mp3
│   │   ├── output_audio_19.mp3
│   │   └── output_audio_20.mp3
│   ├── USERDATA/
│   │   └── LE CHAT/
│   │       └── How_To_Store_UserData.txt
│   ├── Vosk/
│   ├── available_working_proxies.txt
│   ├── conversation_history.json
│   └── openGPT_IDs.txt
├── BRAIN/
│   ├── AI/
│   │   ├── IMAGE/
│   │   │   ├── decohere_ai.py
│   │   │   └── deepInfra_IMG.py
│   │   ├── TEXT/
│   │   │   ├── API/
│   │   │   │   ├── Blackbox_ai.py
│   │   │   │   ├── Bnn_GPT.py
│   │   │   │   ├── FarFalle.py
│   │   │   │   ├── Hugging_Face_TEXT.py
│   │   │   │   ├── Le_Chat.py
│   │   │   │   ├── Phind.py
│   │   │   │   ├── Pi_Ai.py
│   │   │   │   ├── Uncensored.py
│   │   │   │   ├── basedGPT.py
│   │   │   │   ├── deepInfra_TEXT.py
│   │   │   │   ├── deepseek_ai.py
│   │   │   │   ├── hugging_chat.py
│   │   │   │   ├── liaobots.py
│   │   │   │   ├── openGPT.py
│   │   │   │   └── openrouter.py
│   │   │   ├── LOCAL/
│   │   │   │   └── llama_CPP.py
│   │   │   └── STREAM/
│   │   │       ├── basedGPT.py
│   │   │       └── deepInfra_TEXT.py
│   │   └── VISION/
│   │       └── deepInfra_VISION.py
│   └── TOOLS/
│       └── groq_web_access.py
├── ENGINE/
│   ├── STT/
│   │   ├── DevsDoCode.py
│   │   ├── NetHyTech.py
│   │   ├── src/
│   │   │   └── index.html
│   │   └── vosk_recog.py
│   └── TTS/
│       ├── STREAMING/
│       │   ├── DeepGram.py
│       │   └── speechify.py
│       ├── DeepGram.py
│       ├── ElevenLabs.py
│       ├── ai_voice.py
│       ├── deepAI.py
│       ├── edge_tts.py
│       ├── hearling.py
│       ├── speechify.py
│       └── stream_elements_api.py
├── PLAYGROUND/
│   ├── ADB_CALL/
│   │   ├── ADB COMMANDS.txt
│   │   ├── Details.txt
│   │   ├── IMP Commands.txt
│   │   ├── Information.txt
│   │   ├── android_device_connection_setup.py
│   │   └── make_call.py
│   ├── CAMERA/
│   │   └── camera_vision.py
│   ├── CLAP_NN/
│   │   ├── DATASETS/
│   │   │   └── Informtation.txt
│   │   ├── ClapDetector.py
│   │   ├── Model_Trainer.py
│   │   ├── audio_inference.py
│   │   ├── cnn_sound_model.py
│   │   └── load_dataset.py
│   └── WEBSITE_ASSISTANT/
│       ├── chrome_latest_url.py
│       └── jenna_reader.py
├── PROMPTS/
│   ├── BISECTORS.py
│   ├── INSTRUCTIONS.py
│   ├── PROMPTS.py
│   └── SYSTEM.py
├── TOOLS/
│   ├── AUDIO/
│   │   ├── Hotword_Detection.py
│   │   └── Interrupted_Playsound.py
│   ├── LE_CHAT_COOKIES/
│   │   └── Cookie_Extractor.py
│   ├── SYSTEM_SETTINGS/
│   │   ├── SETTING.py
│   │   ├── system_theme.py
│   │   └── taskbar.py
│   ├── Alpaca_DS_Converser.py
│   ├── ProxyAPI.py
│   ├── RawDog.py
│   ├── TXT_DS_Converser.py
│   ├── Web_Results.py
│   └── stream_audio_cleanup.py
├── CODE_OF_CONDUCT.md
├── IMPORTS.py
├── LICENCE
├── Le_Chat_Tester.py
├── Memory ConvoTxt.py
├── SpeedTester.py
├── StreamSpeak.py
├── WebTester.py
├── main.py
├── readme.md
└── requirements.txt

Installation

Clone the repository:

git clone https://github.com/SreejanPersonal/JARVIS-AGI.git
cd JARVIS-AGI

Install the required packages:
```
pip install -r requirements.txt
```
(Optional) Install Vosk Speech Recognition Models:

Vosk provides pre-trained models for various languages. To install the models for your desired language, follow these steps:
- Go to the Vosk GitHub repository releases page: Vosk GitHub Releases
- Download the model folder for your language. For example, if you want English models, download the folder named vosk-model-en-us-aspire-0.2.
- Extract the contents of the folder into a directory named ASSETS in your project directory.
- Ensure that the extracted model folder is directly under the ASSETS directory, without any additional nesting.
- Now, you should have a structure like this: <your-main-project-directory>/ASSETS/vosk-model-en-us-aspire-0.2.
- Modify the main.py or any relevant script to point to the model directory. For example:
```
from ENGINE.STT.vosk_recog import speech_to_text

for speech in speech_to_text(model_path="ASSETS/Vosk/vosk-model-small-en-us-0.15"):
    if speech != "":
        print("Human >>", speech)
```

Usage

Run the main script:
```
python main.py
```
Configuration: Modify API configuration

in the .env directory to suit your needs.

Contributing

We welcome contributions to improve JARVIS-AGI. To contribute, follow these steps:

Fork the repository.
Create a new branch (git checkout -b feature-branch).
Commit your changes (git commit -m 'Add new feature').
Push to the branch (git push origin feature-branch).
Create a new Pull Request.

License

This project is licensed under the MIT License. See the LICENSE file for details.

Connect with Us

Made With 💓 By - Sree (Devs Do Code)

For any questions or concerns, reach out to us via our social media handles. Our top choice for contact is Telegram: Devs Do Code Telegram

YouTube Channel: Devs Do Code
Telegram Group: Devs Do Code Telegram
Discord Server: Devs Do Code Discord
Instagram:
- Personal: Sree
- Channel: Devs Do Code

Devs Do Code

Dive into the world of coding with Devs Do Code - where passion meets programming! Make sure to hit that Subscribe button to stay tuned for exciting content!

Pro Tip: For optimal performance and a seamless experience, we recommend using the default library versions demonstrated in this demo. Your coding journey just got even better! Happy coding!

Now you're all set to explore the Devs Do Code's project! Enjoy coding!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🛑 Follow This Series Live: Jarvis 2.0 Series

JARVIS-AGI

Project Overview

Table of Contents

Features

Directory Structure

Installation

Usage

Contributing

License

Connect with Us

Devs Do Code

About

Releases

Sponsor this project

Packages

Contributors 2

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 56 Commits
.github		.github
ASSETS		ASSETS
BRAIN		BRAIN
ENGINE		ENGINE
PLAYGROUND		PLAYGROUND
PROMPTS		PROMPTS
TOOLS		TOOLS
.env.example		.env.example
.gitattributes		.gitattributes
.gitignore		.gitignore
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
IMPORTS.py		IMPORTS.py
LICENCE		LICENCE
Le_Chat_Tester.py		Le_Chat_Tester.py
Memory ConvoTxt.py		Memory ConvoTxt.py
SpeedTester.py		SpeedTester.py
StreamSpeak.py		StreamSpeak.py
Timer_Alarm_Tester.py		Timer_Alarm_Tester.py
WebTester.py		WebTester.py
main.py		main.py
readme.md		readme.md
requirements.txt		requirements.txt

License

SreejanPersonal/JARVIS-AGI

Folders and files

Latest commit

History

Repository files navigation

🛑 Follow This Series Live: Jarvis 2.0 Series

JARVIS-AGI

Project Overview

Table of Contents

Features

Directory Structure

Installation

Usage

Contributing

License

Connect with Us

Devs Do Code

About

Resources

License

Code of conduct

Stars

Watchers

Forks

Releases

Sponsor this project

Packages 0

Contributors 2

Languages

Packages