Welcome to the YouTube Adview Count Analysis repository! This project aims to predict YouTube ad view counts using machine learning models. Below you will find detailed descriptions of the files, models, and datasets used in this project, along with instructions for setting up and running the code.
This repository contains a collection of machine learning models and datasets used to analyze and predict ad view counts on YouTube. The project demonstrates various machine learning techniques, including linear regression, decision trees, and ensemble methods, to build effective prediction models.
Here is a brief description of the files and directories in this repository:
| File/Directory | Description |
|---|---|
| `artificial_neural_network.keras` | A Keras model file containing the trained Artificial Neural Network model used for predicting YouTube ad view counts. |
| `decision_tree.pkl` | A pickle file containing the trained Decision Tree model for predicting YouTube ad view counts. |
| `linear_regression.pkl` | A pickle file containing the trained Linear Regression model for predicting YouTube ad view counts. |
| `random_forest.pkl` | A pickle file containing the trained Random Forest model for predicting YouTube ad view counts. |
| `svr.pkl` | A pickle file containing the trained Support Vector Regression (SVR) model for predicting YouTube ad view counts. |
| `train.csv` | CSV file containing the training dataset with features and target values for model training. |
| `test.csv` | CSV file containing the test dataset used for evaluating the performance of the models. |
| `predictedAdview.csv` | CSV file containing the predicted ad view counts from the models. |
| `internship.ipynb` | Jupyter Notebook containing the code and analysis for the YouTube ad view count prediction project. |
| `README.md` | This file, providing an overview of the project, file descriptions, and instructions. |
### `train.csv`
- Description: The training dataset used for training the machine learning models.
- Format: CSV
- Columns: Features and target values for training.

### `test.csv`
- Description: The test dataset used to evaluate the performance of the machine learning models.
- Format: CSV
- Columns: Features for testing the models.

### `predictedAdview.csv`
- Description: A CSV file containing the predicted ad view counts generated by the models.
- Format: CSV
- Columns: Predicted ad view counts.

### `artificial_neural_network.keras`
- Description: A Keras model file for an Artificial Neural Network.
- Purpose: Used for predicting YouTube ad view counts.
- Requirements: Keras and TensorFlow libraries.

### `decision_tree.pkl`
- Description: A pickle file for the Decision Tree model.
- Purpose: Used for predicting YouTube ad view counts.
- Requirements: scikit-learn library.

### `linear_regression.pkl`
- Description: A pickle file for the Linear Regression model.
- Purpose: Used for predicting YouTube ad view counts.
- Requirements: scikit-learn library.

### `random_forest.pkl`
- Description: A pickle file for the Random Forest model.
- Purpose: Used for predicting YouTube ad view counts.
- Requirements: scikit-learn library.

### `svr.pkl`
- Description: A pickle file for the Support Vector Regression (SVR) model.
- Purpose: Used for predicting YouTube ad view counts.
- Requirements: scikit-learn library.
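For reference, a pickle file like `random_forest.pkl` can be produced with scikit-learn and joblib along these lines (a minimal sketch; the actual features, preprocessing, and hyperparameters used in `internship.ipynb` may differ, and `target` is an assumed column name):

```python
import joblib
import pandas as pd
from sklearn.ensemble import RandomForestRegressor

# Load the training data; 'target' is assumed to be the ad view column
# and the remaining columns are assumed to be numeric features.
train_data = pd.read_csv('train.csv')
X_train = train_data.drop(columns=['target'])
y_train = train_data['target']

# Fit a Random Forest regressor and save it in the same way as random_forest.pkl.
model = RandomForestRegressor(n_estimators=100, random_state=42)
model.fit(X_train, y_train)
joblib.dump(model, 'random_forest.pkl')
```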
To get started with this project, follow these steps:
```bash
git clone https://github.com/hitesh-bhatnagar/YouTube-Adview-count-.git
cd YouTube-Adview-count-
```
Make sure you have the required libraries installed:
```bash
pip install -r requirements.txt
```
You can load the datasets using pandas:
```python
import pandas as pd

train_data = pd.read_csv('train.csv')
test_data = pd.read_csv('test.csv')
```
To load the saved models:
```python
import joblib

decision_tree_model = joblib.load('decision_tree.pkl')
linear_regression_model = joblib.load('linear_regression.pkl')
random_forest_model = joblib.load('random_forest.pkl')
svr_model = joblib.load('svr.pkl')
```
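The Artificial Neural Network is stored in the native Keras format and can be loaded with Keras (a minimal sketch, assuming a TensorFlow/Keras version recent enough to support the `.keras` format):

```python
from tensorflow import keras

# Load the saved ANN model from the native .keras file.
ann_model = keras.models.load_model('artificial_neural_network.keras')
```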
To make predictions using the models:
```python
# If test.csv includes the target column, drop it so only features are passed to the models.
X_test = test_data.drop(columns=['target'], errors='ignore')

predictions = {
    'decision_tree': decision_tree_model.predict(X_test),
    'linear_regression': linear_regression_model.predict(X_test),
    'random_forest': random_forest_model.predict(X_test),
    'svr': svr_model.predict(X_test)
}
```
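To write a set of predictions to a CSV in the spirit of `predictedAdview.csv` (a minimal sketch; the exact column layout of the file in this repository may differ):

```python
# Save one model's predictions, e.g. the Random Forest output.
pd.DataFrame({'Adview': predictions['random_forest']}).to_csv('predictedAdview.csv', index=False)
```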
Use the test dataset to evaluate the models’ performance:
```python
from sklearn.metrics import mean_squared_error

# 'target' is assumed to be the ground-truth ad view column in test.csv.
for model_name, preds in predictions.items():
    mse = mean_squared_error(test_data['target'], preds)
    print(f'{model_name} Mean Squared Error: {mse}')
```
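The ANN model can be evaluated in the same way (a sketch that reuses `X_test` and `ann_model` from above and assumes no additional feature scaling was applied during training; if the notebook rescaled the features, apply the same preprocessing first):

```python
# Predict with the ANN and compute its MSE on the same features.
ann_preds = ann_model.predict(X_test).ravel()
ann_mse = mean_squared_error(test_data['target'], ann_preds)
print(f'artificial_neural_network Mean Squared Error: {ann_mse}')
```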
To use the models and datasets:

- Load the `train.csv` and `test.csv` datasets.
- Load the pre-trained models from their respective `.pkl` files or the `artificial_neural_network.keras` file.
- Use the models to make predictions and evaluate their performance.
Contributions are welcome! If you would like to contribute to this project, please fork the repository and submit a pull request with your changes. Ensure that your changes follow the guidelines outlined in the `CONTRIBUTING.md` file.
This project is licensed under the MIT License.
Make sure to include a `requirements.txt` file in your repository with the necessary dependencies:

```
numpy==1.21.2
pandas==1.3.3
scikit-learn==1.0.2
tensorflow==2.9.1
joblib==1.1.0
```
# Contributing to YouTube Adview Count Analysis
Thank you for your interest in contributing to the YouTube Adview Count Analysis project! We welcome contributions to improve the project.
## How to Contribute
1. **Fork the repository** to your own GitHub account.
2. **Clone the forked repository** to your local machine.
   ```bash
   git clone https://github.com/your-username/YouTube-Adview-count-.git
   ```

3. **Create a new branch** for your changes.

   ```bash
   git checkout -b my-new-feature
   ```

4. **Make your changes and commit them.**

   ```bash
   git add .
   git commit -m "Add some feature"
   ```

5. **Push your changes** to your forked repository.

   ```bash
   git push origin my-new-feature
   ```

6. **Submit a pull request** to the `main` branch of the original repository.
- Ensure that your code follows the project’s style guidelines.
- Provide a clear and concise description of your changes.
- Include tests for new features or bug fixes.
Thank you for your contributions!