Overview

Predict stock price of MAG-7 by building the LSTM neutral network on Keras framework.

Compare the outcomes when epoch is 10, 100, and 1,000 and plot the results.

Extract historical price on Yahoo Finance using Selenium:

Compile and train the LSTM network:

The future stock price predicted (Plotted 3 patterns by epoch in red for each stock):

Key Features

Predict future stock price using LSTM networks.

Data Preparation:
- Scrape historical stock price from Yahoo Finance using Selenium Webdriver
- Remove null valeues and store the dataset in the CSV file
- Run EDA to understand the data structure
Train/Test Data
- Take the adj. close price as data frame and split them into train and test dataset
- Nominalize the datasets from 0 to 1 using the MinMaxScaler preprocessing class from the scikit-learn
- Split the dataset into train and test data and reform those into NumPy array
Compile LSTM Network:
- Compile LSTM network using Keras Sequential framework with 5 dense layers
- Train the network using datasets
Stock Price Prediction:
- Visualize the results on graph. (Compare results by epoch = 10, 100, and 1,000)
Evaluation:
- Evaluate the results using RMSE and MAPE.

Technologies Used

Python: Primary programming language. We use ver 3.12

[data-scraping]

Selenium WebDriver: Scrape data from the website
BeatifulSoup4: A library that makes it easy to scrape information from web pages
XMLX: A simple and compact XML parser

[eda]

Matplotlib: A library for data visualization, typically in the form of plots, graphs and charts.

[ml-stack]

TensorFlow: A ML/AI software library
keras: An open-source API for artifitial neutral network
NumPy: A Python library to operate large, multi-dimensional arrays and matrices
pandas: An open source library with data structures and data analysis tools
scikit-learn: A Python ML module built on top of SciPy
scikeras: keras x scikit-learn

[deployment]

pip, pipenv: Python package manager

Project Structure

.
├── __init__.py
├── predict.py
├── utils/
│   ├── web_scraper.py
│   └── ext_analysis.py
└── sample_data/            # Store the scraped dataset 
└── requirements.txt

Setup

Install the pipenv package manager:
```
pip install pipenv
```

Install dependencies:

pipenv shell
pip install -r requirements.txt -v

Usage

Scrape the latest stock price data:
```
pipenv shell
python utils/web_scraper.py
```
You will be asked to enter a specific ticker or use default tickers of MAG7.
Run EDA:
```
pipenv shell
python utils/ext_analysis.py
```
You will be asked to select a ticker and title. You can skip them simply pressing enter.
Predict stock price:
```
pipenv shell
python main.py
```
You will be asked to input a ticker or you can skip this by pressing enter. (Default ticker is GOOG)

Development

Package Management with pipenv

Add a package: pipenv install <package>
Remove a package: pipenv uninstall <package>
Run a command in the virtual environment: pipenv run <command>

After adding/removing the package, update requirements.txt accordingly or run pip freeze > requirements.txt to reflect the changes in dependencies.
To reinstall all the dependencies, delete Pipfile and Pipfile.lock files, then run:
```
pipenv shell
pipenv install -r requirements.txt -v
```

Customizing LSTM Network

To customize LSTM, edit the main.py file.

Adding EDA

To add more EDA, edit the ext_analysis.py file.

Contributing

Fork the repository
Create your feature branch (git checkout -b feature/your-amazing-feature)
Commit your changes (git commit -m 'Add your-amazing-feature')
Push to the branch (git push origin feature/your-amazing-feature)
Open a pull request

Troubleshooting

Common issues and solutions:

Memory errors: If processing large contracts, you may need to increase the available memory for the Python process.
Data scraping issues: Selenium relies on the hard-coded HTML structures. Update web_scraper.py accordingly to see if the data was properly scraped.

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
utils		utils
.DS_Store		.DS_Store
.gitignore		.gitignore
LICENSE		LICENSE
Pipfile		Pipfile
Pipfile.lock		Pipfile.lock
README.md		README.md
__init__.py		__init__.py
main.py		main.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Overview

Table of Contents

Key Features

Technologies Used

Project Structure

Setup

Usage

Development

Package Management with pipenv

Customizing LSTM Network

Adding EDA

Contributing

Troubleshooting

About

Languages

License

krik8235/build-lstm-network

Folders and files

Latest commit

History

Repository files navigation

Overview

Table of Contents

Key Features

Technologies Used

Project Structure

Setup

Usage

Development

Package Management with pipenv

Customizing LSTM Network

Adding EDA

Contributing

Troubleshooting

About

Topics

Resources

License

Stars

Watchers

Forks

Languages