LLM inference with C++, just for fun. Referenced from [llama2.c](https://github.com/karpathy/llama2.c).
- Ubuntu 22.04.3
- GCC with C++17 support
- Clone this project:
  ```bash
  git clone https://github.com/SuperJokerayo/llm_with_cpp.git
  ```
- Install the sentencepiece tokenizer:
  ```bash
  sudo apt-get install cmake build-essential pkg-config libgoogle-perftools-dev
  git clone https://github.com/google/sentencepiece.git ./third_party/sentencepiece/
  ```
- Download an LLM checkpoint from the llama2.c repo. For example:
  ```bash
  # download the OG model, which has 15M parameters
  wget -P ./checkpoints/ https://huggingface.co/karpathy/tinyllamas/resolve/main/stories15M.bin
  ```
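  Since this project uses the sentencepiece tokenizer, you will likely also need a tokenizer model file. As an assumption (the exact path the project expects is not documented here), the `tokenizer.model` shipped in the llama2.c repo is the one the tinyllamas checkpoints were trained with:
  ```bash
  # assumption: the stories checkpoints use llama2.c's bundled tokenizer.model
  wget -P ./checkpoints/ https://github.com/karpathy/llama2.c/raw/master/tokenizer.model
  ```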
- Compile this project with CMake and run:
  ```bash
  mkdir build
  cd build
  cmake ..
  make -j $(nproc)
  ```
  The executable is then placed in `./bin`, and you can run it with:
  ```bash
  ./bin/run
  ```
- The config parameters are written in `config.ini`, and custom configs are supported.
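  As a minimal sketch of what such a config might look like, assuming the parameter names mirror llama2.c's runtime options (the actual keys are defined by this project's `config.ini`):
  ```ini
  ; hypothetical config.ini - key names modeled on llama2.c's options,
  ; not necessarily this project's actual keys
  checkpoint_path = ./checkpoints/stories15M.bin
  tokenizer_path  = ./checkpoints/tokenizer.model
  temperature     = 1.0   ; 0.0 picks the most likely token (greedy)
  topp            = 0.9   ; top-p (nucleus) sampling cutoff
  steps           = 256   ; number of tokens to generate
  prompt          = "Once upon a time"
  ```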
- You can also run the shell script if the dependencies are installed manually:
  ```bash
  bash ./run.sh
  ```
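  If you need to build and install sentencepiece manually first, the standard steps from the sentencepiece README should work (whether run.sh expects a system-wide install is an assumption):
  ```bash
  # build and install sentencepiece system-wide
  cd ./third_party/sentencepiece/
  mkdir build && cd build
  cmake ..
  make -j $(nproc)
  sudo make install
  sudo ldconfig   # refresh the shared-library cache
  ```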
Have a look at the LICENSE file for details.