I am developing this project to learn C++ and get hands-on experience with inference engines.
- Clone and build the project:

```bash
git clone [email protected]:MichalPitr/inference_engine.git
cd inference_engine
sh build.sh
```
CMake will fail if system dependencies are missing: protobuf, GTest, Google Benchmark, and yaml-cpp.
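On Debian/Ubuntu, something like the following should install them; these package names are an assumption and may differ on other distributions.

```bash
# Assumed Debian/Ubuntu package names; adjust for your distribution.
sudo apt-get install -y protobuf-compiler libprotobuf-dev \
    libgtest-dev libbenchmark-dev libyaml-cpp-dev
```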
The steps below start an HTTP server and use a Python script to send requests. You can do the equivalent with curl from the command line; a sketch follows the steps.
- Build the project as described above, then start the Go server:

```bash
cd server
go run main.go
```
- In another terminal, activate the Python virtual environment and send a request:

```bash
cd utils
source venv/bin/activate
python infer_server.py
```
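If you prefer curl, a request might look like this sketch. The port, route, and payload shape here are assumptions, not the server's confirmed API; check server/main.go and utils/infer_server.py for the actual contract.

```bash
# Hypothetical request: the port, route, and JSON fields are assumptions;
# see server/main.go for the real handler and expected payload.
curl -X POST http://localhost:8080/infer \
    -H "Content-Type: application/json" \
    -d '{"input": [0.0, 0.1, 0.2, 0.3]}'
```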
- Optimize CUDA kernels. GEMM is very naive at the moment.
- Add dynamic batching to the Go server.
- Add graph optimizations.
- Add input validation to the Go server.
- Optimize memory allocator usage: available memory should be checked during model loading, since total memory usage can be estimated fairly accurately.
- Improve error handling.
- Explore NVTX profiling (see the sketch after this list).
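For the NVTX item, Nsight Systems can already capture CUDA API calls, kernels, and NVTX ranges out of the box. A minimal sketch, where the binary path and argument are placeholders for whatever build.sh actually produces:

```bash
# Capture a timeline with CUDA and NVTX tracing enabled.
# "./build/inference_engine model.yaml" is a placeholder invocation;
# substitute the actual binary and arguments.
nsys profile --trace=cuda,nvtx -o report ./build/inference_engine model.yaml
```

Annotating hot paths with nvtxRangePushA/nvtxRangePop then makes them show up as named ranges in the timeline.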
This project wasn't designed with external contributions in mind, but if you fancy, improvements are welcome!
I enjoy writing technical blog posts and I've written some about this project: