Simple, unified interface to multiple Generative AI providers and local model servers.
This project is similar in scope to aisuite, but with the following differences:
- stronger support for local model servers (tabbyAPI, KoboldCpp, LMStudio, Ollama, ...)
- focus on improving your application without having to change your code
- middleware for logging, usage tracking, styling, etc.
- configuration-driven approach (see the sketch after this list)
- minimal dependencies (requests, pyyaml, jinja2)
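The declared dependencies hint at how the configuration-driven approach can work. A minimal sketch, assuming a hypothetical YAML schema (the keys shown are illustrative, not the project's actual configuration format):

import aibricks
import yaml  # pyyaml is one of the declared dependencies

# hypothetical settings; the schema here is illustrative only
settings = yaml.safe_load("""
model: openrouter:qwen/qwen-2.5-coder-32b-instruct
temperature: 0.7
""")

# client() accepts default parameters (see the overriding example below),
# so a config file can drive them directly
client = aibricks.client(**settings)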
Provider | Example Connection String | Environment Variables | Notes |
---|---|---|---|
OpenAI | openai:gpt-4o-mini | OPENAI_API_KEY | |
Google | google:gemini-1.5-flash | GEMINI_API_KEY | |
OpenRouter | openrouter:openai/gpt-4o | OPENROUTER_API_KEY | |
ArliAI | arliai:Llama-3.1-70B-Tulu-2 | ARLIAI_API_KEY | |
XAI | xai:grok-beta | XAI_API_KEY | |
Together | together:meta-llama/Meta-Llama-3-8B-Instruct-Turbo | TOGETHER_API_KEY | |
Ollama | ollama:qwen2.5-coder:7b | - | GGUF |
LMStudio | lmstudio:qwen2.5-14b-instruct | - | GGUF, dynamic model loading |
KoboldCpp | koboldcpp | - | GGUF |
LlamaCpp | llamacpp | - | GGUF |
tabbyAPI | tabbyapi | TABBYAPI_API_KEY, TABBYAPI_ADMIN_KEY, HF_API_TOKEN | EXL2, GPTQ, dynamic model downloads, dynamic model loading |
dummy | dummy | - | |
MIT
Don't use it yet. The project is still in its infancy.
import aibricks

# drop-in replacement for the OpenAI client
client = aibricks.client()
resp = client.chat.completions.create(
    model='openrouter:qwen/qwen-2.5-coder-32b-instruct',
    messages=[{"role": "user", "content": "Tell me a joke."}],
    temperature=0.7,
)
print(resp)
# parameter overriding example
client = aibricks.client(
    # any parameter can be given a default value ...
    temperature=0.7,
    # ... or be overridden by a lambda taking the original value
    model=lambda x: 'openrouter:qwen/qwen-2.5-coder-32b-instruct',
)

# somewhere else in the code
resp = client.chat.completions.create(
    model='openrouter:openai/gpt-3.5-turbo',  # ignored: replaced by the lambda above
    messages=[{"role": "user", "content": "Tell me a joke."}],
    # the default temperature=0.7 is applied automatically
)
print(resp)
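Because the lambda receives the call-site value, it can transform the value rather than replace it. A small sketch (the clamping rule is illustrative; behavior when the caller omits the parameter is not shown above):

client = aibricks.client(
    # cap whatever temperature the call site passes at 1.0
    temperature=lambda t: min(t, 1.0),
)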
# initialize the OpenAI-compatible client as usual
client = aibricks.client()

# add two lines to create the illusion of an infinite context
summary_middleware = ChatSummaryMiddleware(max_in_context_chars=12000)
client.add_middleware(summary_middleware)

# use the client as usual
resp = client.chat.completions.create(...)
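Presumably, once the conversation grows beyond max_in_context_chars, older messages are condensed into a summary so the prompt sent to the model stays bounded.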
import time

class TimingMiddleware(MiddlewareBase):
    def request(self, data, ctx):
        ctx["start"] = time.perf_counter()
        return data

    def response(self, data, ctx):
        ctx["duration"] = time.perf_counter() - ctx["start"]
        return data

client = aibricks.connect('openrouter:qwen/qwen-2.5-coder-32b-instruct')
client.add_middleware(TimingMiddleware())
resp = client.chat_create([{'role': 'user', 'content': 'Tell me a joke.'}])
print(client.ctx)  # includes the measured request duration
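The same two-hook pattern covers the logging middleware mentioned at the top. A minimal sketch reusing the interface shown above:

import logging

class LoggingMiddleware(MiddlewareBase):
    def request(self, data, ctx):
        # log the outgoing request payload before it reaches the provider
        logging.info("request: %s", data)
        return data

    def response(self, data, ctx):
        # log the raw response on the way back
        logging.info("response: %s", data)
        return data

client.add_middleware(LoggingMiddleware())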
# Run simple connection test for a specific provider
python3 -m aibricks.providers.lmstudio_api
# Run all tests
pytest
# Run only middleware tests
pytest tests/aibricks/middleware/
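For offline smoke tests, the dummy provider from the table above can be used. A sketch, assuming it returns canned responses without needing API keys or network access:

import aibricks

client = aibricks.connect('dummy')
resp = client.chat_create([{'role': 'user', 'content': 'ping'}])
print(resp)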
- game - simple "chat with a character"
- worldgen - simple, hierarchical world generator (based on this lesson)
- codegen - simple code generation for tiny web apps
- memgpt - basic MemGPT agent
- Anthropic API support
- server mode
- Outlines support
- vLLM support
- Mamba support
- Hugging Face downloads
- more middleware
- aisuite repo - simple, unified interface to multiple Generative AI providers
- MemGPT paper - "MemGPT: Towards LLMs as Operating Systems"
- letta repo - framework for creating LLM services with memory