Customization

Knowledge base

The sample knowledge base included in this AMP can be modified or augmented to provide different contexts in the Chatbot app.

Prepare your additional documents
- Custom knowledgebase documents should conform to the following requirements:
  - Less than 500 words (see Limitations)
  - Ensure ASCII character encoding (i.e. avoid Smart Quotes)
Add document files to the /data directory
Rerun the job Populate Vector DB with documents embeddings
Restart the application CML LLM Chatbot

Note: Keep in mind the semantic representation of each document is only calculated on the first 256 tokens. However the entire contents of the document file is included in the LLM enhanced prompt.

Models

The models used in the AMP can be swapped with other pre-trained models of similar type and compatibility with transformers interfaces.

Embeddings Model

This model is used to generate the embeddings (mathematical semantic representation) of each knowledgebase document.

Your chosen model should be compatible with hugging face transformers.AutoModel

Modify 2_job-download-models/download_models.sh
- EMBEDDING_MODEL_REPO
- EMBEDDING_MODEL_COMMIT
Then rerun the job Download Models
Rerun the job Populate Vector DB with documents embeddings
Restart the application CML LLM Chatbot

Large Language Model

This model is used to perform text generation with intruction prompts.

Your chose model should be compatible with hugging face transformers.AutoModelForCausalLM

Modify 2_job-download-models/download_models.sh
- LLM_MODEL_REPO
- LLM_MODEL_COMMIT
Then rerun the job Download Models
(Optional) Consider modifying the ehanced prompt used in the function create_enhanced_prompt() defined in4_app/llm_rag_app.py
- The prompt template should be similar to the patterns used when your chosen model was trained.
Restart the application CML LLM Chatbot

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

customization.md

customization.md

Customization

Knowledge base

Models

Embeddings Model

Large Language Model

Files

customization.md

Latest commit

History

customization.md

File metadata and controls

Customization

Knowledge base

Models

Embeddings Model

Large Language Model