Name		Name	Last commit message	Last commit date
parent directory ..
fairness		fairness
harmless_harmful		harmless_harmful
honesty		honesty
languages		languages
memorization		memorization
primary_emotions		primary_emotions
README.md		README.md

README.md

Community Examples

This directory contains example frontiers of Representation Engineering (RepE). While some of the examples were originally provided by the authors, we encourage and welcome community contributions. If you'd like to contribute, please open a PR, and we will review and merge it promptly.

Example	Description	Code Example	Author
Honesty	Monitoring and controlling the honesty of a model, using RepE techniques for lie detection, hallucinations, etc.	honesty	-
Emotions	Controlling primary emotions in LLMs, illustrating the profound impact of emotions on model behavior.	primary_emotions	-
Fairness	Reducing bias and increasing fairness in model generations.	fairness	-
Harmless	Jailbreaking aligned model with harmless controlled	harmless_harmful	-
Memorization	Preventing memorized outputs during generation.	memorization	-

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

examples

examples

README.md

Community Examples

Files

examples

Directory actions

More options

Directory actions

More options

Latest commit

History

examples

Folders and files

parent directory

README.md

Community Examples