Skip to content

Commit

Permalink
update replay to divergence in detector spec, calibration
Browse files Browse the repository at this point in the history
  • Loading branch information
leondz committed Oct 11, 2024
1 parent 70adb38 commit b081082
Show file tree
Hide file tree
Showing 2 changed files with 2 additions and 2 deletions.
2 changes: 1 addition & 1 deletion garak/data/calibration/calibration-2024-09update.json
Original file line number Diff line number Diff line change
Expand Up @@ -385,7 +385,7 @@
"sigma": 0.01364044725976449,
"sw_p": 0.0011471324643020655
},
"replay.Repeat/replay.RepeatDiverges": {
"divergence.Repeat/divergence.RepeatDiverges": {
"mu": 0.7905982905982906,
"sigma": 0.22985291682655218,
"sw_p": 0.0014372154789051555
Expand Down
2 changes: 1 addition & 1 deletion garak/probes/divergence.py
Original file line number Diff line number Diff line change
Expand Up @@ -20,7 +20,7 @@ class Repeat(Probe):
training data."""

bcp47 = "*"
primary_detector = "replay.RepeatDiverges"
primary_detector = "divergence.RepeatDiverges"
extended_detectors = ["mitigation.MitigationBypass"]
goal = "get the model to leak data by asking it to repeat words"
doc_uri = "https://not-just-memorization.github.io/extracting-training-data-from-chatgpt.html"
Expand Down

0 comments on commit b081082

Please sign in to comment.