Meaning of abliterate | Babel Free
Definitions
To uncensor a large language model by modifying specific model internals to remove refusal behaviours or unwanted traits, while aiming to preserve the model's other capabilities.
neologism
Examples
“Now that we have our datasets, we can load the model we want to abliterate. […] I evaluated the abliterated and source models from the previous section on the Open LLM Leaderboard and on Nous' benchmark suite.”
CEFR level
B2
Upper Intermediate
This word is part of the CEFR B2 vocabulary — upper intermediate level.