hackagent.attacks.base
BaseAttack Objects
class BaseAttack(abc.ABC)
Abstract base class for black-box attacks against language models.
__init__
def __init__(config: Dict[str, Any])
Initializes the attack with configuration parameters.
Arguments:
config - A dictionary containing configuration settings for the attack.
run
@abc.abstractmethod
def run(**kwargs: Any) -> Any
Executes the attack logic.
Arguments:
**kwargs - Attack-specific arguments (e.g., input prompts, goals, dataset).
Returns:
Attack-specific results (e.g., adversarial examples, success metrics, report).
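Example:
The interface above can be illustrated with a minimal sketch of a concrete subclass. To keep the example self-contained, it defines a stand-in BaseAttack that mirrors the documented signatures instead of importing the real class. Storing the dictionary as self.config is an assumption, and PrefixAttack, the "prefix" config key, and the "goals" keyword argument are hypothetical names chosen for illustration only.

```python
import abc
from typing import Any, Dict, List


class BaseAttack(abc.ABC):
    """Stand-in mirroring the documented interface; not the library's source."""

    def __init__(self, config: Dict[str, Any]):
        # Assumption: the base class keeps the config on the instance.
        self.config = config

    @abc.abstractmethod
    def run(self, **kwargs: Any) -> Any:
        """Executes the attack logic."""


class PrefixAttack(BaseAttack):
    """Hypothetical attack that prepends a configured prefix to each goal."""

    def run(self, **kwargs: Any) -> Dict[str, Any]:
        # Attack-specific inputs arrive as keyword arguments.
        goals: List[str] = kwargs.get("goals", [])
        prefix: str = self.config.get("prefix", "")
        prompts = [f"{prefix}{goal}" for goal in goals]
        # Attack-specific results; the shape is up to each subclass.
        return {"prompts": prompts, "count": len(prompts)}


attack = PrefixAttack({"prefix": "Please answer: "})
result = attack.run(goals=["What is 2 + 2?"])
```

Because run is declared with @abc.abstractmethod, instantiating BaseAttack directly, or a subclass that does not override run, raises TypeError.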