Skip to main content

hackagent.attacks.base

BaseAttack Objects

class BaseAttack(abc.ABC)

Abstract base class for black-box attacks against language models.

__init__

def __init__(config: Dict[str, Any])

Initializes the attack with configuration parameters.

Arguments:

  • config - A dictionary containing configuration settings for the attack.

run

@abc.abstractmethod
def run(**kwargs: Any) -> Any

Executes the attack logic.

Arguments:

  • **kwargs - Attack-specific arguments (e.g., input prompts, goals, dataset).

Returns:

Attack-specific results (e.g., adversarial examples, success metrics, report).