A workforce of researchers from synthetic intelligence (AI) agency AutoGPT, Northeastern College and Microsoft (NASDAQ:) Analysis have developed a instrument that displays giant language fashions (LLMs) for probably dangerous outputs and prevents them from executing.
The agent is described in a preprint analysis paper titled “Testing Language Mannequin Brokers Safely within the Wild.” In accordance with the analysis, the agent is versatile sufficient to watch present LLMs and may cease dangerous outputs, reminiscent of code assaults, earlier than they occur.
Proceed Studying on Cointelegraph