How Retrieval-Augmented Generation (RAG) Can Make LLMs Less Safe
Super Data Science: ML & AI Podcast with Jon KrohnJuly 16, 20253 min167 views
5 connectionsΒ·9 entities in this videoβRAG's Unexpected Safety Implications
- π‘ Contrary to common belief, Retrieval-Augmented Generation (RAG) can actually make Large Language Models (LLMs) less safe and their outputs less reliable.
- β οΈ This research explored how RAG, when coupled with unsafe queries, can circumvent built-in safety mechanisms of LLMs, leading to unsafe responses even with innocuous retrieved documents.
Responsible AI and RAG Research
- π¬ The research is part of a broader responsible AI initiative focused on identifying, blocking, and monitoring potential misuse of AI technology.
- π― This is particularly crucial in heavily regulated industries where clients need assurance against accidental or purposeful abuse of AI tools.
- π RAG is recognized as a necessary technology for grounding LLM responses in trusted data sources, especially when dealing with vast amounts of daily incoming data.
Findings on RAG and Unsafe Queries
- π A study coupled unsafe queries (e.g., "How do I do insider trading?") with completely harmless documents from Wikipedia.
- π The results showed that while LLMs might not originally respond to such queries, their responses often became unsafe when augmented by RAG with these harmless documents.
- π― This highlights a critical need to understand and mitigate the potential risks introduced by RAG systems.
Knowledge graph9 entities Β· 5 connections
How they connect
An interactive map of every person, idea, and reference from this conversation. Hover to trace connections, click to explore.
Hover Β· drag to explore
9 entities
Chapters2 moments
Key Moments
Transcript11 segments
Full Transcript
Topics8 themes
Whatβs Discussed
Retrieval-Augmented Generation (RAG)Large Language Models (LLMs)AI SafetyResponsible AIUnsafe QueriesLLM SecurityData GroundingAI Misuse
Smart Objects9 Β· 5 links
ConceptsΒ· 4
CompaniesΒ· 2
ProductΒ· 1
MediasΒ· 2