Subscribe
Sign in
Home
Podcast
Notes
Archive
About
Safety Benchmark of Meta Llama 3.x Models
Why didn't Meta meet their own Llama 3.0 Safety benchmark? Why does Llama 3.2 generate drastically more unsafe responses?
Oct 3, 2024
•
Jitendra
Share this post
Detoxio AI
Safety Benchmark of Meta Llama 3.x Models
Copy link
Facebook
Email
Notes
More
Latest
Top
Discussions
Build Secure SOC AI Incident Investigation Agent, Part 1
Mitigate Prompt Injections, Jailbreaks, Data Leaks and Misalignment Issues
Jun 30
•
Jitendra
1
Share this post
Detoxio AI
Build Secure SOC AI Incident Investigation Agent, Part 1
Copy link
Facebook
Email
Notes
More
Guardrails in Practice: Measuring Llama-PG vs. Detoxio’s 300 M ‘AI Firewall'
What 20 adversarial prompts reveal about modern safety stacks
Jun 23
•
Jitendra
Share this post
Detoxio AI
Guardrails in Practice: Measuring Llama-PG vs. Detoxio’s 300 M ‘AI Firewall'
Copy link
Facebook
Email
Notes
More
AI Attack Surface: A Red Teamer’s Perspective
AI adoption is a reality. Are you prepared?
May 22
•
Jitendra
Share this post
Detoxio AI
AI Attack Surface: A Red Teamer’s Perspective
Copy link
Facebook
Email
Notes
More
Myth vs. Reality: What Detoxio AI Uncovered About Meta’s Llama-Guard-4-12B
Enterprises can deploy Detoxio AI Hardened Meta LLama Guard to reduce jailbreak success, significantly. in live deployments
May 17
•
Jitendra
Share this post
Detoxio AI
Myth vs. Reality: What Detoxio AI Uncovered About Meta’s Llama-Guard-4-12B
Copy link
Facebook
Email
Notes
More
The Evolution of AI Agents
From LLMs to Autonomous Intelligence
May 2
•
Jitendra
Share this post
Detoxio AI
The Evolution of AI Agents
Copy link
Facebook
Email
Notes
More
9:14
Hands-On AI Red Teaming Course
Coming Soon
Feb 9
•
Jitendra
1
Share this post
Detoxio AI
Hands-On AI Red Teaming Course
Copy link
Facebook
Email
Notes
More
1
Distilled Deepseek Models
Safety Evaluation Report
Jan 30
•
Jitendra
Share this post
Detoxio AI
Distilled Deepseek Models
Copy link
Facebook
Email
Notes
More
See all
Detoxio AI
Making GenAI Safe and Reliable for Enterprises
Subscribe
Recommendations
Gradient Flow
Ben Lorica 罗瑞卡
Software Analyst Cyber Research
Francis Odum
The Security Industry
Richard Stiennon
Marc Andreessen Substack
Marc Andreessen
The Cyber Why
Tyler Shields
Home
Home
Detoxio AI
Subscribe
About
Archive
Recommendations
Sitemap
Share this publication
detoxioai
Detoxio AI
Copy link
Facebook
Email
Notes
More
Share
Copy link
Facebook
Email
Notes
More
This site requires JavaScript to run correctly. Please
turn on JavaScript
or unblock scripts