Skip to main content
Live Webinar with SANS (June 25)— Agentic CTI Automation for Fun & ProfitRegister Free
Mallory
Back to intelligence
ai-platform-securitydetection-content-updateai-enabled-threat-activity

Practical Guidance on Using LLMs in Security Work and Testing LLM Applications

Updated 3mo agoFirst seen Feb 5, 20263 sources

NVISO published a technical introduction on automating LLM red teaming to find security weaknesses in LLM-based applications, focusing on AI-specific risks such as prompt injection, data leakage, jailbreaking, and other behaviors that can bypass guardrails. The post describes why manual testing is difficult due to LLMs’ probabilistic behavior and demonstrates using the promptfoo CLI to scale testing against a deliberately vulnerable ChainLit application, positioning automated test harnesses as a way to systematically probe LLM apps for exploitable failure modes.

Separately, a practitioner write-up describes how security analysts and engineers are using general-purpose LLM tools (Claude, Cursor, ChatGPT) to accelerate day-to-day security work through better prompting patterns rather than “keyword searching.” It provides practical prompting techniques (e.g., “role-stacking” and supplying richer context like requirements docs or code repositories) and includes an example of using an LLM to help design a small Flask application for collecting OSINT (DNS, WHOIS/RDAP, HTML) for URL investigations—guidance that is adjacent to, but not the same as, automated red-teaming of LLM applications.

Share:
Practical Guidance on Using LLMs in Security Work and Testing LLM Applications
Stay ahead

Get ahead of threats like this

Mallory correlates global threat intelligence with your attack surface — know if you’re exposed before adversaries strike.

EVENT TIMELINE

How this story unfolded

3 events from the most recent confirmed update back to the earliest known activity.

3 EVENTS
Feb 6, 20265mo ago

Praetorian introduces Augustus LLM security testing suite

Praetorian introduced Augustus, an open-source LLM security testing tool and accompanying taxonomy covering jailbreaks, prompt injection, data extraction, package hallucinations, RAG/context attacks, multimodal attacks, renderer exploits, evasion methods, and agent/tooling probes. The publication framed these as structured evaluation probes for assessing LLM security.

Feb 5, 20265mo ago

NVISO outlines automated LLM red-teaming with Promptfoo

NVISO published a walkthrough of automated LLM red teaming using Promptfoo, explaining a workflow with target, adversarial, and grader models to test risks such as prompt injection, data leakage, jailbreaking, and authorization failures. The article included a lab against a deliberately vulnerable ChainLit chatbot and reported baseline and iterative jailbreak test results.

Feb 3, 20265mo ago

Guide published on using LLMs to augment security work

A practitioner guide described how to use LLMs such as Claude, Cursor, and ChatGPT to accelerate security and engineering tasks through context-rich prompting, role-stacking, iterative refinement, and validation. It emphasized that LLMs should augment rather than replace analyst judgment.

LINKED ENTITIES

Related entities

Vulnerabilities, threat actors, malware, products, organizations, and breaches Mallory has linked to this story.

3 LINKEDOpen in app
Organizations
3 linked
NvidiaNVISOPromptfoo
The operational view lives in Mallory

See the full picture, correlated to your attack surface.

This page covers what’s public. Mallory adds the parts that aren’t — which of your assets are affected, which threat actors are using it right now, which detections to deploy, and what to do next.
Exposure mapping

Map indicators from this story to your assets and identify affected systems in minutes.

Threat actor evidence

Every observed campaign, victim, and pivot linked to actors named in this story.

Associated malware

Malware, exploits, and IOCs connected to the activity described here.

Detection signatures

YARA, Sigma, and Snort rules deployed to your SIEM as soon as they’re published.

Scheduled alerts

Get matching new stories delivered to your team as they break — not the next morning.

AI threads

Ask questions about this story and take action on the answers.