- Explain how Velociraptor deploys YARA scans across endpoints and why it scales better than manual scanning - Configure the key parameters of Velociraptor YARA artifacts: YaraRule, PathGlob, FileNameRegex, and SizeMax - Differentiate between Windows (Windows.Detection.Yara.NTFS) and Linux (Linux.Detection.Yara.Glob) scanning artifacts - Create and launch a Velociraptor hunt that deploys YARA rules to multiple endpoints simultaneously - Interpret hunt results to identify infected hosts, matching files, and triggered rules across your fleet - Connect these skills to the endpoint-wide YARA hunt exercise in Lab 7.5

## From One Machine to the Entire Fleet In Lesson 7.4, you learned to scan files, directories, memory dumps, and disk images on a single system. That covers incident response on one compromised host. But what happens when you need to answer a much bigger question: **"Is this malware on any other system in our environment?"** Manually SSH-ing into every server, copying your YARA rules, running scans, and collecting results is not feasible when you have 50, 500, or 5,000 endpoints. You need a platform that can deploy YARA rules to every endpoint simultaneously, run the scan locally on each machine, and collect the results centrally. That platform is **Velociraptor**. You already know Velociraptor from Module 6 as an endpoint investigation tool. Now you will use its **hunt** capability — the ability to run the same artifact collection across every connected client at once. When that artifact is a YARA scan, you get enterprise-wide malware detection in minutes instead of days. ![YARA + Velociraptor hunting — the analyst writes a rule, Velociraptor deploys it to all endpoints, YARA runs locally on each, and only results travel back](https://cyberblue-academy-content.s3.us-east-2.amazonaws.com/courses/cyberbluesoc-academy/module-07/lesson-7-5/yara-velociraptor-hunting.png) ## How Velociraptor YARA Hunting Works The architecture is simple but powerful: 1. **Analyst writes a YARA rule** and pastes it into a Velociraptor hunt configuration 2. **Velociraptor server distributes the hunt** to all connected clients (or a targeted subset) 3. **Each client runs YARA locally** — the endpoint's CPU scans its own filesystem or memory 4. **Only results travel the network** — if 1,000 endpoints scan 100GB each, the server does not transfer 100TB. It receives only match reports (usually kilobytes per endpoint) 5. **Analyst reviews aggregated results** in the Velociraptor web interface — which hosts matched, which files triggered, which rules fired This model scales linearly. Scanning 10 endpoints takes the same amount of analyst time as scanning 1,000 — the work is distributed to the endpoints themselves.

**YARA runs ON the endpoint, not on the server.** This is the critical architectural insight. The Velociraptor server does not download files from endpoints and scan them centrally. Each endpoint has YARA built into the Velociraptor agent. When a hunt is dispatched, the agent receives the YARA rule text, runs the scan against its local filesystem, and reports back only the matches. This means network bandwidth usage is minimal and the server's CPU is not a bottleneck.

## Velociraptor YARA Artifacts Velociraptor provides several YARA-specific artifacts (pre-built collection scripts) for different operating systems and scan targets. ![Velociraptor YARA artifacts — Windows vs Linux with configuration parameters and use cases](https://cyberblue-academy-content.s3.us-east-2.amazonaws.com/courses/cyberbluesoc-academy/module-07/lesson-7-5/velociraptor-yara-artifacts.png) ### Windows: Windows.Detection.Yara.NTFS The primary Windows YARA scanning artifact. Uses raw NTFS access via the Master File Table (MFT) to scan files — including files that are locked by running processes. **Key parameters:** | Parameter | Description | Example | |---|---|---| | `YaraRule` | The complete YARA rule text (paste the entire rule) | `rule Webshell { strings: $a = "eval(" condition: $a }` | | `PathGlob` | Directory path to scan (supports ** wildcards) | `C:\Users\**`, `C:\Windows\Temp\**` | | `FileNameRegex` | Regex filter for filenames | `\.exe$\|\.dll$`, `\.php$` | | `SizeMax` | Maximum file size in bytes (skip larger files) | `50000000` (50MB) | | `NumberOfHits` | Max matches per file (0 = unlimited) | `1` (stop after first match per file) | | `Upload` | Upload matching files to the server | `true` | **Key advantage:** Raw NTFS access reads files that running processes hold open — including locked DLLs, open databases, and files with exclusive locks. Standard filesystem APIs cannot read these files, but MFT-level access bypasses all locks. **Common scan paths:** ``` C:\Users\** # User directories — malware staging C:\Windows\Temp\** # Temp directory — dropped payloads C:\ProgramData\** # App data — hidden tools C:\Windows\System32\** # System directory — masqueraded binaries ``` ### Linux: Linux.Detection.Yara.Glob The primary Linux YARA scanning artifact. Uses glob patterns to traverse the filesystem. **Key parameters:** | Parameter | Description | Example | |---|---|---| | `YaraRule` | Complete YARA rule text | Same as Windows | | `PathGlob` | Directory path with glob pattern | `/tmp/**`, `/var/www/**` | | `FileNameRegex` | Regex filter for filenames | `\.php$\|\.sh$` | | `SizeMax` | Maximum file size in bytes | `50000000` | | `NumberOfHits` | Max matches per file | `1` | | `Upload` | Upload matching files to server | `true` | **Common scan paths:** ``` /tmp/** # World-writable temp — attacker staging /var/tmp/** # Persistent temp — survives reboot /dev/shm/** # RAM-backed — fileless attack staging /var/www/** # Web directories — webshell hunting /home/** # User directories — backdoor tools /etc/cron.d/** # Cron persistence ``` ### Cross-Platform: Generic.Detection.Yara.Glob Works on both Windows and Linux. Uses standard file access (not raw NTFS). Simpler but cannot read locked files on Windows. ### Process Memory: *.Detection.Yara.Process Scans running process memory instead of filesystem: - **Windows.Detection.Yara.Process** — scan Windows process memory - **Linux.Detection.Yara.Process** — scan Linux process memory **When to use:** Detecting fileless malware, process injection, and packed payloads that only exist in memory. Deploy when your filesystem scan finds nothing but you still suspect compromise (the attacker may have deleted their files but their process is still running). ## Creating and Launching a Hunt A **hunt** is a Velociraptor operation that runs the same artifact across multiple endpoints. Here is the workflow: ### Step 1: Navigate to the Hunt Manager In the Velociraptor web interface, click the **Hunt Manager** icon (crosshair icon) in the left sidebar. Click **New Hunt** (+ button). ### Step 2: Configure the Hunt **Description:** Give the hunt a clear name: ``` Hunt: Webshell Detection - PHP eval/base64_decode/system Date: 2026-02-17 Analyst: CyberBlue Team Ref: INC-2026-0147 ``` **Artifacts:** Select the appropriate YARA artifact: - For Windows endpoints: `Windows.Detection.Yara.NTFS` - For Linux endpoints: `Linux.Detection.Yara.Glob` - For mixed environments: create two hunts (one per OS) **Parameters:** Configure the artifact with your YARA rule and scan scope: ``` YaraRule: rule Webshell_PHP_Eval { meta: author = "SOC Team" description = "Detects PHP eval webshells" severity = "critical" strings: $eval = "eval(" nocase $b64 = "base64_decode" nocase $system = "system(" nocase $post = "$_POST" $php = " **Be careful with Upload: true on large hunts.** If your rule matches hundreds of files across hundreds of endpoints, enabling file upload could overwhelm the Velociraptor server's storage. Start with `Upload: false` for the initial hunt. Review the results to confirm they are true positives. Then create a targeted follow-up hunt with `Upload: true` on specific endpoints to collect the confirmed malicious files. ## Hunt Optimization Tips ### Narrow Your Scope Do not scan `C:\**` or `/**` on every endpoint. This scans the entire filesystem and takes hours. Instead: - **Know your threat model.** Webshells go in web directories. Ransomware targets user files. C2 beacons hide in temp and appdata folders. Scan where the threat lives. - **Use FileNameRegex.** If you are hunting for PHP webshells, add `\.php$` — no point scanning .jpg, .pdf, and .log files. - **Set SizeMax.** If your target is under 50KB, set SizeMax to 50000 — skip everything larger. ### Stage Your Hunts 1. **Pilot hunt** — run on 5-10 test endpoints first. Verify the rule works, check for false positives, confirm scan duration is reasonable. 2. **Targeted hunt** — expand to the most likely targets (all web servers, all domain controllers, all endpoints in the affected network segment). 3. **Full hunt** — if the targeted hunt finds additional infections, expand to the entire fleet. ### Multiple Rules in One Hunt You can paste multiple YARA rules into a single hunt's YaraRule parameter: ``` rule Webshell_Eval { ... } rule Webshell_System { ... } rule Webshell_Passthru { ... } rule Cobalt_Strike_Beacon { ... } ``` All rules execute in a single pass over the filesystem, which is faster than running four separate hunts.

**Save your YARA rules as Velociraptor server artifacts.** Instead of pasting rules into every hunt configuration, you can create a custom Velociraptor artifact that embeds your organization's YARA rule library. This way, analysts can launch standardized hunts with a single click and the rules are version-controlled on the server.

## Real-World Hunt Scenario Here is a complete example of how a SOC team uses Velociraptor YARA hunting during an incident: **Trigger:** Wazuh fires alert for a suspicious PHP file creation on `linux-web-01`. Analyst investigates and confirms a web shell at `/var/www/html/.status.php`. **Question:** Is this web shell on any other web server? **Action 1:** Analyst writes a YARA rule based on the confirmed web shell: ``` rule Confirmed_Webshell_StatusPHP { meta: author = "SOC Analyst - INC-2026-0147" description = "Webshell found on linux-web-01" hash = "a1b2c3d4..." strings: $eval = "eval(base64_decode" nocase $post = "$_POST['cmd']" nocase $unique = "@set_time_limit(0)" condition: filesize < 10KB and all of them } ``` **Action 2:** Launch Velociraptor hunt targeting all endpoints labeled "web-server" with: - Artifact: `Linux.Detection.Yara.Glob` - PathGlob: `/var/www/**` - FileNameRegex: `\.php$` - Upload: false (first pass) **Result:** 3 additional web servers have the same shell. Analyst launches follow-up with `Upload: true` on those 3 hosts to collect the files. **Action 3:** Expand hunt to ALL endpoints (not just web servers) scanning `/tmp/**` and `/home/**` for the same rule — checking if the attacker staged tools elsewhere. **Result:** One additional endpoint has the web shell source in `/tmp/.cache/tools/` — this is likely where the attacker staged before deploying to web servers. **Outcome:** In under 30 minutes, the team went from a single alert to full enterprise-wide scope assessment.

- Velociraptor deploys YARA rules to endpoints and runs scans locally — only results travel the network, making it scale to thousands of hosts - Windows.Detection.Yara.NTFS uses raw NTFS access to read locked files; Linux.Detection.Yara.Glob uses glob patterns for filesystem traversal - Key artifact parameters: YaraRule (rule text), PathGlob (scan scope), FileNameRegex (file filter), SizeMax (size limit), Upload (collect matching files) - Hunts can target all clients, labeled subsets, OS-specific groups, or individual hosts - Stage hunts: pilot on 5-10 endpoints, then targeted groups, then full fleet — verify before expanding - Multiple YARA rules in a single hunt execute in one filesystem pass for efficiency - Raw NTFS access (Windows artifact) reads locked files that standard APIs cannot — critical for scanning running systems - In Lab 7.5, you will load YARA rules into Velociraptor, launch an endpoint-wide hunt, and triage the results to identify infected hosts

## What's Next Time to deploy at scale. In **Lab 10.5 — Endpoint-Wide Hunt**, you'll use Velociraptor to run YARA hunts across an entire endpoint — scanning the full filesystem instead of a single directory and collecting results through Velociraptor's hunt management interface.