mirror of https://github.com/fabriziosalmi/patterns.git synced 2026-07-30 22:45:44 -04:00


Fabrizio Salmi 5c654b3da8 Redesign docs with Apple-native theme; verify content; route CI to self-hosted runner-02

- VitePress: custom theme (SF system fonts, glass nav, soft surfaces, pill buttons,
  light/dark code blocks, refined feature cards, platform showcase + stat strip).
- Replace every emoji across docs and README with inline SVG icons.
- Verify and fix doc accuracy against actual scripts: JSON schema (category+pattern only),
  env-var configuration for json2*/import_* scripts, owasp2json CLI surface.
- Add public assets (logo.svg, favicon.svg, hero-shield.svg) and Shiki haproxy alias.
- Workflows default to self-hosted runner-02 with a configurable fallback to GitHub
  runners via the RUNS_ON repo variable.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

2026-05-01 08:07:04 +02:00

.github

Redesign docs with Apple-native theme; verify content; route CI to self-hosted runner-02

2026-05-01 08:07:04 +02:00

docs

Redesign docs with Apple-native theme; verify content; route CI to self-hosted runner-02

2026-05-01 08:07:04 +02:00

tests

Update nginx.conf

2025-01-07 20:27:51 +01:00

waf_patterns

Fix CI workflow and clarify Nginx WAF usage

2025-12-09 07:59:25 +01:00

.gitignore

chore: add .gitignore

2026-04-27 08:09:38 +02:00

badbots.py

nginx snippets generation fix + others minor improvements.

2025-01-16 14:02:19 +01:00

CODE_OF_CONDUCT.md

Create CODE_OF_CONDUCT.md

2024-12-21 01:14:23 +01:00

CONTRIBUTING.md

docs: Fix script names, improve CONTRIBUTING, add WAF READMEs, fix workflow

2025-11-15 19:33:13 +00:00

import_apache_waf.py

Update import_apache_waf.py

2025-02-28 11:20:17 +01:00

import_haproxy_waf.py

Update import_haproxy_waf.py

2025-02-28 11:21:33 +01:00

import_nginx_waf.py

Update import_nginx_waf.py

2025-02-28 11:22:21 +01:00

import_traefik_waf.py

Update import_traefik_waf.py

2025-02-28 11:24:10 +01:00

json2apache.py

Update json2apache.py

2025-02-28 11:26:45 +01:00

json2haproxy.py

Update json2haproxy.py

2025-02-28 11:15:14 +01:00

json2nginx.py

Update json2nginx.py

2025-02-28 11:19:32 +01:00

json2traefik.py

Update json2traefik.py

2025-02-28 11:23:08 +01:00

LICENSE

Initial commit

2024-12-21 01:00:15 +01:00

owasp2json.py

Update owasp2json.py

2025-02-28 11:16:46 +01:00

owasp_rules.json

Update: [Fri Feb 28 10:03:59 UTC 2025]

2025-02-28 10:03:59 +00:00

README.md

Redesign docs with Apple-native theme; verify content; route CI to self-hosted runner-02

2026-05-01 08:07:04 +02:00

requirements.txt

Update requirements.txt

2025-01-03 20:56:06 +01:00

SECURITY.md

docs: Add prerequisites, improve bug template, enhance security policy

2025-11-15 19:35:18 +00:00

README.md

Patterns

Production-grade WAF rules, on autopilot.

Automated OWASP Core Rule Set and bad-bot patterns, converted into native configurations for Nginx, Apache, Traefik, and HAProxy — refreshed every day.

Documentation · Get started · Latest release

Why Patterns

The OWASP Core Rule Set (CRS) is the de facto open-source rule base behind ModSecurity, but plugging it into anything other than Apache is non-trivial. Patterns automates the whole pipeline:

1. Pull the latest CRS rules straight from upstream.
2. Convert them into the native syntax of each web server, not a generic shim.
3. Package the output as ready-to-deploy archives, refreshed every day by GitHub Actions.

You get the same rule base, covering SQL injection, XSS, RCE, LFI, and bad-bot traffic, regardless of which proxy you run.

Highlights


OWASP CRS coverage	SQLi, XSS, RCE, LFI, RFI, plus generic anomaly and protocol-violation rules.
Native output	Nginx `map`/`if`, Apache `SecRule`, Traefik middleware TOML, HAProxy ACL files.
Bad-bot blocking	Curated User-Agent lists from public sources, with safe defaults that do not block major search engines.
Daily refresh	A scheduled GitHub Actions workflow rebuilds every backend and publishes a fresh release.
Pre-built archives	Skip the toolchain — download `nginx_waf.zip`, `apache_waf.zip`, `traefik_waf.zip`, or `haproxy_waf.zip`.
Composable	Each backend is a small Python converter on top of one JSON intermediate. Adding a new platform is a few hundred lines.
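To make the "composable" point concrete, here is a minimal sketch of the core of a converter for a new platform. It assumes the JSON intermediate is a list of objects with `category` and `pattern` keys. The function names `group_by_category` and `render_rules`, and the one-pattern-per-line output, are hypothetical and not taken from the project's converters.

```python
from collections import defaultdict


def group_by_category(rules):
    """Group patterns by category, dropping duplicates while keeping order."""
    grouped = defaultdict(list)
    for rule in rules:
        category = rule["category"]
        pattern = rule["pattern"]
        if pattern not in grouped[category]:
            grouped[category].append(pattern)
    return dict(grouped)


def render_rules(category, patterns):
    """Render one category as text: a comment header, then one pattern per line.

    This output format is a placeholder; a real backend would emit the
    native syntax of its target platform instead.
    """
    lines = [f"# category: {category}"]
    lines.extend(patterns)
    return "\n".join(lines) + "\n"


# Hypothetical sample in the assumed shape of owasp_rules.json.
sample = [
    {"category": "sqli", "pattern": r"union\s+select"},
    {"category": "xss", "pattern": r"<script"},
    {"category": "sqli", "pattern": r"union\s+select"},
]

for category, patterns in group_by_category(sample).items():
    print(render_rules(category, patterns), end="")
```

Only `render_rules` would need to change per platform under this design, which is one plausible reading of why a new backend stays small.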

Using Caddy? See the dedicated caddy-waf project.

Quick start

Option 1 — download a pre-built release

# Pick the archive that matches your stack
curl -LO https://github.com/fabriziosalmi/patterns/releases/latest/download/nginx_waf.zip
unzip nginx_waf.zip -d /etc/nginx/waf_patterns

Then follow the Nginx, Apache, Traefik, or HAProxy integration guide.

Option 2 — build from source

Requires Python 3.11+, pip, and git.

git clone https://github.com/fabriziosalmi/patterns.git
cd patterns
pip install -r requirements.txt

python owasp2json.py            # 1. Fetch the latest OWASP CRS into owasp_rules.json
python json2nginx.py            # 2. Convert into Nginx WAF config
python json2apache.py           #    …or Apache (ModSecurity)
python json2traefik.py          #    …or Traefik middleware
python json2haproxy.py          #    …or HAProxy ACL files
python badbots.py               # 3. Generate bad-bot blocklists

Generated files land in waf_patterns/<platform>/.
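The build steps above can also be driven from a small Python wrapper. This is a sketch only: the script names come from the steps above, while `CONVERTERS`, `build_commands`, and `run_all` are hypothetical helpers that are not part of the repository.

```python
import subprocess
import sys

# Platform name -> converter script, as listed in the build steps.
CONVERTERS = {
    "nginx": "json2nginx.py",
    "apache": "json2apache.py",
    "traefik": "json2traefik.py",
    "haproxy": "json2haproxy.py",
}


def build_commands(platforms):
    """Return the ordered commands for a full build: fetch, convert, bad bots."""
    commands = [[sys.executable, "owasp2json.py"]]
    for platform in platforms:
        commands.append([sys.executable, CONVERTERS[platform]])
    commands.append([sys.executable, "badbots.py"])
    return commands


def run_all(platforms):
    """Run each step in order, stopping at the first failing script."""
    for command in build_commands(platforms):
        subprocess.run(command, check=True)
```

Calling `run_all(["nginx", "haproxy"])` from the repository root would run the fetch step once, then only the two selected converters, then the bad-bot generator.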

Architecture

   ┌─────────────────────┐    daily cron     ┌──────────────────────┐
   │ coreruleset/        │ ───────────────▶  │ owasp2json.py        │
   │ coreruleset (GH)    │                   │   → owasp_rules.json │
   └─────────────────────┘                   └──────────┬───────────┘
                                                        │
            ┌─────────────────┬──────────────────┬──────┴──────────┐
            ▼                 ▼                  ▼                 ▼
      json2nginx.py    json2apache.py    json2traefik.py    json2haproxy.py
            │                 │                  │                 │
            ▼                 ▼                  ▼                 ▼
       nginx_waf.zip    apache_waf.zip    traefik_waf.zip    haproxy_waf.zip
                          (published as a GitHub Release)

Each converter is independent, idempotent, and configured exclusively through environment variables (INPUT_FILE, OUTPUT_DIR). Full reference at docs/api.
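As an illustration of environment-only configuration, the sketch below resolves `INPUT_FILE` and `OUTPUT_DIR` with fallbacks. The helper name `converter_paths` and the fallback values are assumptions made for this example; check docs/api for the defaults the scripts actually use.

```python
import os
from pathlib import Path


def converter_paths(platform, env=None):
    """Resolve input and output paths from environment variables.

    The fallbacks below are assumptions for this sketch, not documented defaults.
    """
    env = os.environ if env is None else env
    input_file = Path(env.get("INPUT_FILE", "owasp_rules.json"))
    output_dir = Path(env.get("OUTPUT_DIR", f"waf_patterns/{platform}"))
    return input_file, output_dir


input_file, output_dir = converter_paths("nginx", env={"OUTPUT_DIR": "/tmp/waf"})
print(input_file, output_dir)
```

Passing `env` explicitly keeps the function easy to test; with `env=None` it reads the real process environment, which is what a converter started from a shell or CI job would see.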

Repository layout

patterns/
├── owasp2json.py            # Pull and parse OWASP CRS into a JSON intermediate
├── json2nginx.py            # JSON → Nginx (map + if directives)
├── json2apache.py           # JSON → Apache (ModSecurity SecRule)
├── json2traefik.py          # JSON → Traefik (middleware TOML)
├── json2haproxy.py          # JSON → HAProxy (ACL files)
├── badbots.py               # Public bot lists → per-platform blocklists
├── import_*_waf.py          # Optional installers for each platform
├── waf_patterns/            # Generated outputs
│   ├── nginx/
│   ├── apache/
│   ├── traefik/
│   └── haproxy/
├── docs/                    # VitePress documentation site
├── tests/                   # Validation tests for each backend
└── .github/workflows/       # Daily build + release automation

Integration in 60 seconds

Nginx

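http {
    # map definitions must live in the http context
    include /etc/nginx/waf_patterns/nginx/waf_maps.conf;
    include /etc/nginx/waf_patterns/nginx/bots.conf;

    server {
        # rule checks run per request inside each protected server block
        include /etc/nginx/waf_patterns/nginx/waf_rules.conf;
        if ($bad_bot) { return 403; }
    }
}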

Apache (ModSecurity)

<IfModule security2_module>
    SecRuleEngine On
    Include /etc/apache2/waf_patterns/apache/*.conf
</IfModule>

Traefik

http:
  routers:
    app:
      rule: "Host(`example.com`)"
      service: app
      middlewares: [waf-protection@file, bot-blocker@file]

HAProxy

frontend http-in
    bind *:80
    acl waf_match path,url_dec -m reg -i -f /etc/haproxy/waf.acl
    acl bad_bot   hdr(User-Agent) -m reg -i -f /etc/haproxy/bots.acl
    http-request deny deny_status 403 if waf_match || bad_bot
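To preview what the `waf_match` ACL above would flag before deploying it, the following sketch approximates the same check in Python: URL-decode the path, then search it case-insensitively against each pattern. The helper names are hypothetical, and Python's `re` is not the regex engine HAProxy uses, so treat results as a rough guide rather than an exact replay.

```python
import re
from urllib.parse import unquote


def compile_patterns(lines):
    """Compile non-empty, non-comment lines as case-insensitive regexes."""
    compiled = []
    for line in lines:
        line = line.strip()
        if line and not line.startswith("#"):
            compiled.append(re.compile(line, re.IGNORECASE))
    return compiled


def first_match(path, patterns):
    """Return the first pattern that matches the URL-decoded path, or None."""
    decoded = unquote(path)
    for pattern in patterns:
        if pattern.search(decoded):
            return pattern.pattern
    return None


# Hypothetical patterns; a real run would read the lines of the ACL file.
patterns = compile_patterns(["# sample", r"union\s+select", r"\.\./"])
print(first_match("/search/UNION%20SELECT", patterns))
print(first_match("/index.html", patterns))
```

Running known-good URLs from your own access logs through `first_match` is a cheap way to spot likely false positives before the deny rule goes live.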

Full guides — with logging, whitelists, and tuning — live in the docs.

Bad-bot example output (Nginx)

# http context
map $http_user_agent $bad_bot {
    default 0;
    "~*AhrefsBot"  1;
    "~*SemrushBot" 1;
    "~*MJ12bot"    1;
    "~*GPTBot"     1;
}

# server or location context
if ($bad_bot) { return 403; }

The default list blocks SEO crawlers, AI training bots, and known scanners while explicitly allowing major search engines (Google, Bing, DuckDuckGo, Yandex, Baidu).
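A map like the one above can be rendered from a plain list of User-Agent names. The sketch below is not the logic of `badbots.py`; the `ALLOWED` tuple and `render_bot_map` are hypothetical, and the allow-list simply uses the crawler tokens of the search engines named above.

```python
import re

# Hypothetical allow-list: crawler tokens of the search engines named above.
ALLOWED = ("Googlebot", "bingbot", "DuckDuckBot", "YandexBot", "Baiduspider")


def render_bot_map(user_agents, allowed=ALLOWED):
    """Render an Nginx map block that flags each listed User-Agent as a bad bot."""
    allowed_lower = {name.lower() for name in allowed}
    lines = ["map $http_user_agent $bad_bot {", "    default 0;"]
    seen = set()
    for agent in user_agents:
        name = agent.strip()
        key = name.lower()
        # Skip blanks, allow-listed crawlers, and case-insensitive duplicates.
        if not key or key in allowed_lower or key in seen:
            continue
        seen.add(key)
        # Escape regex metacharacters so the name is matched literally.
        lines.append(f'    "~*{re.escape(name)}" 1;')
    lines.append("}")
    return "\n".join(lines) + "\n"


print(render_bot_map(["AhrefsBot", "Googlebot", "ahrefsbot", "MJ12bot"]), end="")
```

Filtering against the allow-list at generation time, rather than adding exceptions in the Nginx config, keeps the emitted map a flat list that is easy to diff between daily builds.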

Automation

Workflow	Schedule	Purpose
`update_patterns.yml`	Daily + manual	Re-fetch CRS, regenerate every backend, publish a release
`test_nginx.yml`	On PR	Validate generated Nginx rules against a live container
`test_apache_docker.yml`	On PR	Validate generated Apache rules against ModSecurity in Docker
`docs.yml`	On `docs/` change	Build and deploy the VitePress docs to GitHub Pages

All workflows target the runner-02 self-hosted runner by default. To fall back to GitHub-hosted runners, set the repository variable RUNS_ON to '["ubuntu-latest"]'.

Documentation

The full documentation lives at fabriziosalmi.github.io/patterns — built with VitePress and deployed automatically.

Contributing

1. Fork the repository.
2. Create a feature branch: git checkout -b feature/your-change.
3. Commit and push.
4. Open a pull request; the test workflows will run automatically.

See CONTRIBUTING.md for details and SECURITY.md for the disclosure policy.

License

Released under the MIT License.

Resources

OWASP Core Rule Set
ModSecurity
Nginx · Apache HTTPD · Traefik · HAProxy
ai.robots.txt — upstream AI-bot list

Built and maintained by Fabrizio Salmi.