Commit Graph

6317 Commits

Author SHA1 Message Date
Karl Seguin
875c147783 Main/Network reads CDP socket
Previously, the CDP socket was added to the worker's multi and fully owned
by the worker. While this is simple, it introduced some issues:

1 - Cannot detect a disconnected client during JS processing ( for(;;) )

2 - A blocked worker can cause back-pressure that blocks the client. This can
    cause a deadlock if the worker is blocked waiting for a CDP message

In addition to these 2 problems, there was 1 other serious CDP-related issue:
arbitrary CDP messages could be processed during JavaScript callback. For
example, a Worker calls importScripts while request interception is enabled,
this requires us to tick the HttpClient waiting for the interception response.
But, a client could sent Target.closeTarget, which we'd process and delete the
frame..all while importScripts is still blocked. Assuming importScripts unblocks
everything is a big UAF since the frame (and its workers) were cleared from
closeTarget.

The CDP socket is now read from the network (main) thread and an OTP-style
mailbox is used. The network thread posts message to the Worker's inbox and
signals it to wakeup. This solves #1 and #2. It doesn't directly solve the
reentrancy issue, but it provides the foundation. Specifically, in introduces
a queue for of CDP message and more control over when/how that queue is
processed. At "safe points" (Runner.tick, HttpClient.tick), any message can
be processed. But, when inside a JavaScript callback, we can process only non-
destructive/mutating message. Specifically, we can process only messages related
to request interception.
2026-05-19 20:52:21 +08:00
Karl Seguin
8ef6084fdb Re-organization CDP connection
network/WsConnection.zig was poorly named. It didn't represent a generic WS
connection, but rather a CDP-specific connection. This splits the generic WS
logic into network/WS.zig and the CDP-specific details in cdp/Connection.zig.

Some of the connection management in the Server has also been simplified.
2026-05-19 10:08:22 +08:00
Halil Durak
bdd456f76c Merge pull request #2491 from willmafh/improve-code-readability
more clean validateCookieString function to improve code readability
2026-05-18 17:53:45 +03:00
willmafh
2f66edc9b9 more clean validateCookieString function to improve code readability 2026-05-18 22:29:01 +08:00
Karl Seguin
b83cd9262b Merge pull request #2490 from lightpanda-io/blocking_read_failure_handling
On blocking read failure, break from loop
2026-05-18 21:19:40 +08:00
Karl Seguin
49aa0ad1a9 On blocking read failure, break from loop
Blocking read failure almost certainly means a disconnect client. As-is, that's
an endless loop. Instead, fail the request.
2026-05-18 19:44:25 +08:00
Pierre Tachoire
23a3d5476b Merge pull request #2458 from lightpanda-io/nikneym/cli-help-rework
`help`: rework `help` command
2026-05-18 11:54:29 +02:00
Pierre Tachoire
8b098a3c97 Merge pull request #2488 from lightpanda-io/ci-mcp-smoke-jq-tighten 2026-05-17 12:50:23 +02:00
Adrià Arrufat
8981a6245c ci: tighten mcp-smoke jq assertions
Replace `grep '"id":N' | jq -e ...` with `jq -ec 'select(.id == N) | ...'`.
The grep form also matched `"id":10`, `"id":11`, ... and any tool description
containing that substring; numeric `select` is type-correct. `jq -e` still
fails the job when `select` produces no output (exit 4), so the smoke
semantics are preserved.

Also add `jq --version` up front so the job fails fast and loud if the
`ubuntu-latest` image ever stops shipping jq.
2026-05-17 10:43:03 +02:00
Pierre Tachoire
803e4303c2 Merge pull request #2481 from navidemad/ci-mcp-smoke
ci: smoke test the MCP stdio server
2026-05-17 10:39:18 +02:00
Pierre Tachoire
4e80db6cf0 Merge pull request #2483 from navidemad/dockerfile-pipefail-hygiene
Dockerfile: fix curl|sh pipefail; trim builder stage
2026-05-16 19:21:30 +02:00
Pierre Tachoire
a3944a3b40 Merge pull request #2484 from lightpanda-io/e2e_kill_between_steps
Force kill lightpanda between steps to prevent "port already in-use" …
2026-05-16 18:51:36 +02:00
Karl Seguin
ab63cfbf39 Merge pull request #2478 from navidemad/fix-c10-inline-media-evaluation
css: evaluate @media and matchMedia against viewport
2026-05-16 21:42:56 +08:00
Karl Seguin
d870972ceb Small tweaks to @media
- Depth counter when recursing
- Better comment support
- Small perf tweak (e.g. lowercase once into stack buffer before multiple
  compares)
- Few more test cases
2026-05-16 20:52:11 +08:00
Karl Seguin
21e74b46ea Merge pull request #2486 from willmafh/typo-fix
typo fix
2026-05-16 20:39:36 +08:00
willmafh
c52356b6d7 chore: lowercase demo word 2026-05-16 20:07:32 +08:00
willmafh
c1e64232e5 chore: typo fix 2026-05-16 20:05:52 +08:00
Karl Seguin
7f8cb145e6 Merge pull request #2485 from lightpanda-io/nikneym/timers-hash
`Timers`: prefer integer-optimized hashing
2026-05-16 16:52:53 +08:00
Halil Durak
33d594be43 Timers: prefer integer-optimized hashing 2026-05-16 10:19:33 +03:00
Karl Seguin
d926291241 Merge pull request #2467 from lightpanda-io/http_transfer
Cleanup HttpClient.Transfer
2026-05-16 08:52:12 +08:00
Karl Seguin
0b358fd410 Merge pull request #2474 from staylor/fix/2472-frame-id-reset
Fix #2472: scope frame ID generator to Browser, not Session
2026-05-16 08:46:27 +08:00
Karl Seguin
94e8b06583 Merge pull request #2482 from navidemad/make-v8-path
make: forward optional V8_PATH to zig build
2026-05-16 08:41:05 +08:00
Karl Seguin
a5c1068b85 Force kill lightpanda between steps to prevent "port already in-use" error in CI 2026-05-16 08:39:53 +08:00
Navid EMAD
54e09a5ace make: rename V8_PATH to generic ZIGFLAGS
Per review feedback, generalise the optional pass-through so any
`-D...` build option can be forwarded, not just the prebuilt V8 path.
2026-05-16 02:27:52 +02:00
Karl Seguin
5550b61d2d Merge pull request #2480 from navidemad/make-clean
make: add clean target
2026-05-16 07:35:09 +08:00
Karl Seguin
732e19c7b6 add cargo clean to html5ever 2026-05-16 07:34:35 +08:00
Karl Seguin
d3f3e7f335 Merge pull request #2475 from navidemad/fix-a41-json-undefined
js: emit `null` when JSON-stringifying unserializable values
2026-05-16 07:24:14 +08:00
Karl Seguin
2163a2fd5a Merge pull request #2463 from lightpanda-io/nikneym/nav-accept-header
Send `Accept` header when navigating
2026-05-16 06:39:40 +08:00
Navid EMAD
fd0700a572 dockerfile: fix curl|sh pipefail; trim builder stage
- Download rustup to a file then execute, so a failed curl is not
  masked by sh's exit code under /bin/sh (no pipefail).
- Add --no-install-recommends and apt-list cleanup to both apt stages
  (stage 0 drops from 156 to 116 packages, 1144 MB to 605 MB).
- Add --retry 3 --retry-delay 2 to all 4 external downloads.
- Use git clone --depth 1 (28 MB to 9.6 MB working tree).
- Drop -v from tar for minisign and zig extractions (log noise only).

Final shipped image is unchanged; the wins live in the builder stage
and build-cache footprint.
2026-05-15 23:45:06 +02:00
Navid EMAD
f08a1fef12 ci: smoke test the MCP stdio server
Sends initialize + notifications/initialized + tools/list over stdin
and asserts the JSON-RPC responses with jq. Catches regressions in
the agentic surface (./lightpanda mcp) without needing a node client.

Reuses the existing lightpanda-build-release artifact, so the new
job costs about a minute on top of zig-build-release.
2026-05-15 22:53:38 +02:00
Navid EMAD
d1a0203d88 make: forward optional V8_PATH to zig build
Without -Dprebuilt_v8_path, the build/test targets rebuild V8 from
source (10+ minutes per invocation). Contributors who already have a
cached archive can now short-circuit by exporting V8_PATH:

    V8_PATH=v8/libc_v8.a mise exec -- make test

When V8_PATH is empty (default), behavior is unchanged.
2026-05-15 22:53:00 +02:00
Navid EMAD
ee1cbf1bb3 make: add clean target 2026-05-15 22:51:54 +02:00
Navid EMAD
dd5e335262 css: harden media-query evaluator and @media boundary
Address review feedback on PR #2478:

- MediaQuery.zig: strip CSS `/* ... */` comments before tokenization so
  `screen and /*x*/ (min-width: 1px)` evaluates the same as without the
  comment.
- MediaQuery.zig: bound-check `em` / `rem` multiplication via
  `std.math.mul` so a u32-overflowing length (e.g. `268435456em`) fails
  closed instead of panicking in debug or wrapping in release.
- StyleManager.zig: prelude brace search skips `/* ... */` comments, so
  `@media /* { fake */ screen { ... }` splits at the real opening brace
  rather than the one inside the comment.
- Tests: unit tests for stripped comments, em/rem overflow, and
  unimplemented units (cm/mm/pt/in/vw). HTML fixtures cover commented
  preludes/queries and the `replaceSync` cascade path.
2026-05-15 20:49:45 +02:00
Navid EMAD
68bd1441af css: evaluate @media and matchMedia against viewport
Inline `@media` rules were parsed but never applied to the cascade, and
`window.matchMedia(q).matches` always returned false. Add a Media Queries
Level 4 subset evaluator (`width`/`height`/`orientation`, lengths in
`px`/`em`/`rem`, comma OR, `and`, `not`, `only`) wired into both surfaces.
External `<link rel="stylesheet">` fetch remains out of scope; the
evaluator reads the same 1920x1080 viewport already exposed by
`Window.innerWidth` / `innerHeight`.

Closes #2477
2026-05-15 20:30:01 +02:00
Navid EMAD
353be6382d js: emit null when JSON-stringifying unserializable values
V8's `JSON::Stringify` finishes by calling `Object::ToString` on whatever
`i::JsonStringify` returns. For values that `JSON.stringify` treats as
non-serializable at the top level (`undefined`, functions, symbols),
`i::JsonStringify` yields the undefined sentinel and `ToString` coerces
it to the JS string `"undefined"`. `Value.jsonStringify` then wrote those
9 bytes raw via `writer.writeAll`, embedding a bare `undefined` token in
the JSON stream — invalid per RFC 8259 and rejected by any strict-JSON
CDP client. Detect the sentinel and emit JSON `null` instead, matching
what `JSON.stringify` produces when the same value sits in an array slot
(`JSON.stringify([undefined])` → `"[null]"`).

Closes #2473
2026-05-15 19:21:57 +02:00
Scott Taylor
4bd2edb596 Fix #2472: scope frame ID generator to Browser, not Session
CDP target IDs (`FID-{d:0>10}`) must stay unique for the lifetime of
the CDP connection -- Playwright's `CRBrowser._onAttachedToTarget`
asserts on duplicates and the assertion is fatal (the connection is
unusable afterwards).

Before this fix, `Session.frame_id_gen` reset to 0 in two places:

  1. `tearDownActivePage` explicitly reset to 0 after every page
     teardown (likely intended to mimic pre-pending-page numbering
     within a single Session, but invisible there because the
     immediately-following `installNewActivePage` typically reuses
     the old frame's explicit `frame_id`, see `replaceRootImmediate`).

  2. Fresh Sessions started from the field default of 0. Each
     `Target.createBrowserContext` calls `Browser.newSession`, which
     deinits the old Session and constructs a new one -- so even
     without (1), the next BrowserContext's first page would still
     get `FID-0000000001`.

(2) is what trips Playwright on the second `browser.newContext()`
on a connection: the second context's first frame re-issues
`FID-0000000001`, identical to the first context's frame, and
Playwright's `CRBrowser._onAttachedToTarget` raises
`Duplicate target FID-0000000001`.

Move `frame_id_gen` (and `nextFrameId`) from `Session` to `Browser`,
which is per-CDP-connection. Existing callers (`Session.createPage`,
`Frame.zig:1327`, `Frame.zig:1437`, `Worker.zig:74`) still go through
`Session.nextFrameId` -- it's now a thin pass-through to
`browser.nextFrameId()` -- so no call sites change. Removed the
explicit reset in `tearDownActivePage`; it was redundant within a
Session (root navigation reuses the old frame_id) and harmful across
Sessions.

`loader_id_gen` stays on Session: Loader IDs (`LID-...`) are scoped
per-frame in CDP and Playwright doesn't track them in the target
registry, so the per-Session reset is correct there.

Repro (`playwright-core@1.58.2`):

  for (let i = 1; i <= 3; i++) {
    const ctx = await browser.newContext();
    await ctx.newPage();
    await ctx.close();
  }

Before: cycle 2 throws `Duplicate target FID-0000000001`.
After: 5/5 cycles complete cleanly.

Tests: 653/653 pass. Added regression coverage in
`cdp.target: createTarget assigns unique IDs across BrowserContexts
(issue #2472)` -- verified to fail against the original source
(reverted Browser.zig and Session.zig, kept the test, ran zig build
test: only the new test fails).
2026-05-15 13:17:49 -04:00
Halil Durak
34557e3993 cli.zig: update doc comment 2026-05-15 18:31:10 +03:00
Halil Durak
658df6e500 cli.zig: support lightpanda help <command>
Another variation to receive `help` text for a specific command.
2026-05-15 18:31:10 +03:00
Halil Durak
3489129f68 main.zig: changes for new help 2026-05-15 18:31:10 +03:00
Halil Durak
b2d8c2b834 help.zon: introduce help.zon
Separates `help` explanation from configuration.
2026-05-15 18:31:09 +03:00
Halil Durak
f361f12316 cli.zig: change the way help command and sub-command detected
`cli.zig` is now aware of `help` command at all situations and creates it by itself. Instead of using errors, it initializes `Command` union where `help` branch is active.
2026-05-15 18:31:09 +03:00
Halil Durak
c993ba48a9 cli.zig: rewrite doc comment 2026-05-15 18:31:09 +03:00
Halil Durak
9c40cd9fb2 send Accept header when navigating 2026-05-15 18:30:18 +03:00
Pierre Tachoire
4f33d64c5c Merge pull request #2433 from lightpanda-io/webmcp
Implement webMCP API and webMCP cdp domain
2026-05-15 16:13:12 +02:00
Pierre Tachoire
60e3d48dbd webmcp: update comments 2026-05-15 15:12:27 +02:00
Pierre Tachoire
f00c0ab276 webmcp: implement abortSignal with _dependent 2026-05-15 13:11:59 +02:00
Pierre Tachoire
3803a1f8c6 webmcp: use value.jsonStringify for JSON write 2026-05-15 11:17:53 +02:00
Pierre Tachoire
dbb9b31061 webmcp: fix invoke callback with correct ModelContextClient param 2026-05-15 11:00:56 +02:00
Karl Seguin
64c5843e9e Merge pull request #2466 from lightpanda-io/pending_queue_pump
Clear pending destroy on createPage (a known safepoint).
2026-05-15 16:42:15 +08:00
Pierre Tachoire
7c5a3b211f cdp: cancel inflight webmcp invocation on bc deinit 2026-05-15 08:50:48 +02:00