browser

mirror of https://github.com/lightpanda-io/browser.git synced 2026-06-11 17:46:32 -04:00

Author	SHA1	Message	Date
Karl Seguin	dd99102f4b	Defer HTTP completion callbacks to next tick Client.makeRequest used to call self.perform(0) after handing the transfer to libcurl. That perform() does two things: drives curl_multi_perform (so bytes hit the wire) AND drains curl_multi_info_read messages, which is what fires the user-facing header/data/done callbacks. The issue is that, even in non-cache cases, a request could be immediately resolved in libcurl, and thus callbacks executed synchronously. By only calling `curl_multi_perform` on a new request, we prevent this from happening.	2026-05-13 15:59:29 +08:00
Karl Seguin	2fcad23834	Buffer worker postMessages received before script load completes	2026-05-13 15:59:29 +08:00
Pierre Tachoire	854eb6a62d	Merge pull request #2339 from lightpanda-io/cdp-console cdp: implement Console	2026-05-13 08:28:01 +02:00
Karl Seguin	bd4f4c89e1	Merge pull request #2440 from staylor/scott/fix-worker-context-exit-with-proxy Add LP.configureLoading worker + --disable-workers opt-out for Web Worker loading	2026-05-13 12:29:43 +08:00
Karl Seguin	10a5597aba	Merge pull request #2435 from navidemad/fix-b12-htmldialogelement-methods dom: implement HTMLDialogElement.{show, showModal, close}	2026-05-13 12:17:20 +08:00
muki	cc927c98ec	Merge pull request #2424 from lightpanda-io/nix-wpt-run Ability to run wpt with Nix/NixOS	2026-05-12 20:58:39 -07:00
Muki Kiboigo	06c2474376	use commit sha instead of branch name	2026-05-12 20:58:13 -07:00
Karl Seguin	393141e472	pass arena into handlers (consistent with other handlers)	2026-05-13 11:51:59 +08:00
Scott Taylor	b2998470c2	Add --disable-workers + LP.configureLoading worker opt-out Adds two ways to opt out of dedicated Web Worker loading entirely. The Worker constructor still returns a Worker object so calling pages don't throw, but no script fetch is initiated and the worker scope's eval never runs (postMessage from the page is queued indefinitely with no handler to drain it). * CDP method LP.configureLoading { worker: bool } -- per-session toggleable at runtime, alongside the existing { subFrame: bool }. Both fields are now optional so callers can flip one without resetting the other to its default. Backwards-compatible. * CLI flag --disable-workers -- process-wide default applying to every session and to the fetch subcommand. Operators can flip it on without any driver changes. Mirrors --disable-subframes (#2401) exactly. ## Motivation Reliably-reproducible SIGABRT in Worker.loadInitialScript whenever a page constructs a Web Worker AND lightpanda is launched with --http_proxy. Crash signature: $msg="V8 fatal callback" location=v8::Context::Exit() message="Cannot exit non-entered context" Stack: _browser.webapi.Worker.loadInitialScript _browser.webapi.Worker.httpDoneCallback _network.layer.InterceptionLayer.InterceptContext.doneCallback _browser.HttpClient.processMessages _browser.HttpClient.perform _browser.HttpClient.tick The Zig-side Enter/Exit pair around the worker's eval doesn't match v8's entered_contexts stack invariant under that timing -- something upstream of the loadInitialScript Exit leaves an extra Enter on the stack, so v8's Utils::ApiCheck (`isolate->context() == env`) fires and the process aborts. Reproducible against any Shopify storefront PDP (e.g. https://weareallbirds.myshopify.com/products/mens-wool-runners) when served through any HTTP proxy -- the proxy just adds enough latency to surface the race; the same code path runs without --http_proxy but the timing window is too tight to reliably hit. The Allbirds trigger script is the Shopify web-pixel-extension worker, but ANY Worker the page constructs hits the same code path. The proper fix needs the v8 entered-contexts invariant to be restored end-to-end through the worker eval. That's a deeper dig into how Worker.loadInitialScript / WorkerGlobalScope.importScript / ls.local.runMacrotasks compose with v8's microtask queues across multiple contexts; I tried three intermediate fixes (deferring loadInitialScript via the frame scheduler when other scripts are mid-eval, replacing the post-eval cross-context runMacrotasks with worker-only PerformCheckpoint, and removing runMacrotasks entirely) and none stopped the crash. The bug is fired from inside the synchronous tick path before the post-eval microtask handling runs, which means the leak happens during Script::Run itself and needs more targeted investigation. This PR is the workaround so users hitting the SIGABRT on storefront / analytics-heavy pages have a clean opt-in escape today. For our use case (product catalog extraction) Workers carry no extraction signal -- web-pixel sandboxes, analytics SDKs, marketing tag pixels, etc. -- so disabling them removes a fragile code path without any downside. ## Implementation `Session.worker_loading_enabled: bool = true` -- default matches existing behavior. `Worker.init` short-circuits AFTER constructing the Worker / WorkerGlobalScope / arena bookkeeping (so the JS `new Worker(url)` expression doesn't throw): if (!session.worker_loading_enabled) { log.debug(.browser, "worker disabled", .{ .url = resolved_url }); return self; } Two ways to flip the flag, mirroring the --disable-subframes pattern: 1. LP.configureLoading { worker: bool } -- both subFrame and worker are now optional fields in the params struct, so existing callers passing only { subFrame } continue to work unchanged. 2. --disable-workers CLI flag -- added to CommonOptions (so it applies to serve, fetch, mcp). New Config.disableWorkers() getter; Session.init reads it as the initial value. Total diff: +88 / -3 across 4 files (src/Config.zig, src/browser/Session.zig, src/browser/webapi/Worker.zig, src/cdp/domains/lp.zig). ## Verification Reproducer pattern (puppeteer-core 24.42.0 + tiny CONNECT-tunnel proxy on 127.0.0.1:9999, scripts in cdp-repros/): serve --host 127.0.0.1 --port 9222 --http_proxy http://127.0.0.1:9999 serve --host 127.0.0.1 --port 9222 --http_proxy http://127.0.0.1:9999 --disable-workers Driving https://weareallbirds.myshopify.com/products/mens-wool-runners: baseline (no --disable-workers): 5/5 SIGABRT in Worker.loadInitialScript with the v8 fatal callback above. with --disable-workers: 10/10 successful, returns full HTML (~1MB), no crash. Test suite: make test -> 637 of 637 tests passed (was 636/636 + new cdp.lp: configureLoading toggles subFrame and worker independently regression test). zig fmt --check ./.zig ./*/.zig -> clean. ## Notes * The CDP method is the same domain (LP.configureLoading) and same shape as --disable-subframes' driver-side opt-in, so existing Playwright / puppeteer integrations that already toggle subframes don't need a separate code path -- one CDP call can flip both. * worker_loading_enabled = false does NOT remove Worker from the global namespace (so feature-detection like `if (typeof Worker !== 'undefined')` still reports true). It just makes constructed workers no-op. Pages that postMessage to a worker and wait for a response will hang on that promise forever (or until the page is torn down). For our extraction use case that's fine -- we control the worklist timeout anyway -- but it's worth noting if upstream wants to surface the disabled state more strongly (e.g. throw from postMessage, or remove the global entirely behind an even-stricter flag). * Once the underlying v8 entered-contexts invariant is restored in Worker.loadInitialScript, this flag becomes a perf / sandboxing tool rather than a correctness workaround. Worth keeping anyway: blocking analytics / pixel workers is a reasonable thing to want. ## Related * #2400 -- the iframe analog to this issue (subframe nav invalidates executionContextId); same workaround pattern. * #2401 -- introduced --disable-subframes / LP.configureLoading { subFrame } that this PR mirrors exactly for workers.	2026-05-12 23:46:45 -04:00
Karl Seguin	2e159aaf12	Merge pull request #2436 from lightpanda-io/reset_frees_tasks on scheduler.reset, finalizer any remaining tasks	2026-05-13 09:06:02 +08:00
Karl Seguin	656e29476e	on scheduler.reset, finalizer any remaining tasks	2026-05-13 08:46:13 +08:00
Karl Seguin	d833eaa2e3	Merge pull request #2420 from lightpanda-io/http_client HttpClient Improvements	2026-05-13 08:15:16 +08:00
Karl Seguin	50b126b402	fix cachelayer hit path	2026-05-13 07:14:44 +08:00
Navid EMAD	989e2d03a2	dom: implement HTMLDialogElement.{show, showModal, close} The HTMLDialogElement constructor was exposed with `open` / `returnValue` IDL accessors, but the three instance methods that drive the open/close state were missing. Per HTML §4.11.4 (The dialog element), `show()` sets the `open` attribute if absent; `showModal()` throws `InvalidStateError` when the dialog is already open and otherwise sets `open`; `close()` removes `open`, optionally updates `returnValue`, and fires a non- bubbling `close` event. The non-rendering steps (focus trap, backdrop, top-layer placement) are intentional no-ops here — `[open]` reflecting through to selectors and the `close` event firing are what downstream CDP clients rely on. Closes #2434	2026-05-12 19:02:10 +02:00
Pierre Tachoire	4c58f8a6d0	Merge pull request #2432 from lightpanda-io/fix-build-banner-stderr build: print version banner to stderr, not stdout	2026-05-12 16:25:40 +02:00
Adrià Arrufat	842fbb78ef	build: print version banner to stderr, not stdout The build script wrote the version line to stdout, polluting any pipeline that captures program output via 'zig build run' or similar. Banners belong on stderr.	2026-05-12 15:12:50 +02:00
Karl Seguin	edc3d836d1	explicit track if transfer is queued for correct cleanup	2026-05-12 20:32:40 +08:00
Karl Seguin	caa47d50b2	fix build	2026-05-12 19:30:46 +08:00
Karl Seguin	73241dd1f7	don't mutate hashmap while iterating (in teardown)	2026-05-12 19:26:25 +08:00
Karl Seguin	5e0976bbd6	fix use-after-free on robotslayer shutdown	2026-05-12 19:26:24 +08:00
Karl Seguin	7869cbb68c	Delay Page destruction to avoid UAF Encapsulate resource ownership in a HttpClient.Owner	2026-05-12 19:26:24 +08:00
Karl Seguin	2dc3b4682b	abort frame-specific transfers	2026-05-12 19:26:24 +08:00
Karl Seguin	82a4fc752b	HttpClient Improvements 1 - Track owner of a request (for simpler / more accurate abort (TBD)) 2 - Create Transfer upfront, make everything work on Transfer (not Request) This helps remove ambiguity about cleanup and simplifies layers. For example Robots request is just another normal request, not a special case. This gives everything a stable address (the *Transfer which can be looked up by id)	2026-05-12 19:26:24 +08:00
Karl Seguin	49297d76d3	Merge pull request #2430 from lightpanda-io/worker_offscreen_canvas Enable ImageData and OffscreenCanvasRenderingContext2D on Worker	2026-05-12 18:04:13 +08:00
Karl Seguin	09747cf951	Enable ImageData and OffscreenCanvasRenderingContext2D on Worker These are currently reachable in Worker via the OffscreenCanvas, so when used they crash. WPT /html/canvas/offscreen/2d.conformance.requirements.basics.worker.html	2026-05-12 17:42:37 +08:00
Karl Seguin	8cad175c1d	Merge pull request #2427 from lightpanda-io/v8_inspector_string Update v8 dep	2026-05-12 15:51:48 +08:00
Karl Seguin	c8dc9d8938	Merge pull request #2428 from lightpanda-io/revert-pr-2394 Revert "Merge pull request #2394	2026-05-12 15:33:54 +08:00
Pierre Tachoire	5e7769552b	Merge pull request #2396 from navidemad/add-agents-md Add AGENTS.md and CLAUDE.md	2026-05-12 09:32:09 +02:00
Pierre Tachoire	538f194090	report test options examples from AGENTS to README	2026-05-12 09:30:22 +02:00
Karl Seguin	8bede8684e	Revert "Merge pull request #2394 from lightpanda-io/custom_element_reactions" This reverts commit `5c7097b6a0`, reversing changes made to `235f1cd9bc`.	2026-05-12 15:16:40 +08:00
Karl Seguin	bcd94c4f9d	Update v8 dep https://github.com/lightpanda-io/browser/issues/2407	2026-05-12 14:06:17 +08:00
Karl Seguin	e9b2aa4946	Merge pull request #2426 from lightpanda-io/feat/cdp-disable-iframes Feat/cdp disable iframes	2026-05-12 13:16:16 +08:00
Karl Seguin	cdfabf7953	Minor tweaks Remove repeating description/comment of flag. Change CDP lp command to be a general configuration endpoint.	2026-05-12 12:56:33 +08:00
Karl Seguin	353cf45b38	Merge pull request #2425 from lightpanda-io/subcommand_help Add per-subcommand help via `help` or `--help` argument	2026-05-12 10:52:44 +08:00
Karl Seguin	5c7097b6a0	Merge pull request #2394 from lightpanda-io/custom_element_reactions CustomElement Reactions	2026-05-12 10:49:46 +08:00
Karl Seguin	235f1cd9bc	Merge pull request #2386 from lightpanda-io/improve_elementFromPoint optimize elementFromPoint	2026-05-12 10:49:30 +08:00
Karl Seguin	58e7e72459	Merge pull request #2409 from lightpanda-io/abortsignal_any add AbortSignal.any static + AbortSignal reason can be a DOMException	2026-05-12 10:49:09 +08:00
Karl Seguin	f705d8b8e9	Merge pull request #2412 from lightpanda-io/events Improve events	2026-05-12 10:48:56 +08:00
Karl Seguin	45b1cc4762	Merge pull request #2411 from lightpanda-io/htmlcollection_string_indexer Fix HTMLCollection	2026-05-12 10:48:44 +08:00
Francis Bouvier	31ef5246bc	Add per-subcommand help via `help` or `--help` argument `lightpanda <subcommand> help`, `lightpanda <subcommand> --help` now print only the relevant subcommand options plus common options, instead of the full text. `lightpanda help <subcommand>` is also supported (and that's what use internally).	2026-05-11 22:16:52 +02:00
Muki Kiboigo	1822c33077	ability to run wpt with nix	2026-05-11 11:52:45 -07:00
Navid EMAD	827e696a4e	link AGENTS.md and CONTRIBUTING.md, add dev section Reciprocal pointers so humans landing in CONTRIBUTING.md discover the agent/operational conventions in AGENTS.md, and agents landing in AGENTS.md discover the CLA gate and pre-PR checks. Adds a short Development section (test + fmt commands) and a Before-opening-a-PR checklist to CONTRIBUTING.md; CLA paragraph preserved verbatim and moved to its own section.	2026-05-11 20:41:14 +02:00
Pierre Tachoire	0171a111a7	Merge pull request #2421 from lightpanda-io/agent-nightly ci: add agent pre-release nightly build	2026-05-11 18:34:16 +02:00
Pierre Tachoire	6a12ce7d0f	ci: invalidate cloudfront cache after build	2026-05-11 18:18:39 +02:00
Pierre Tachoire	f33f35d158	ci: add agent pre-release nightly build This ci would be deleted once agent branch is merged.	2026-05-11 18:08:54 +02:00
Karl Seguin	e56bd0862a	Merge pull request #2419 from lightpanda-io/nix-flake-update Update Nix Flake	2026-05-11 22:59:00 +08:00
Karl Seguin	3733abbf8a	Merge pull request #2413 from lightpanda-io/small_dom_fixes Various small DOM fixes, WPT driven	2026-05-11 21:58:01 +08:00
Karl Seguin	fe729f8b1f	CustomElement Reactions While this PR touches a lot of files, and isn't trivial, many of the changes are either: 1 - removing guards added in previous PRs, e.g. https://github.com/lightpanda-io/browser/pull/1969 https://github.com/lightpanda-io/browser/pull/2172 https://github.com/lightpanda-io/browser/pull/2313 https://github.com/lightpanda-io/browser/pull/2366 2 - Adding the `.ce_reactions = true` flag to various WebAPIs CustomElements have callbacks, e.g. connectedCallback. Also, many WebAPI calls are implemented as a series of mutations, e.g. appendChild = remove from current + append to new. These two things interact in an important way: when should callbacks execute? Before this PR, we were invoking callbacks at each individual step. This is (a) technically wrong and (b) breaks a lot of assumptions (the reason the above 4 PRs were needed to fix bugs). This PR adds a `_ce_reactions` queue to the frame. And, instead of invoking callbacks, we "enqueue" the reaction. At various boundaries, a scope is created the DOM manipulation is done, and then we pop the scope, invoking all queued reactions.	2026-05-11 21:50:24 +08:00
Muki Kiboigo	105f5028c8	update nix flake	2026-05-11 06:50:03 -07:00
Karl Seguin	b79870e07a	Merge pull request #2414 from lightpanda-io/test_timeout_config Allow HTML Tests to set a timeout	2026-05-11 21:49:57 +08:00

1 2 3 4 5 ...

6214 Commits