browser

mirror of https://github.com/lightpanda-io/browser.git synced 2026-06-11 09:35:59 -04:00

Author	SHA1	Message	Date
Halil Durak	3c7c08f822	`--inject-script`: don't error out if script execution fails	2026-05-11 15:24:08 +03:00
Halil Durak	19401dc950	`Config`: update `--inject-script` documentation	2026-05-11 15:15:36 +03:00
Halil Durak	d97081d71f	`--inject-script`: execute injected scripts if encountered <head> tag This now backs itself against html5ever to find <head>; the spec explicitly bans double-appearance of <head> tag and html5ever is aware of it.	2026-05-11 15:15:36 +03:00
Halil Durak	60d721caa2	`Config`: increase read file size for `--inject-script-file`	2026-05-11 15:15:36 +03:00
Halil Durak	246f91d1f8	`--inject-script`: prefer splice bytes into `<head>` directly	2026-05-11 15:15:35 +03:00
Halil Durak	c566d0c41c	introduce `--inject-script` and `--inject-script-file` * Prefer `--inject-` prefix. Support injecting multiple scripts (also allows using both variants together). * Instead of executing scripts in JS context, actually insert them to `<head>` for correct dump output.	2026-05-11 15:15:35 +03:00
Halil Durak	39f12a5669	`fetch`: add support for `--script` option Allows passing a JS file as an arg to be executed alongside other scripts.	2026-05-11 15:15:35 +03:00
Karl Seguin	08bc513fd9	Merge pull request #2416 from lightpanda-io/nikneym/link-crossorigin-getter-setter `HTMLLinkElement`: `crossOrigin` -> `crossorigin` for attributes	2026-05-11 20:12:37 +08:00
Karl Seguin	5ba0928635	Merge pull request #2415 from lightpanda-io/nikneym/link-media-getter-setter ` HTMLLinkElement`: add `media` getter/setter	2026-05-11 20:00:09 +08:00
Halil Durak	4e4e68e51c	`HTMLLinkElement`: update tests	2026-05-11 14:40:40 +03:00
Halil Durak	3c8c849947	`HTMLLinkElement`: `crossOrigin` -> `crossorigin` for attributes	2026-05-11 14:40:26 +03:00
Halil Durak	20c7bc14d2	`HTMLLinkElement`: update tests	2026-05-11 14:35:41 +03:00
Halil Durak	55a42fe5c6	`HTMLLinkElement`: add `media` getter/setter	2026-05-11 14:35:30 +03:00
Pierre Tachoire	d2151b6ffd	Merge pull request #2305 from navidemad/feat/xpath-1.0-evaluator xpath: implement XPath 1.0 (Document.evaluate, XPathResult, DOM.performSearch)	2026-05-11 10:01:28 +02:00
Karl Seguin	efbf1db87c	Merge pull request #2410 from lightpanda-io/fix_merge Try to fix a bad merge	2026-05-11 11:37:28 +08:00
Karl Seguin	0bbddb3179	Try to fix a bad merge https://github.com/lightpanda-io/browser/pull/2289 and https://github.com/lightpanda-io/browser/pull/2297	2026-05-11 11:25:40 +08:00
Karl Seguin	1bfefa3d58	Merge pull request #2289 from navidemad/fix-b2-page-navigation-history page: implement Page.getNavigationHistory and Page.navigateToHistoryEntry	2026-05-11 09:29:43 +08:00
Karl Seguin	92d617d649	Merge pull request #2404 from navidemad/fix-fetch-double-free-on-sync-error Fix double-free in fetch when http_client.request fails synchronously	2026-05-10 12:03:07 +08:00
Karl Seguin	520d968840	Merge pull request #2398 from staylor/fix/worker-importscripts-segfault Defer page teardown while worker scripts are evaluating	2026-05-10 11:08:49 +08:00
Karl Seguin	261059acbe	Merge pull request #2393 from lightpanda-io/scheduler_timeslice Add timeslice to scheduler	2026-05-10 10:33:21 +08:00
Scott Taylor	92607ad765	Defer page teardown while worker scripts are evaluating Worker scripts can call importScripts(), which performs a synchronous HTTP request via HttpClient.syncRequest. To stay responsive during a long fetch, syncRequest pumps the CDP socket (cdp.blocking_read) while waiting. If a CDP message such as Target.closeTarget arrives on that socket mid-fetch, the previous code path tore down the page immediately: Worker JS -> importScripts -> syncRequest -> blocking_read -> CDP dispatch -> Target.closeTarget -> Session.removePage -> Page.deinit -> Frame.deinit -> Worker.deinit (frees worker arena + identity_map) When control unwound back into the worker's eval, the next operation that hit ctx.identity.identity_map.getOrPut dereferenced the freed metadata pointer and segfaulted (sometimes immediately, sometimes a few connections later as the arena got recycled). Reproducer: any URL that loads dedicated workers calling importScripts during initial eval, driven via puppeteer-core's connectOverCDP. The allbirds.com product page (which loads ~8 web-pixel workers each calling importScripts) reliably triggered it within ~10 connections. Session.removePage already deferred when the frame's own ScriptManager.is_evaluating was set; that guard never tripped because worker scripts don't go through the frame's ScriptManager. Fix: * Worker.loadInitialScript now sets the worker's own _worker_scope._script_manager.is_evaluating around the eval, with save/restore so nested worker evals compose correctly. * WorkerGlobalScope.importScript also sets its own _script_manager.is_evaluating around the syncRequest + runMacrotasks. The typical caller (Worker.loadInitialScript) already sets this around its outer eval, so the outer guard usually covers us; the inner mark is defense-in-depth for callers that reach importScripts() from a setTimeout / microtask outside the loadInitialScript scope. * New Frame.anyScriptEvaluating method walks the frame tree (frame ScriptManager + every worker's ScriptManager + child frames) and returns true if any is mid-eval. Session.removePage and CDP.disposeBrowserContext use this in place of the frame-only check, deferring teardown until all evals unwind. Final cleanup happens at CDP.deinit on connection close, matching the existing deferred-teardown contract. Verified by running the puppeteer-core repro back-to-back against a single Lightpanda serve; all returned 200 with the right title, no UAF crashes (was previously crashing within 1-10 runs). All 521 unit tests still pass. Note: a separate, pre-existing latent V8 issue surfaces under stress on this same code path. After many iterations a Runtime.evaluate promise tracked by V8's inspector PromiseHandlerTracker is discarded during garbage collection's first-pass weak callbacks; the discard sends a failure response which triggers v8::String::NewFromOneByte, hitting the debug-only assertion AllowHeapAllocation::IsAllowed() in heap-allocator-inl.h:79 (no allocations allowed during weak callbacks). This reproduces on a baseline build of this PR commit and on a baseline build of just the original two-line is_evaluating fix \u2014 i.e. it is not introduced by the deferral logic. The deferral makes it more visible because inspector callbacks now live longer before teardown, so they are more likely to be alive during a GC. Tracking this as a follow-up; the fix here still resolves the UAF that was crashing the server immediately.	2026-05-09 17:26:41 -04:00
Navid EMAD	d7e283fed9	Don't propagate http_client.request errors from Fetch.init When http_client.request fails synchronously (e.g. RobotsLayer returning RobotsBlocked because robots.txt is already cached), Client.request invokes our error_callback before returning the error. httpErrorCallback rejects the promise and releases response._arena. Letting the error propagate from Fetch.init also fires the `errdefer response.deinit`, double-freeing the arena and corrupting the arena pool — eventually surfacing as a malloc abort during teardown. Fixes #2403.	2026-05-09 14:57:08 +02:00
Navid EMAD	d8b9391e33	xpath: drop internal references from comments Strip mentions of the private gem and its internal paths from xpath module docstrings, the conformance test header, and the dom dispatch heuristic. Comments now describe behavior directly without pointing at sources public readers can't access.	2026-05-08 08:58:07 +02:00
Navid EMAD	0b0a34c4a2	cdp: match closed set of axis names in isXPathQuery The previous `::` heuristic accepted any identifier-like character before `::`, which misrouted CSS pseudo-elements (`a::before`, `div::after`) to the XPath evaluator. Walk back the run of [a-zA-Z-] characters and look the candidate up in a StaticStringMap of the 13 XPath 1.0 named axes, so only real axis names match.	2026-05-08 08:44:32 +02:00
Karl Seguin	9830da04d8	Naming convention fixes Disable xpath_perf benchmark from test run as its quite verbose.	2026-05-08 08:44:31 +02:00
Navid EMAD	ce722c1f6e	xpath: extend fast path to non-positional descendant queries Generalizes 8733e33b's //tag[@id='x'] shape: tryFusedDescendantFastPath handles any //tag[safe] or .//tag[safe] where the predicates are non-positional boolean/node-set checks. Walks the search root's descendants once in document order, applies node test + predicates inline, no per-step materialization, no dedup. 5-9x on //div, //, //[@class='x'], //div[contains(...)]; ~25x on (//div)[1] and count(//div) where the inner path is the shape. Safety gate rejects predicates that could produce a number at the top level (number, neg, arithmetic binop, numeric-returning fn-call) and any predicate containing position()/last() anywhere. Conservative: a nested sub-path's local positional predicate is rejected even though it's scoped to its own axis.	2026-05-08 08:44:31 +02:00
Navid EMAD	c4c700f7ab	xpath: id-lookup fast path + perf benchmark evalPath recognizes //tag[@id='x'] and .//tag[@id='x'] (plus the //[@id='x'] wildcard) and serves them via frame.getElementByIdFromNode. ~100-150x speedup on ID lookups (3231us -> 22.6us for //[@id='target'] in the new benchmark). Falls through to general path on any deviation (extra step, extra predicate, non-eq, non-literal RHS). Inherits the same duplicate-ID compromise selector/List.zig ships for querySelector(All): the id-map stores only the first element per ID in document order. Capybara/Selenium hot paths assume unique IDs. tests/xpath/xpath_perf.html is the 13-query micro-benchmark used to collect the numbers; batched console.warn output survives test runner interleaving.	2026-05-08 08:44:31 +02:00
Navid EMAD	379664044e	xpath: apply review correctness feedback - Document.evaluate / XPathEvaluator.evaluate / XPathExpression.evaluate: result_type / requested_type now optional u16 defaulting to ANY_TYPE (matches WHATWG: `optional unsigned short type = 0`). context_node stays nullable with a fallback to the document — preserves the polyfill's behavior asserted by the `default_context` fixture - ast.zig NodeTest: clarify that namespaced names (`prefix:*`, `prefix:local`) are stored verbatim and fall through to a literal match against the node name — consistent with the `namespace::` axis stub (decision #3). Adds a TODO for if the polyfill ever drops the stub - Parser: cap recursive descent at depth 64 with new error.MaxDepthExceeded; depth tracked across parseExpr (parens, predicates, function args) and parseUnaryExpr (chained `-`). Two regression tests cover deep parenthesization and deep unary minus	2026-05-08 08:44:31 +02:00
Navid EMAD	94bcee6322	xpath: apply review style/convention feedback - Rename Result.zig / Ast.zig / Functions.zig to snake_case (no top-level fields per Zig style guide) - Restructure imports across xpath module: lib (std/lp) → relative (further → nearer) → aliases - Move `frame` to last parameter on Evaluator.evaluate, searchAll, Functions.call, idFn (matches js bridge convention); call sites updated in webapi/XPath{Result,Expression}.zig and cdp/domains/dom.zig - Local-pos style in XPathResult.iterateNext	2026-05-08 08:44:31 +02:00
Navid EMAD	e7c3e77c41	xpath: match CDATASection in text() node test Per XPath 1.0 §5.7, the data model has no CDATASection node — CDATA content is part of the text node value. The text() node test was only matching DOM nodeType 3 (Text), silently excluding CDATA sections (nodeType 4) parsed via DOMParser/XMLDocument and inline foreign content like SVG with embedded scripts.	2026-05-08 08:44:31 +02:00
Navid EMAD	a4abbb6d13	xpath: cache attribute axis nodes via frame lookup The attribute axis was calling Entry.toAttribute on every visit, materializing fresh Attribute structs (plus duped name/value strings) into page-lifetime storage. Repeated XPath queries — the Capybara/ Selenium polling pattern this PR targets — accumulated unbounded copies for the same DOM entries. Route through frame._attribute_lookup so each Entry resolves to a single cached Attribute, matching List.getAttribute and NamedNodeMap.getAtIndex.	2026-05-08 08:44:30 +02:00
Navid EMAD	33714a4dfd	cdp: tighten isXPathQuery '::' heuristic A bare indexOf("::") matched CSS pseudo-elements (a::before) and attribute values containing '::' ([data-x="x::y"]), misrouting them to the XPath evaluator. Require an axis-name shape ([a-zA-Z-]) immediately before '::' so only real axis specifiers like descendant::p are dispatched to XPath.	2026-05-08 08:44:30 +02:00
Navid EMAD	0fcd47e1e1	xpath: dupe expression into arena before parsing The Parser borrows string slices from its input for AST literals, names, and var refs. Without duping, the AST holds slices into the JS call_arena, which is reset when the top-level call returns — every subsequent evaluate() of a cached XPathExpression would dereference freed memory.	2026-05-08 08:44:30 +02:00
Navid EMAD	290fc7a9df	xpath: implement XPath 1.0 evaluator Ports the capybara-lightpanda XPath 1.0 polyfill into Lightpanda. Exposes the WHATWG Document.evaluate / XPathResult / XPathEvaluator / XPathExpression surface and routes CDP DOM.performSearch XPath queries through the new evaluator. The capybara-lightpanda gem can drop its ~700-line JS polyfill in the next release. New module src/browser/xpath/ (Tokenizer, Parser, Ast, Evaluator, Functions, Result). New webapi types XPathResult, XPathExpression, XPathEvaluator. Coverage and stubs match the polyfill 1:1 — see capybara-lightpanda/XPATH_COMPLIANCE.md for the full spec. Tests: 91-case conformance + result-API + evaluator-API + CDP fixtures, plus the engine's Zig unit suite (601/601 pass).	2026-05-08 08:44:30 +02:00
Karl Seguin	c633617544	Add timeslice to scheduler Give scheduler a 500ms timeslice to run per queue (high/low priority). A site can load hundreds of timeouts to all execute at the same time. These can be relatively expensive (e.g. lots of calls directly or indirectly to getBoundingClientRect). As-is, the scheduler drains its queue to completion and other timeouts, like --wait-ms can't do what they're meant to do. By adding timeslice, we prevent many tasks all scheduled for the same time to go unchecked. I was initially planning on putting this higher in runMacrotasks, but that could lead to starvation, i.e. if the first context used up all the time. Having it per context is more fair, at the cost of running 500ms * context. But, (a) the number of context we allow is fixed and (b) the reality is that most sites have few contexts and normally only the first one is doing anything interesting.	2026-05-08 10:34:47 +08:00
Karl Seguin	6e9156a86f	Merge pull request #2389 from lightpanda-io/interception-layer-on-serve InterceptionLayer only on `.serve` mode	2026-05-08 06:51:03 +08:00
Karl Seguin	97f95a992e	Merge pull request #2388 from lightpanda-io/cache-clear Add Clear to Cache and FsCache	2026-05-08 06:50:35 +08:00
Karl Seguin	ffee9e67ce	Merge pull request #2377 from lightpanda-io/page_dom_version Track DOM version on the page	2026-05-08 06:45:16 +08:00
Karl Seguin	b8674cd252	Merge pull request #2379 from lightpanda-io/setter_and_static_arity Give setters an arity of 1	2026-05-08 06:45:00 +08:00
Muki Kiboigo	66293ebc99	only enable InterceptionLayer on .serve mode	2026-05-07 09:03:40 -07:00
Muki Kiboigo	14e1f1bcf6	add clear fn to Cache and FsCache	2026-05-07 09:00:33 -07:00
Pierre Tachoire	1131cb09ff	Merge pull request #2387 from lightpanda-io/zig-fmt-ci Fix zig fmt step in CI	2026-05-07 17:49:15 +02:00
Muki Kiboigo	e5e5f78928	fix formatting on EventTarget.zig	2026-05-07 08:10:43 -07:00
Muki Kiboigo	f54ee13b32	fix zig fmt step in CI	2026-05-07 08:09:33 -07:00
Karl Seguin	fbb8126cc4	Merge pull request #2378 from lightpanda-io/blank_navigation Protect assertion when reload from about:blank	2026-05-07 23:00:33 +08:00
Karl Seguin	823a7c480d	Merge pull request #2380 from lightpanda-io/illegal_instructor_capture_name On Illegal Constructor, try to capture name (for logs)	2026-05-07 23:00:13 +08:00
Karl Seguin	d4a210c5f1	Merge pull request #2385 from lightpanda-io/avoid_script_error_double_free On error, don't free headers	2026-05-07 22:11:58 +08:00
Karl Seguin	61497ffe3a	Merge pull request #2383 from lightpanda-io/css_static_binding Fix binding for static (in general) and specifically for CSS.escape	2026-05-07 20:30:40 +08:00
Karl Seguin	a4cf214040	Merge pull request #2382 from lightpanda-io/xhr_fix_teardown_order Fix the XHR teardown order	2026-05-07 20:05:55 +08:00
Karl Seguin	87b0c33344	Merge pull request #2384 from lightpanda-io/silence_test_warns Silence test warnings	2026-05-07 19:50:57 +08:00

1 2 3 4 5 ...

6139 Commits