Input events dispatch to top-level frame #1847

sadym-chromium · 2024-10-22T15:58:38Z

As discussed in w3c/webdriver-bidi#795, the actions should be dispatched from the top-level browsing context.

Preview | Diff

index.html

sadym-chromium · 2024-10-23T09:29:51Z

@whimboo WDYT?

OrKoN · 2024-10-23T11:18:12Z

The following two things seem to be still missing:

steps to convert the coordinates to the top-level browsing context coordinate space if they are frame scoped. Edit: unless this is covered by the implementation-defined bits?
apply the same fix to other actions (I assume we do actually want to dispatch keyboard events through the top-level frame as well)

whimboo · 2024-10-23T12:15:10Z

The following two things seem to be still missing:
1. steps to convert the coordinates to the top-level browsing context coordinate space if they are frame scoped. Edit: unless this is covered by the implementation-defined bits?

Yes, absolutely. By changing the way where we dispatch the events the coordinates should be exactly the same. Not taking care of offsets would cause quite a lot of regressions for those users who make use of actions a lot.

Therefore see:

web-platform-tests/wpt#48147
web-platform-tests/wpt#48123

Given that Chrome seems to already dispatch actions in the top-level browsing context the referenced tests are failing.

2. apply the same fix to other actions (I assume we do actually want to dispatch keyboard events through the top-level frame as well)

Yes, we would have to do it for all the pointer and wheel input sources.

Keyboard actions are more challenging because they could allow users to trigger shortcuts that access restricted browser features. This would require additional checks to determine what actions are permitted and which ones should be blocked. Maybe we should have a follow-up issue for it?

sadym-chromium · 2024-10-23T13:33:44Z

apply the same fix to other actions (I assume we do actually want to dispatch keyboard events through the top-level frame as well)

Aren't all the actions end up in perform implementation-specific action dispatch steps?

sadym-chromium · 2024-10-23T13:35:17Z

@OrKoN @whimboo I added calculation of the offset relative to the parent browsing context, but IDK how well it will work

sadym-chromium · 2024-10-23T13:41:12Z

index.html

+ such that trusted events corresponding to the entries in
+ <var>list of events</var> are dispatched.


Alternatively we can "hand-wave" hear like "These steps must be equivalent to user trying to perform the given input device manipulations on context through the top-level browsing context."

@OrKoN @whimboo we can keep specification implementation-specific, just clarify the events are dispatched via top-level browsing context. I'm not sure if that approach would work with the events intercepted by the top-level frame.

whimboo · 2024-10-23T13:44:16Z

@OrKoN @whimboo I added calculation of the offset relative to the parent browsing context, but IDK how well it will work

I think that we should have more wdspec tests (similar to mine for mouse) so we cover at least each input source (including sub sources) and verify implementations against. Did you get Chrome working for my example?

OrKoN · 2024-10-23T13:46:46Z

@sadym-chromium it looks like each action type dispatches on the context parameter https://w3c.github.io/webdriver/#dfn-dispatch-a-keydown-action

apply the same fix to other actions (I assume we do actually want to dispatch keyboard events through the top-level frame as well)

Aren't all the actions end up in perform implementation-specific action dispatch steps?

that's right, overlooked that.

OrKoN · 2024-10-23T13:52:16Z

Keyboard actions are more challenging because they could allow users to trigger shortcuts that access restricted browser features. This would require additional checks to determine what actions are permitted and which ones should be blocked. Maybe we should have a follow-up issue for it?

so actually I am not sure, even with this change it is not quite defined what it means that the actions are dispatched to the top-level browsing context. For example, in Chrome that is the case but still would not give you access to browser shortcuts.
Perhaps we need a better definition of what the dispatch of an event means, perhaps hook into native event handles in https://www.w3.org/TR/uievents/#handle-native-mouse-move-id?

whimboo · 2024-10-23T14:48:22Z

so actually I am not sure, even with this change it is not quite defined what it means that the actions are dispatched to the top-level browsing context. For example, in Chrome that is the case but still would not give you access to browser shortcuts.

Yes you are right. I mixed it up with the native event dispatching that I'm working on as well right now. In those cases we would have that particular issue but not when dispatching it in the content process of the top-level browsing context.

OrKoN · 2024-10-23T18:51:05Z

index.html

- must be equivalent to performing the given input device manipulations
- on <var>context</var>, such that trusted events corresponding to the
- entries in <var>list of events</var>are dispatched.
+ dispatch steps</dfn> on a <var>context</var>, and a <var>list of events</var>


I think we should keep browsing context indicating the context var type

Yeah, I missed it.

index.html

OrKoN · 2024-10-23T19:01:59Z

index.html

@@ -8392,7 +8438,7 @@ <h3>Processing actions</h3>
   </dd>
  </dl>

- <li><p>Return (<var>x</var>, <var>y</var>)
+ <li><p>Return (<var>x</var> + <var>parentOffsetLeft</var>, <var>y</var> + <var>parentOffsetTop</var>)


should the offset be applied to all origins or only the viewport? I think the origin pointer might already be in the top level context coordinates and not sure if the element origin does some adjustments already.

I would expect the origin "pointer" to be relative to "context" param of "PerformActionsParameters" of the command.

That does not seem to follow from the spec text since input sources are per top-level browsing context.

OrKoN · 2024-10-23T19:05:57Z

index.html

+  </li>
+  <li>
+   Let <var>navigable</var> be <var>context</var>'s <a>active document</a>'s
+   [=navigable/parent=].


[=navigable/parent=] is a property of a navigable and not the document. Did you mean to get the current navigable instead of a parent (e.g., https://html.spec.whatwg.org/#node-navigable)?

Co-authored-by: Alex Rudenko <[email protected]>

jgraham

I agree that in principle this is the model we need to change to in order to work with modern browser designs.

However I think the change as written is insufficient, and makes the spec contradictory. In each action we still claim we're dispatching events to |context| which, afaict after these changes, is not the parent context but the actual child context. The spec is unfortunately rather badly written here, so the text you've modified is apparently normative, but exists in what is essentially an informative section describing the overall model.

I also suggest (perhaps as a followup) that we make it possible for specific actions to provide the context that is used for calculating origins (which must in all cases be an ancestor of the top-level command context). That makes it much easier to construct action chains that interact with both the top level document and iframes.

whimboo · 2024-12-11T16:16:44Z

We should as well clarify what should happen if a frame gets closed by an action. Following actions in the same chain should still be dispatched even through they will reach some other document and elements? See the following wpt test as example: https://github.com/web-platform-tests/wpt/blob/master/input-events/input-events-spin-button-click-on-number-input-delete-document.html

css-meeting-bot · 2024-12-11T17:34:38Z

The Browser Testing and Tools Working Group just discussed Input events dispatch to top-level frame.

The full IRC log of that discussion

<AutomatedTester> Topic: Input events dispatch to top-level frame
<AutomatedTester> github: https://github.com//pull/1847
<jgraham> q+
<AutomatedTester> sadym: There is already a discussion that is in the PR. I am a bit stuck with which approach we should be doing here
<AutomatedTester> ack next
<AutomatedTester> jgraham: At the moment the spec say we pick and iframe and send events to/from that
<AutomatedTester> ... but we may want other data from all frames. e.g. if an overlay is over an iframe and click. We want to have the envet fro the overlay and then the Iframe
<AutomatedTester> ... and there might also be a case with what happens when the iframe disappears
<AutomatedTester> ... we should handle cancelling when frame goes and stop the propagation from the frame but items can still go that way
<AutomatedTester> ... in the future we should probably have a way doing calculations based off the iframe
<AutomatedTester> sadym: my first question: do we want to specify the calc the coords or dispatch to
<jgraham> q+
<AutomatedTester> ... and do that on the <missed what was said>
<orkon> q+
<AutomatedTester> ... and then do the calculations more precise and then do htat to the top level
<AutomatedTester> ack jgraham
<AutomatedTester> jgraham: yes... we need to work with how browsers actually work and then do that from the top/parent and let that go down to the correct place
<AutomatedTester> ... I feel like we agree on the model here
<AutomatedTester> ... the main issue is what happens when the iframe disappears
<AutomatedTester> ... we can either keep going or can fail
<whimboo> q+
<AutomatedTester> q+
<AutomatedTester> ack next
<AutomatedTester> orkon: I think the issue if the iframe disappears. I thought that was solved with the PR from whimboo .
<AutomatedTester> ... I think that if the the iframe disappears we should still continue sending the actions
<AutomatedTester> ... e.g. mouse down removes the iframe we should continue
<jgraham> q+
<AutomatedTester> ... back to sadym if we change to to the top level then the calculations could be a lot harder to do where the current way is already working
<AutomatedTester> ack next
<AutomatedTester> whimboo: a follow to the PR, I haven't done this
<orkon> PR I meant https://github.com//pull/1861
<AutomatedTester> ... I wanted to give a comment to jgraham if we have to continue then it might be good to handle both case (carry on and error)
<AutomatedTester> ... and we would have a default that is managed by an argument
<AutomatedTester> ack next
<tidoust> AutomatedTester: Initially, actions were "do as I say", not "do what I mean". actions.mousedown would assume that the element would be in the viewport. Little things like that. Actions should be above the glass. You would just be telling the coordinates and do the action. But you don't necessarily know what's underneath. If I do element.click,
<tidoust> behavior is different.
<tidoust> i/AutomatedTester:/scribe+ tidoust
<tidoust> AutomatedTester: For iframes, behavior has indeed always be different.
<AutomatedTester> ack next
<orkon> q+
<orkon> q-
<AutomatedTester> jgraham: Do we need to be precise in the spec? Yes definitely
<AutomatedTester> ... I know that there are parts that say browser specific but coordiniates is different and we all have the same on that
<AutomatedTester> ... my proposal for clients would send details from the top level traversible
<AutomatedTester> ... but clients could send them at iframe if they want but then handle the situation if it disappears
<orkon> q+
<AutomatedTester> ... I think we need to follow this up in the issues
<AutomatedTester> ack next
<AutomatedTester> orkon: I agree with the error if it doesn't still exist
<AutomatedTester> ... we could do the calculations at the beginning
<AutomatedTester> jgraham: I don't think we can beause if we have scroll then all the coords are out
<AutomatedTester> q?

Input events dispatch to top-level frame

d140333

OrKoN reviewed Oct 23, 2024

View reviewed changes

index.html Outdated Show resolved Hide resolved

top-browsing context

e50676d

OrKoN reviewed Oct 23, 2024

View reviewed changes

index.html Outdated Show resolved Hide resolved

change reference

ab33a82

OrKoN approved these changes Oct 23, 2024

View reviewed changes

sadym-chromium requested a review from whimboo October 23, 2024 09:29

whimboo mentioned this pull request Oct 23, 2024

Clarify "dispatch action chain" behavior when executed within a frame #1840

Open

Introduce parent offset

eda93b1

add todo

403bdbb

sadym-chromium requested a review from OrKoN October 23, 2024 13:38

sadym-chromium commented Oct 23, 2024

View reviewed changes

fix check

88e7b6b

sadym-chromium force-pushed the sadym/input-iframe branch from d1e3343 to 88e7b6b Compare October 23, 2024 14:01

OrKoN reviewed Oct 23, 2024

View reviewed changes

sadym-chromium and others added 2 commits October 24, 2024 10:08

Update index.html

adf94ad

Co-authored-by: Alex Rudenko <[email protected]>

Update index.html

6cdef65

Co-authored-by: Alex Rudenko <[email protected]>

sadym-chromium mentioned this pull request Oct 24, 2024

Input events dispatch via top-level browsing context #1850

Draft

jgraham mentioned this pull request Nov 28, 2024

Raise "no such window" error in "dispatching actions" when browsing context (navigable) does no longer exist #1861

Merged

jgraham requested changes Nov 28, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Input events dispatch to top-level frame #1847

Input events dispatch to top-level frame #1847

sadym-chromium commented Oct 22, 2024 •

edited by pr-preview bot

Loading

sadym-chromium commented Oct 23, 2024

OrKoN commented Oct 23, 2024 •

edited

Loading

whimboo commented Oct 23, 2024

sadym-chromium commented Oct 23, 2024

sadym-chromium commented Oct 23, 2024

sadym-chromium Oct 23, 2024 •

edited

Loading

sadym-chromium Oct 24, 2024

sadym-chromium Oct 24, 2024 •

edited

Loading

whimboo commented Oct 23, 2024

OrKoN commented Oct 23, 2024

OrKoN commented Oct 23, 2024

whimboo commented Oct 23, 2024

OrKoN Oct 23, 2024

sadym-chromium Oct 23, 2024

OrKoN Oct 23, 2024

sadym-chromium Oct 24, 2024 •

edited

Loading

OrKoN Oct 24, 2024

OrKoN Oct 23, 2024

jgraham left a comment

whimboo commented Dec 11, 2024 •

edited

Loading

css-meeting-bot commented Dec 11, 2024

		such that trusted events corresponding to the entries in
		<var>list of events</var> are dispatched.

Input events dispatch to top-level frame #1847

Are you sure you want to change the base?

Input events dispatch to top-level frame #1847

Conversation

sadym-chromium commented Oct 22, 2024 • edited by pr-preview bot Loading

sadym-chromium commented Oct 23, 2024

OrKoN commented Oct 23, 2024 • edited Loading

whimboo commented Oct 23, 2024

sadym-chromium commented Oct 23, 2024

sadym-chromium commented Oct 23, 2024

sadym-chromium Oct 23, 2024 • edited Loading

Choose a reason for hiding this comment

sadym-chromium Oct 24, 2024

Choose a reason for hiding this comment

sadym-chromium Oct 24, 2024 • edited Loading

Choose a reason for hiding this comment

whimboo commented Oct 23, 2024

OrKoN commented Oct 23, 2024

OrKoN commented Oct 23, 2024

whimboo commented Oct 23, 2024

OrKoN Oct 23, 2024

Choose a reason for hiding this comment

sadym-chromium Oct 23, 2024

Choose a reason for hiding this comment

OrKoN Oct 23, 2024

Choose a reason for hiding this comment

sadym-chromium Oct 24, 2024 • edited Loading

Choose a reason for hiding this comment

OrKoN Oct 24, 2024

Choose a reason for hiding this comment

OrKoN Oct 23, 2024

Choose a reason for hiding this comment

jgraham left a comment

Choose a reason for hiding this comment

whimboo commented Dec 11, 2024 • edited Loading

css-meeting-bot commented Dec 11, 2024

sadym-chromium commented Oct 22, 2024 •

edited by pr-preview bot

Loading

OrKoN commented Oct 23, 2024 •

edited

Loading

sadym-chromium Oct 23, 2024 •

edited

Loading

sadym-chromium Oct 24, 2024 •

edited

Loading

sadym-chromium Oct 24, 2024 •

edited

Loading

whimboo commented Dec 11, 2024 •

edited

Loading