What's different from a web page?

WebUIs are granted super powers so that they can manage Chrome itself. For example, it'd be very hard to implement the Settings UI without access to many different privacy and security sensitive services. Access to these services are not granted by default.

Only special URLs are granted WebUI “bindings” via the child security process.

How chrome: URLs work

A chrome: URL loads a file from disk, memory, or can respond dynamically.

Because Chrome UIs generally need access to the browser (not just the current tab), much of the C++ that handles requests or takes actions lives in the browser process. The browser has many more privileges than a renderer (which is sandboxed and doesn't have file access), so access is only granted for certain URLs.

chrome: protocol

Chrome recognizes a list of special protocols, which it registers while starting up.

Examples:

chrome-devtools:

chrome-extensions:

chrome:

file:

view-source:

This document mainly cares about the chrome: protocol, but others can also be granted WebUI bindings or have special properties.

chrome: hosts

After registering the chrome: protocol, a set of factories are created. These factories contain a list of valid host names. A valid hostname generates a controller.

In the case of chrome: URLs, these factories are registered early in the browser process lifecycle.

C++ classes

WebUI

WebUI is a high-level class and pretty much all HTML-based Chrome UIs have one. WebUI lives in the browser process, and is owned by a RenderFrameHost. WebUIs have a concrete implementation (WebUIImpl) in content/ and are created in response to navigation events.

A WebUI knows very little about the page it's showing, and it owns a WebUIController that is set after creation based on the hostname of a requested URL.

A WebUIcan handle messages itself, but often defers these duties to separate WebUIMessageHandlers, which are generally designed for handling messages on certain topics.

A WebUI can be created speculatively, and are generally fairly lightweight. Heavier duty stuff like hard initialization logic or accessing services that may have side effects are more commonly done in a WebUIController or WebUIMessageHandlers.

WebUI are created synchronously on the UI thread in response to a URL request, and are re-used where possible between navigations (i.e. refreshing a page). Because they run in a separate process and can exist before a corresponding renderer process has been created, special care is required to communicate with the renderer if reliable message passing is required.

WebUIController

A WebUIController is the brains of the operation, and is responsible for application-specific logic, setting up translations and resources, creating message handlers, and potentially responding to requests dynamically. In complex pages, logic is often split across multiple WebUIMessageHandlers instead of solely in the controller for organizational benefits.

A WebUIController is owned by a WebUI, and is created and set on an existing WebUI when the correct one is determined via URL inspection (i.e. chrome://settings creates a generic WebUI with a settings-specific WebUIController).

WebUIDataSource

WebUIMessageHandler

Because some pages have many messages or share code that sends messages, message handling is often split into discrete classes called WebUIMessageHandlers. These handlers respond to specific invocations from JavaScript.

Browser (C++) → Renderer (JS)

WebUIMessageHandler::AllowJavascript()

A tab that has been used for settings UI may be reloaded, or may navigate to an external origin. In both cases, one does not want callbacks from C++ to Javascript to run. In the former case, the callbacks will occur when the Javascript doesn't expect them. In the latter case, sensitive information may be delivered to an untrusted origin.

Therefore each message handler maintains a boolean that describes whether delivering callbacks to Javascript is currently appropriate. This boolean is set by calling AllowJavascript, which should be done when handling a call from Javascript, because that indicates that the page is ready for the subsequent callback. (See design doc.) If the tab navigates or reloads, DisallowJavascript is called to clear the flag.

Also beware of ABA issues: Consider the case where an asynchronous operation is started, the settings page is reloaded, and the user triggers another operation using the original message handler. The javascript_allowed_ boolean will be true, but the original callback should still be dropped because it relates to a operation that was discarded by the reload. (Reloading settings UI does not cause message handler objects to be deleted.)

Thus a message handler may override OnJavascriptDisallowed to learn when pending callbacks should be canceled.

and sent to the renderer via a FrameMsg_JavaScriptExecuteRequest IPC message.

While this works, it implies that:

a global method must exist to successfully run the Javascript request

any method can be called with any parameter (far more access than required in practice)

^ These factors have resulted in less use of CallJavascriptFunction() in the webui codebase. This functionality can easily be accomplished with the following alternatives:

FireWebUIListener() allows easily notifying the page when an event occurs in C++ and is more loosely coupled (nothing blows up if the event dispatch is ignored). JS subscribes to notifications via cr.addWebUIListener.

WebUIMessageHandler::FireWebUIListener()

FireWebUIListener() is used to notify a registered set of listeners that an event has occurred. This is generally used for events that are not guaranteed to happen in timely manner, or may be caused to happen by unpredictable events (i.e. user actions).

Here‘s some example to detect a change to Chrome’s theme:

cr.addWebUIListener("theme-changed", refreshThemeStyles);

This Javascript event listener can be triggered in C++ via:

voidMyHandler::OnThemeChanged(){FireWebUIListener("theme-changed");}

Because it‘s not clear when a user might want to change their theme nor what theme they’ll choose, this is a good candidate for an event listener.

WebUIMessageHandler::OnJavascriptAllowed()

OnJavascriptDisallowed() is a lifecycle method called in response to AllowJavascript(). It is a good place to register observers of global services or other callbacks that might call at unpredictable times.

Because browser-side C++ handlers are created before a renderer is ready, the above code may result in calling FireWebUIListener before the renderer is ready, which may result in dropped updates or accidentally running Javascript in a renderer that has navigated to a new URL.

when a renderer has been created and the document has loaded enough to signal to the C++ that it's ready to respond to messages.

WebUIMessageHandler::OnJavascriptDisallowed()

OnJavascriptDisallowed is a lifecycle method called when it‘s unclear whether it’s safe to send JavaScript messsages to the renderer.

There's a number of situations that result in this method being called:

renderer doesn't exist yet

renderer exists but isn't ready

renderer is ready but application-specifici JS isn't ready yet

tab refresh

renderer crash

Though it‘s possible to programmatically disable Javascript, it’s uncommon to need to do so.

Because there‘s no single strategy that works for all cases of a renderer’s state (i.e. queueing vs dropping messages), these lifecycle methods were introduced so a WebUI application can implement these decisions itself.

Often, it makes sense to disconnect from observers in OnJavascriptDisallowed():

Because OnJavascriptDisallowed() is not guaranteed to be called before a WebUIMessageHandler's destructor, it is often advisable to use some form of scoped observer that automatically unsubscribes on destruction but can also imperatively unsubscribe in OnJavascriptDisallowed().

WebUIMessageHandler::ResolveJavascriptCallback()

This method is called in response to cr.sendWithPromise() to fulfill an issued Promise, often with a value. This results in runnings any fulfillment (first) callbacks in the associate Promise executor and any registered then() callbacks.

WebUI listeners are a convenient way for C++ to inform JavaScript of events.

Older WebUI code exposed public methods for event notification, similar to how responses to chrome.send() used to work. They both resulted in global namespace polution, but it was additionally hard to stop listening for events in some cases. cr.addWebUIListener is preferred in new code.

Adding WebUI listeners creates and inserts a unique ID into a map in JavaScript, just like cr.sendWithPromise().

cr.sendWithPromise()

cr.sendWithPromise() is a wrapper around chrome.send(). It's used when triggering a message requires a response:

chrome.send('getNumberOfDonuts');// No easy way to get response!

In older WebUI pages, global methods were exposed simply so responses could be sent. This is discouraged as it pollutes the global namespace and is harder to make request specific or do from deeply nested code.

On the C++ side, the message registration is similar to chrome.send() except that the first argument in the message handler's list is a callback ID. That ID is passed to ResolveJavascriptCallback(), which ends up resolving the Promise in JavaScript and calling the then() function.

This approach still relies on the C++ calling a globally exposed method, but reduces the surface to only a single global (cr.webUIResponse) instead of many. It also makes per-request responses easier, which is helpful when multiple are in flight.

Security considerations

Because WebUI pages are highly privileged, they are often targets for attack, since taking control of a WebUI page can sometimes be sufficient to escape Chrome's sandbox. To make sure that the special powers granted to WebUI pages are safe, WebUI pages are restricted in what they can do:

WebUI pages cannot embed http/https resources or frames

WebUI pages cannot issue http/https fetches

In the rare case that a WebUI page really needs to include web content, the safe way to do this is by using a <webview> tag. Using a <webview> tag is more secure than using an iframe for multiple reasons, even if Site Isolation and out-of-process iframes keep the web content out of the privileged WebUI process.

First, the content inside the <webview> tag has a much reduced attack surface, since it does not have a window reference to its embedder or any other frames. Only postMessage channel is supported, and this needs to be initiated by the embedder, not the guest.

Second, the content inside the <webview> tag is hosted in a separate StoragePartition. Thus, cookies and other persistent storage for both the WebUI page and other browser tabs are inaccessible to it.

This greater level of isolation makes it safer to load possibly untrustworthy or compromised web content, reducing the risk of sandbox escapes.