Galata*
Galata is a set of helpers and fixtures for JupyterLab UI Testing using Playwright Test Runner that provides:
Getting Started
Installation
Add Galata to your project:
jlpm add -D @jupyterlab/galata
jlpm playwright install
Create a Playwright configuration file playwright.config.js
containing:
module.exports = require('@jupyterlab/galata/lib/playwright-config');
First test
Create ui-tests/foo.spec.ts
to define your test.
import { expect, test } from '@jupyterlab/galata';
test.describe('Notebook Tests', () => {
test('Create New Notebook', async ({ page, tmpPath }) => {
const fileName = 'create_test.ipynb';
await page.notebook.createNew(fileName);
expect(
await page.waitForSelector(`[role="main"] >> text=${fileName}`)
).toBeTruthy();
expect(await page.contents.fileExists(`${tmpPath}/${fileName}`)).toEqual(
true
);
});
});
This will create a notebook, open it and check it exists.
Launch JupyterLab
Before running the test, you will need to launch the JupyterLab server with some
specific options.
Create jupyter_server_test_config.py
with the following content.
from jupyterlab.galata import configure_jupyter_server
configure_jupyter_server(c)
Then start the server with:
jupyter lab --config jupyter_server_test_config.py
If you need to customize the set up for galata, you can look at the configure_jupyter_server
definition.
Run test project
jlpm playwright test
Galata should generate console output similar to following
Using config at .../playwright.config.js
Running 1 test using 1 worker
✓ ui-tests/foo.spec.ts:5:3 › Notebook Tests Create New Notebook (13s)
1 passed (15s)
Playwright Test just ran a test using Chromium browser, in a headless manner. You can use headed browser to see what is going on during the test:
jlpm playwright test --headed
Test assets (including test videos) will be saved in a test-results
folder and by default a HTML
report will be created in playwright-report
folder. That report can be see by running:
jlpm playwright show-report
User advices
Create tests
To create tests, the easiest way is to use the code generator tool of playwright:
jupyter lab --config jupyter_server_test_config.py &
jlpm playwright codegen localhost:8888
Debug tests
To debug tests, a good way is to use the inspector tool of playwright:
jupyter lab --config jupyter_server_test_config.py &
jlpm playwright test --debug
Or the UI mode:
jupyter lab --config jupyter_server_test_config.py &
jlpm playwright test --ui
Dealing with login
If you have set up a custom login handler for your Jupyter application and don't want to remove it
for your integration tests, you can try the following configuration (inspired by the
Playwright documentation):
- Create a file named
global-setup.ts
at the root of the test folder containing the login steps:
import { chromium, FullConfig } from '@playwright/test';
async function globalSetup(config: FullConfig) {
const { baseURL, storageState } = config.projects[0].use;
const browser = await chromium.launch();
const page = await browser.newPage();
await page.goto(baseURL ?? process.env.TARGET_URL ?? 'http://localhost:8888');
await page.locator('input[name="password"]').fill('test');
await page.locator('text=Log in').click();
await page.context().storageState({ path: storageState as string });
await browser.close();
}
export default globalSetup;
- Modify the Playwright configuration file to use that global setup and the stored state:
var baseConfig = require('@jupyterlab/galata/lib/playwright-config');
module.exports = {
...baseConfig,
globalSetup: require.resolve('./global-setup'),
use: {
...baseConfig.use,
storageState: 'storageState.json'
}
};
When you will start your test, a file named storageStage.json
will be generated if the log in
steps were successful. Its content will look like that:
{
"cookies": [
{
"name": "_xsrf",
"value": "...REDACTED...",
"domain": "localhost",
"path": "/",
"expires": -1,
"httpOnly": false,
"secure": false,
"sameSite": "Lax"
},
{
"name": "username-localhost-8888",
"value": "...REDACTED...",
"domain": "localhost",
"path": "/",
"expires": 1664121119.118241,
"httpOnly": true,
"secure": false,
"sameSite": "Lax"
}
],
"origins": []
}
This will only work if the authentication is stored in a cookie and you can access the Jupyter
app directly when that cookie is set.
Helpers
Listen to dialogs
You can add a listener that will be triggered when a JupyterLab dialog is shown:
await page.evaluate(() => {
window.galata.on('dialog', (dialog: Dialog<unknown> | null) => {
});
});
The listener will be called when a dialog is started and when it is closed (in that case dialog == null
).
You can stop listening to the event with:
await page.evaluate(() => {
window.galata.off('dialog', listener);
});
Or you can listen to a single event with:
await page.evaluate(() => {
window.galata.once('dialog', listener);
});
Listen to notification
You can add a listener that will be triggered when a JupyterLab dialog is shown:
await page.evaluate(() => {
window.galata.on(
'notification',
(notification: Notification.INotification) => {
}
);
});
The listener will be called when a notification is created or updated.
You can stop listening to the event with:
await page.evaluate(() => {
window.galata.off('notification', listener);
});
Or you can listen to a single event with:
await page.evaluate(() => {
window.galata.once('notification', listener);
});
Fixtures
Here are the new test fixture introduced by Galata on top of Playwright fixtures.
baseURL
Application base URL without /lab
. It defaults to environment variable TARGET_URL
or http://localhost:8888
if nothing
is defined.
appPath
Application URL path fragment; default "/lab"
autoGoto
Whether to go to JupyterLab page within the fixture or not; default true
.
If set to false
, it allows you to add route mock before loading JupyterLab.
Example:
test.use({ autoGoto: false });
test('Open language menu', async ({ page }) => {
await page.route(/.*\/api\/translation.*/, (route, request) => {
if (request.method() === 'GET') {
return route.fulfill({
status: 200,
body: '{"data": {"en": {"displayName": "English", "nativeName": "English"}}, "message": ""}'
});
} else {
return route.continue();
}
});
await page.goto();
});
serverFiles
- type: <'on' | 'off' | 'only-on-failure'>
Galata can keep the uploaded and created files in tmpPath
on
the server root for debugging purpose. By default the files are kept
on failure.
- 'off' -
tmpPath
is deleted after each tests - 'on' -
tmpPath
is never deleted - 'only-on-failure' -
tmpPath
is deleted except if a test failed or timed out.
mockState
- type: < boolean | Record<string, unknown> >
Mock JupyterLab state in-memory or not.
Possible values are:
- true (default): JupyterLab state will be mocked on a per test basis
- false: JupyterLab state won't be mocked (Be careful it will write state in local files)
- Record<string, unknown>: Initial JupyterLab data state - Mapping (state key, value).
By default the state is stored in-memory.
Example:
test.use({
mockState: {
'layout-restorer:data': {
main: {
dock: {
type: 'tab-area',
currentIndex: 0,
widgets: []
}
},
down: {
size: 0,
widgets: []
},
left: {
collapsed: false,
visible: true,
current: 'running-sessions',
widgets: [
'filebrowser',
'jp-property-inspector',
'running-sessions',
'@jupyterlab/toc:plugin',
'debugger-sidebar',
'extensionmanager.main-view'
]
},
right: {
collapsed: true,
visible: true,
widgets: []
},
relativeSizes: [0.4, 0.6, 0]
}
} as any
});
test('should return the mocked state', async ({ page }) => {
expect(
await page.waitForSelector(
'[aria-label="Running Sessions section"] >> text=Open Tabs'
)
).toBeTruthy();
});
mockSettings
- type: < boolean | Record<string, unknown> >
Mock JupyterLab settings in-memory or not.
Possible values are:
-
true: JupyterLab settings will be mocked on a per test basis
-
false: JupyterLab settings won't be mocked (Be careful it will read & write settings local files)
-
Record<string, unknown>: Mapping {pluginId: settings} that will be default user settings
The default value is galata.DEFAULT_SETTINGS
By default the settings are stored in-memory. However the
they are still initialized with the hard drive values.
Example:
test.use({
mockSettings: {
...galata.DEFAULT_SETTINGS,
'@jupyterlab/apputils-extension:themes': {
theme: 'JupyterLab Dark'
}
}
});
test('should return mocked settings', async ({ page }) => {
expect(await page.theme.getTheme()).toEqual('JupyterLab Dark');
});
mockUser
- type: boolean | Partial<User.IUser>
Mock JupyterLab user in-memory or not.
Possible values are:
- true (default): JupyterLab user will be mocked on a per test basis
- false: JupyterLab user won't be mocked (It will be a random user so snapshots won't match)
- Record<string, unknown>: Initial JupyterLab user - Mapping (user attribute, value).
By default the user is stored in-memory.
kernels
- type: <Map<string, Kernel.IModel> | null>
Kernels created during the test.
Possible values are:
- null: The kernels API won't be mocked
- Map<string, Kernel.IModel>: The kernels created during a test.
By default the kernels created during a test will be tracked and disposed at the end.
Example:
test('should return the active kernels', async ({ page, kernels }) => {
await page.notebook.createNew();
await page.waitForResponse(
async response =>
response.url().includes('api/kernels') &&
response.request().method() === 'GET' &&
((await response.json()) as any[]).length === 1
);
expect(kernels.size).toEqual(1);
});
sessions
- type: <Map<string, Session.IModel> | null>
Sessions created during the test.
Possible values are:
- null: The sessions API won't be mocked
- Map<string, Session.IModel>: The sessions created during a test.
By default the sessions created during a test will be tracked and disposed at the end.
Example:
test('should return the active sessions', async ({ page, sessions }) => {
await page.notebook.createNew();
await page.waitForResponse(
async response =>
response.url().includes('api/sessions') &&
response.request().method() === 'GET' &&
((await response.json()) as any[]).length === 1
);
expect(sessions.size).toEqual(1);
});
terminals
- type: < Map<string, TerminalAPI.IModel> | null >
Terminals created during the test.
Possible values are:
- null: The Terminals API won't be mocked
- Map<string, TerminalsAPI.IModel>: The Terminals created during a test.
By default the Terminals created during a test will be tracked and disposed at the end.
Example:
test('should return the active terminals', async ({ page, terminals }) => {
await Promise.all([
page.waitForResponse(
response =>
response.request().method() === 'POST' &&
response.url().includes('api/terminals')
),
page.menu.clickMenuItem('File>New>Terminal')
]);
await page.waitForResponse(
async response =>
response.url().includes('api/terminals') &&
response.request().method() === 'GET' &&
((await response.json()) as any[]).length === 1
);
expect(terminals.size).toEqual(1);
});
tmpPath
Unique test temporary path created on the server. Required if uploading files in beforeAll()
as otherwise the files would not be accessible from consecutive tests because by default tmpPath
has a random component added for each test.
Note: if you override this string, you will need to take care of creating the
folder and cleaning it.
Example:
test.use({ tmpPath: 'test-toc' });
test.describe.serial('Table of Contents', () => {
test.beforeAll(async ({ request, tmpPath }) => {
const contents = galata.newContentsHelper(request);
await contents.uploadFile(
path.resolve(__dirname, `./notebooks/${fileName}`),
`${tmpPath}/${fileName}`
);
});
test.afterAll(async ({ request, tmpPath }) => {
const contents = galata.newContentsHelper(request);
await contents.deleteDirectory(tmpPath);
});
});
Benchmark
Benchmark of JupyterLab is done using Playwright. The actions measured are:
- Opening a file
- Switching from the file to a simple text file
- Switching back to the file
- Closing the file
Two files are tested: a notebook with many code cells and another with many markdown cells.
The test is run on the CI by comparing the result in the commit at which a PR branch started and the PR branch head on
the same CI job to ensure using the same hardware.
The benchmark job is triggered on:
- Approved PR review
- PR review that contains the sentence
please run benchmark
The tests are located in the subfolder test/benchmark. And they can be
executed with the following command:
jlpm run test:benchmark
A special report will be generated in the folder benchmark-results
that will contain 4 files:
lab-benchmark.json
: The execution time of the tests and some metadata.lab-benchmark.md
: A report in Markdownlab-benchmark.svg
: A comparison of execution time distributionlab-benchmark.vl.json
: The Vega-Lite description used to produce the figure.
The reference, tagged expected, is stored in lab-benchmark-expected.json
. It can be
created using the -u
option of Playwright; i.e. jlpm run test:benchmark -u
.
Benchmark parameters
The benchmark can be customized using the following environment variables:
BENCHMARK_NUMBER_SAMPLES
: Number of samples to compute the execution time distribution; default 20.BENCHMARK_OUTPUTFILE
: Benchmark result output file; default benchmark.json
. It is overridden in the playwright-benchmark.config.js
.BENCHMARK_REFERENCE
: Reference name of the data; default is actual
.BENCHMARK_EXPECTED_REFERENCE
: Reference name of the reference data; default is expected
.
Development
Build
Install dependencies and build
cd galata
jlpm
jlpm run build
For tests to be run, a JupyterLab instance must be up and running. Launch it without credentials. Tests expect to connect JupyterLab from localhost:8888
by default. If a different URL is to be used, it can be specified by defining TARGET_URL
environment variable or setting the Playwright baseURL
fixture.
jlpm run start
The JupyterLab root directory is randomly generated in the temporary folder (prefixed with galata-test-).
Running tests
Tests are grouped in two projects: galata
and jupyterlab
. The first one is testing Galata helpers and fixtures when the other one is running all tests for Jupyterlab.
By default, both projects will be executed when running jlpm run test
. But you can select one project with the CLI option --project <project-id>
.
Configuration
Galata can be configured by using command line arguments or using playwright.config.js
file. Full list of config options can be accessed using jlpm playwright test --help
.
Custom benchmark report
By default, Galata will generate a text report in the form of markdown
table and a Vega-Lite graph of execution time distribution. Users can customize these reports in two ways:
- Using
playwright.config.js
file: in reporter
section, users can supply two functions vegaLiteConfigFactory
and textReportFactory
to the reporter's constructor options. These functions will be used to create Vega-Lite configuration (vegaLiteConfigFactory
) or to create a text report (textReportFactory
) from test records.
reporter: [
...,
[
'@jupyterlab/galata/lib/benchmarkReporter',
{ outputFile: 'lab-benchmark.json',
vegaLiteConfigFactory: (
allData: Array<IReportRecord>,
comparison?: 'snapshot' | 'project'
) => {
return {};
}
textReportFactory: (
allData: Array<IReportRecord>,
comparison?: 'snapshot' | 'project'
) => {
return Promise.resolve(['My report content', 'md']);
}
}
],
...
]
- The second way to customize the reports is to override the default text report factory (
defaultTextReportFactory
) and Vega-Lite graph config factory (defaultVegaLiteConfigFactory
) of BenchmarkReporter
class in a sub-class and then use it as a reporter in playwright.config.js
file.
Reference Image Captures
Reference image are saved next to test files in <test-file-name>-snapshots
folders. If a reference screenshots does not exist, it will be generated at the first execution
of a test. You can also update them by running jlpm playwright test --update-snapshots
.
About Galata Name
Galata framework is named after Galata Tower in Istanbul. Centuries ago, Galata Tower was used to spot fires in the city. Tower was also used as astronomical observatory in the past.
Acknowledgement
Development of this project began under Bloomberg organization by Mehmet Bektas, then it was transferred to JupyterLab organization. We gratefully acknowledge Bloomberg for the generous contribution and supporting open-source software community.