Machine learning, in your browser

Models that run on your laptop, not ours.

Three demos. An image classifier, an object detector, and a sentiment analyser. Each one downloads its model the first time you press a button, then runs every inference locally in WebAssembly or WebGPU. No data leaves your device.

Privacy by construction. Models are quantised ONNX, served from a public CDN. First use downloads between 15 and 170 megabytes per model, then cached. Subsequent runs take a few hundred milliseconds.

Open the demos

Choose a model.

Three tabs, three small models, three jobs. Pick one and follow the inline instructions.

Image classification

Xenova/vit-base-patch16-224  ·  about 85 MB  ·  ImageNet 1000 classes

First use of this tab will download about 85 MB to your browser cache. After that, classification takes well under a second per image.

or

Try a sample:

Your image preview will appear here.

Ready. Pick an image to begin.

    How this works.

    The model

    Each tab uses a quantised ONNX model from the Xenova catalogue. ONNX is the open standard format for cross runtime neural networks. Quantisation shrinks the weights to 8 bit integers, which makes them small enough to ship over a CDN and fast enough to run on a laptop.

    The runtime

    The transformers.js library runs the model in WebAssembly by default, or WebGPU if your browser supports it. The first inference of a session warms up the runtime. Subsequent inferences are an order of magnitude faster.

    The privacy story

    Your image or text never leaves your machine. The model weights travel one way, from the public CDN to your browser cache. After the first use of a tab, the model can run offline. No telemetry, no analytics, no server logs of your input.