Overview

UI actions are one of Gbox’s core features. They let AI agents operate a device the way a person would, by interacting directly with what is on the screen.

Through simple API calls, you can perform various operations including:

  • Clicking on specific coordinates or elements
  • Scrolling through content
  • Typing text into input fields
  • Dragging and other gesture-based interactions
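
As a rough illustration of how an agent might represent the operation types listed above before sending them to a box, the sketch below models them as a TypeScript discriminated union and renders each one as a log line. Only the click shape (x and y) appears in this page; the `UiAction` type, the `describeAction` helper, and the scroll, type, and drag field names are hypothetical and are not the Gbox API. Check the UI Action API reference for the real parameters.

```typescript
// Hypothetical action shapes. Only "click" with x/y is taken from this page;
// every other field name here is an illustrative assumption.
type Point = { x: number; y: number };

type UiAction =
  | { kind: "click"; x: number; y: number }
  | { kind: "scroll"; direction: "up" | "down"; distance: number }
  | { kind: "type"; text: string }
  | { kind: "drag"; from: Point; to: Point };

// Render an action as a short, human-readable log line.
function describeAction(action: UiAction): string {
  switch (action.kind) {
    case "click":
      return `click at (${action.x}, ${action.y})`;
    case "scroll":
      return `scroll ${action.direction} by ${action.distance}`;
    case "type":
      return `type "${action.text}"`;
    case "drag":
      return `drag from (${action.from.x}, ${action.from.y}) to (${action.to.x}, ${action.to.y})`;
  }
}

// A small plan covering each operation type once.
const plan: UiAction[] = [
  { kind: "click", x: 100, y: 100 },
  { kind: "scroll", direction: "down", distance: 300 },
  { kind: "type", text: "hello" },
  { kind: "drag", from: { x: 10, y: 20 }, to: { x: 200, y: 400 } },
];

const planLog: string[] = plan.map(describeAction);
console.log(planLog.join("\n"));
```

Keeping the plan as plain data makes it easy to log, validate, or replay a sequence before any call reaches a real device.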

Quick Start Example

The following basic example creates an Android box, clicks at a fixed coordinate, and then saves a screenshot:

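import GboxSDK from "gbox-sdk";

// Create the SDK client; the API key is read from the environment
const gboxSDK = new GboxSDK({
  apiKey: process.env["GBOX_API_KEY"], // This is the default and can be omitted
});

async function main() {
  // Create a new Android box to run UI actions against
  const box = await gboxSDK.create({ type: "android" });

  // Click at screen coordinates (100, 100)
  await box.action.click({
    x: 100,
    y: 100,
  });

  // Save a screenshot to screenshot.png
  await box.action.screenshot({
    path: "screenshot.png",
  });
}

// Report failures instead of leaving the promise rejection unhandled
main().catch((error) => {
  console.error(error);
  process.exit(1);
});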
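
If you want to exercise the click-then-screenshot flow without an API key or a running box, one option is to code against a minimal interface and pass in a recording stand-in. The sketch below assumes only the two call shapes used in the example above (`click({ x, y })` and `screenshot({ path })`). The `ClickAndScreenshot` interface, `clickThenCapture`, and `createRecordingActions` are hypothetical helpers, not part of gbox-sdk.

```typescript
// Minimal shape of the two calls used in the quick start (an assumption;
// a real box exposes more operations and may return richer results).
interface ClickAndScreenshot {
  click(args: { x: number; y: number }): Promise<unknown>;
  screenshot(args: { path: string }): Promise<unknown>;
}

// Click at (x, y), then save a screenshot to the given path.
async function clickThenCapture(
  actions: ClickAndScreenshot,
  x: number,
  y: number,
  path: string,
): Promise<void> {
  await actions.click({ x, y });
  await actions.screenshot({ path });
}

// Hypothetical in-memory stand-in that records calls instead of touching a device.
function createRecordingActions(): { actions: ClickAndScreenshot; calls: string[] } {
  const calls: string[] = [];
  const actions: ClickAndScreenshot = {
    async click({ x, y }) {
      calls.push(`click ${x},${y}`);
    },
    async screenshot({ path }) {
      calls.push(`screenshot ${path}`);
    },
  };
  return { actions, calls };
}
```

With a real box, the same helper could presumably be called as `clickThenCapture(box.action, 100, 100, "screenshot.png")`, provided `box.action` is structurally compatible with the interface above.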

More UI Action Capabilities

Explore the complete UI Action API reference to discover all available operations and features.