JavaScript
Model
Generate Coordinates
POST
JavaScript
Generate precise UI element coordinates using the gbox-handy-1 model. This specialized model analyzes screenshots and instructions to identify exact coordinates for UI operations.Documentation Index
Fetch the complete documentation index at: https://docs.gbox.ai/llms.txt
Use this file to discover all available pages before exploring further.
Supported Actions
The model supports three core actions that cover nearly all coordinate-based UI interactions:- Click: Identify precise tap/click coordinates for buttons, links, and interactive elements
- Drag: Calculate start and end coordinates for drag operations (e.g., swipe, scroll bars)
- Scroll: Determine optimal scroll coordinates and directions
Authorizations
Enter your API Key in the format: Bearer . Get it from https://gbox.ai
Body
application/json
Model request
Screenshot image as HTTP(S) URL or base64-encoded data URI. Supports both formats: 1) HTTP(S) URL pointing to an image file; 2) Base64-encoded data URI with format 'data:image/png;base64,[data]' or 'data:image/jpeg;base64,[data]'. Only PNG and JPEG formats are supported for base64.
Structured action object (click or drag)
- Click Action
- Drag Action
- Scroll Action
Model to use
Available options:
gbox-handy-1 Example:
"gbox-handy-1"
Response
200 - application/json
Model response data structure