options.screenshot
parameter for better control over screenshot behavior.before
: Screenshot before the actionafter
: Screenshot after the actiontrace
: Screenshot with operation tracedelay
: Wait time after action before taking final screenshot500ms
, Maximum: 30s
base64
: Direct image data (default)storageKey
: Storage key with presigned URL accesspresignedExpiresIn
: Custom expiration for storageKey URLs (default: 30m
)options.screenshot
parameter instead of the old screenshot fields, as they will be deprecated in future versions.Scale Value | Screenshot Size | Coordinate Example |
---|---|---|
1.0 (default) | Full size | Click({x: 100, y: 100}) |
0.5 | 50% size | Click({x: 50, y: 50}) equivalent |
0.25 | 25% size | Click({x: 25, y: 25}) equivalent |
tap
and longPress
actions for precise coordinate taps and long-press interactions.swipe
/ scroll
now support semantic distance
values ("tiny" | "short" | "medium" | "long"
), so you no longer need to provide pixel values.outputFormat: "storageKey"
, allowing GBOX.AI to directly store screenshot information. Compared to returning image data directly, StorageKey provides more flexible storage and access options:createPresignedUrl
to create presigned URLs with specified expiration times, convenient for returning to LLM models for image analysis