Interpret prediction results from image object detection models

After requesting a prediction, Vertex AI returns results based on your model's objective. AutoML image object detection prediction responses return all objects found in an image. Each found object has an annotation (label and normalized bounding box) with a corresponding confidence score. The bounding box is written as:

"bboxes": [ [xMin, xMax, yMin, yMax], ...]

Where xMin, xMax are the minimum and maximum x values and yMin, yMax are the minimum and maximum y values respectively.

Example batch prediction output

Batch AutoML image object detection prediction responses are stored as JSON Lines files in Cloud Storage buckets. Each line of the JSON Lines file contains all objects found in a single image file. Each found object has an annotation (label and normalized bounding box) with a corresponding confidence score.

Important: Bounding boxes are specified as:

"bboxes": [ [xMin, xMax, yMin, yMax], ...]

Where xMin and xMax are the minimum and maximum x values and yMin and yMax are the minimum and maximum y values respectively.

{
 "instance": {"content": "gs://bucket/image.jpg", "mimeType": "image/jpeg"},
 "prediction": {
 "ids": [1, 2],
 "displayNames": ["cat", "dog"],
 "bboxes": [
 [0.1, 0.2, 0.3, 0.4],
 [0.2, 0.3, 0.4, 0.5]
 ],
 "confidences": [0.7, 0.5]
 }
}

Except as otherwise noted, the content of this page is licensed under the Creative Commons Attribution 4.0 License, and code samples are licensed under the Apache 2.0 License. For details, see the Google Developers Site Policies. Java is a registered trademark of Oracle and/or its affiliates.

Last updated 2025年10月13日 UTC.