feat(vertexai): add google_vertex_ai_online_evaluator resource #17944

Open
guvenenb wants to merge 3 commits into GoogleCloudPlatform:main from guvenenb:online_evals

Conversation

guvenenb commented Jun 12, 2026

Release Note Template for Downstream PRs (will be copied)

See Write release notes for guidance.

modular-magician added the awaiting-approval label (pull requests that need a reviewer's approval to run presubmit tests) Jun 12, 2026

Googlers: For automatic test runs see go/terraform-auto-test-runs.

@roaks3, a repository maintainer, has been assigned to review your changes. If you have not received review feedback within 2 business days, please leave a comment on this PR asking them to take a look.

You can help make sure that review is quick by doing a self-review and by running impacted tests locally.

Comment thread mmv1/products/vertexai/OnlineEvaluator.yaml Outdated
@@ -0,0 +1,37 @@
resource "google_vertex_ai_online_evaluator" "evaluator" {

maxgasztych Jun 12, 2026

It would be great to create a simple reasoning engine here as well so the example is complete.
Also, did you test that it works and that online evals are properly configured (in Pantheon)?

Author

I tested with Angelica's test project, and it worked. I will add the reasoning engine as well.

type: NestedObject
properties:
  - name: metric
    type: String

maxgasztych Jun 12, 2026 (edited)

I think it would be great to be explicit about metric types at least. If it's too much work, let's just leave a TODO and start working on it in a separate PR.

Also, I haven't verified the AI output, but these are the predefined metric names available in Pantheon, so it would be good to include them in the comment so that users know what they can use:
final_response_quality_v1
tool_use_quality_v1
hallucination_v1
safety_v1
multi_turn_task_success_v1
multi_turn_tool_use_quality_v1
multi_turn_trajectory_quality_v1

Comment thread mmv1/products/vertexai/OnlineEvaluator.yaml Outdated
Comment thread mmv1/products/vertexai/OnlineEvaluator.yaml
metric_sources {
  metric = jsonencode({
    "predefinedMetricSpec" = {
      "metricSpecName" = "safety_v1"

maxgasztych Jun 12, 2026 (edited)

Let's include the other 6 predefined metrics you can find in Pantheon to make the example better.
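A minimal sketch of how the example might cover all seven predefined metrics, assuming `metric_sources` is a repeatable block that accepts the `jsonencode` payload shown in the diff. The `predefined_metrics` local and the use of a `dynamic` block are illustrative choices rather than part of this PR, and every other argument of the resource is omitted here, so this is not a complete configuration.

```hcl
# Hypothetical local: the predefined metric names listed earlier in this review.
locals {
  predefined_metrics = [
    "final_response_quality_v1",
    "tool_use_quality_v1",
    "hallucination_v1",
    "safety_v1",
    "multi_turn_task_success_v1",
    "multi_turn_tool_use_quality_v1",
    "multi_turn_trajectory_quality_v1",
  ]
}

resource "google_vertex_ai_online_evaluator" "evaluator" {
  # Other arguments omitted; only metric_sources is sketched here.

  # Assumption: metric_sources may be repeated, one block per metric.
  dynamic "metric_sources" {
    for_each = local.predefined_metrics
    content {
      metric = jsonencode({
        "predefinedMetricSpec" = {
          "metricSpecName" = metric_sources.value
        }
      })
    }
  }
}
```

Writing the seven `metric_sources` blocks out by hand would work equally well and may read more clearly in a documentation example; the `dynamic` block just keeps the sketch short.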


metric_sources {
  metric = jsonencode({
    "predefinedMetricSpec" = {

maxgasztych Jun 12, 2026 (edited)

Can you also show how to create a custom code-based or LLM-based metric (available in Pantheon)?
I'm actually not sure whether your YAML allows this, because custom metrics are a separate resource (projects/.../locations/.../metrics/...).
It might make sense to create a separate PR for that so it's easier to review and we can at least deliver something this week, even if it doesn't include code and LLM metrics.

guvenenb added 2 commits June 12, 2026 09:32
- Rename location parameter to region for consistency.
- Update format description in name property to use region.
- Add timeouts (insert/update: 20m, delete: 60m) to OnlineEvaluator.
- Specify OpAsync type for async operations.
- Clean up comments in custom flatten template.
TAG=agy
CONV=5308268d-2e95-4a0c-8ce0-cdac27b8c645
- Use custom flattener for name to extract short ID.
- Update self_link to use full path template with short name.
- Update basic example to use region instead of location.
TAG=agy
CONV=5308268d-2e95-4a0c-8ce0-cdac27b8c645

Reviewers

roaks3: awaiting requested review
maxgasztych: requested changes (a reviewer whose approval may not affect merge requirements)

At least 1 approving review is required to merge this pull request.

