OpenEnv Environment

IT Helpdesk Ticket Routing OpenEnv

it_helpdesk_ticket_routing

Queue decisions that actually carry forward.

A benchmark for sequential helpdesk routing: hidden context, cluster-aware follow-ons, incident handling, deferrals, and a terminal rubric that rewards queue strategy rather than isolated classification alone.

Task family: easy to hard | Closed-form grader | Queue-level terminal objective

74 tickets in the grounded dataset. Curated records plus queue mutation mechanics create repeatable but non-trivial episodes.

26 capacity-aware alternate routes. The grader can reward declared fallback routes instead of collapsing to all-or-nothing exact match.

17 cluster-linked or coordinated cases. Handling one ticket can stabilize or destabilize the downstream tickets in the same workstream.

36 hidden-context routing cases. Investigation tools matter because key evidence does not appear in the initial observation by default.
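
The fallback-route idea can be illustrated with a small partial-credit function. This is only a sketch under stated assumptions: the function name, the group names, and the 0.5 credit value are hypothetical and are not taken from the environment's actual grader.

```python
from typing import Iterable


def assignment_credit(
    predicted: str,
    primary: str,
    fallbacks: Iterable[str],
    fallback_credit: float = 0.5,  # hypothetical value, not the real grader's
) -> float:
    """Score one assignment decision with partial credit for fallbacks.

    Exact match on the primary route earns full credit, a declared
    fallback route earns partial credit, anything else earns nothing.
    """
    if predicted == primary:
        return 1.0
    if predicted in set(fallbacks):
        return fallback_credit
    return 0.0


# Illustrative group names only; they are not labels from the dataset.
full = assignment_credit("network_ops", "network_ops", ["service_desk"])
partial = assignment_credit("service_desk", "network_ops", ["service_desk"])
miss = assignment_credit("security", "network_ops", ["service_desk"])
```

The point of the design is that a reasonable alternate route is distinguishable from a wrong one, which an exact-match grader cannot express.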

Task Ladder

One benchmark family, not three disconnected demos

The difficulty ladder keeps the same full-routing output while progressively changing observability, queue dependencies, and operational pressure.

Task 1 GUIDED

Guided Full Routing

Perform full helpdesk routing by selecting issue type, priority, assignment group, and resolution action. Easy-task episodes keep the ticket text mostly visible and focus on grounded single-ticket routing.

issue_type | priority | assignment_group | resolution_action

Task 2 CONTEXTUAL

Contextual Full Routing

Perform full helpdesk routing with partial observability and moderate queue carry-over. Some tickets hide related-case, requester-history, or cluster-coordination details until you investigate or request more information. Medium episodes can also require deferral or coherent handling across linked tickets in the same queue.

issue_type | priority | assignment_group | resolution_action

Task 3 ADAPTIVE

Adaptive Queue Routing

Perform full helpdesk routing by selecting the best issue type, priority, assignment group, and resolution action for the ticket. Use any ambiguity notes, related-ticket previews, queue-capacity forecasts, and planning state when present. Some hard tickets intentionally hide decisive routing context until you investigate with the available tools, and some hard episodes also require queue-level capacity planning, deferrals, incident management, and recovery from downstream follow-up tickets.

issue_type | priority | assignment_group | resolution_action
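
All three tasks ask for the same four output fields, so a routing decision can be pictured as a small record. The sketch below is illustrative: the `RoutingAction` class name and the example values are assumptions, and only the four field names come from the task descriptions. The server's real request models are documented at GET /docs.

```python
from dataclasses import asdict, dataclass


@dataclass(frozen=True)
class RoutingAction:
    """The four fields every task in the ladder asks the agent to choose."""

    issue_type: str
    priority: str
    assignment_group: str
    resolution_action: str


# Placeholder values for illustration; not labels from the dataset.
action = RoutingAction(
    issue_type="access_request",
    priority="high",
    assignment_group="identity_team",
    resolution_action="escalate",
)

# A plain dict is convenient for serializing into a request body.
payload = asdict(action)
```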

Environment Signals

What the agent is balancing

The benchmark is designed so strong policy choices change later tickets, incident coverage, and terminal queue quality instead of just nudging shaped reward.

Hidden context retrieval

Related-ticket previews, requester history, internal routing notes, queue cluster summaries, and capacity forecasts are revealed through explicit tool use.

investigate | request_info | cluster summary

Operational actions with consequences

Deferrals can raise later urgency, incident handling can reduce downstream debt, and weak handling can spawn or worsen follow-up work.

defer | open_incident | follow-up spawning

Queue-level terminal rubric

Final scoring blends routing trajectory quality with queue management quality so agents are rewarded for coherent episode strategy, not just isolated ticket matches.

terminal rubric | queue quality | planning-aware
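
One simple way to picture a score that blends routing trajectory quality with queue management quality is a convex combination of two values in [0, 1]. This is a sketch under stated assumptions: the function name, the linear form, and the default weight are hypothetical, and the environment's actual rubric may combine its components differently.

```python
def terminal_score(
    routing_quality: float,
    queue_quality: float,
    queue_weight: float = 0.5,  # hypothetical weight, not the real rubric's
) -> float:
    """Blend two quality scores in [0, 1] into one terminal score."""
    for name, value in (
        ("routing_quality", routing_quality),
        ("queue_quality", queue_quality),
        ("queue_weight", queue_weight),
    ):
        if not 0.0 <= value <= 1.0:
            raise ValueError(f"{name} must be in [0, 1], got {value}")
    return (1.0 - queue_weight) * routing_quality + queue_weight * queue_quality


# Perfect per-ticket routing with a badly managed queue still loses
# half the score under the assumed equal weighting.
score = terminal_score(routing_quality=1.0, queue_quality=0.0)
```

Under any nonzero queue weight, an agent that matches every ticket in isolation but leaves the queue in poor shape scores below one that does both well.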

Quick Routes

Fast ways to demo the environment

Useful entry points for judges, reviewers, or anyone trying to get signal from the project quickly.

Interactive API docs

Browse the full OpenEnv-compatible surface, request models, and built-in helper endpoints.

GET /docs

Task manifest

Inspect the easy, medium, and hard task definitions exactly as exposed by the server.

GET /tasks

Hard-task baseline rollout

See a deterministic baseline episode over the hardest queue with the current environment logic.

GET /baseline?task_id=3&seed=42

Health and deployment status

Quick check that the service is alive and ready for OpenEnv-style evaluation requests.

GET /health
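
The routes above can also be called from a script. The sketch below uses only the Python standard library. The base URL is an assumption (replace it with wherever the service is deployed), the helper names are hypothetical, and `fetch_json` assumes the endpoint returns JSON, which this page does not state.

```python
import json
from urllib.parse import urlencode, urljoin
from urllib.request import urlopen

BASE_URL = "http://localhost:8000"  # assumption: local deployment address


def route_url(path: str, **params: object) -> str:
    """Build a full URL for one of the quick routes."""
    url = urljoin(BASE_URL + "/", path.lstrip("/"))
    if params:
        url += "?" + urlencode(params)
    return url


def fetch_json(url: str, timeout: float = 10.0) -> object:
    """GET a URL and decode the body as JSON (assumes a JSON response)."""
    with urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


health_url = route_url("/health")
tasks_url = route_url("/tasks")
baseline_url = route_url("/baseline", task_id=3, seed=42)

# With a running server, for example:
#   status = fetch_json(health_url)
#   rollout = fetch_json(baseline_url)
```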