You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
* Update CSAT evaluator output schema
* run docstyle
* update not applicable
* Fix not-applicable CSAT result label
Set not-applicable output result to not_applicable while keeping status as skipped, and update behavior tests to match.
Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
* update csat output schema
---------
Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
@@ -109,13 +116,14 @@ AGENT RESPONSE: "I've successfully cancelled your order #12345. Your payment of
109
116
110
117
OUTPUT:
111
118
{
112
-
"score": 5,
113
-
"explanation": "The agent immediately resolved the cancellation request, provided clear refund timeline, and confirmed next steps. Customer would be very satisfied with the efficient and complete resolution.",
114
-
"dimensions": {
119
+
"reason": "The agent immediately resolved the cancellation request, provided clear refund timeline, and confirmed next steps. Customer would be very satisfied with the efficient and complete resolution.",
120
+
"properties": {
115
121
"helpfulness": "Directly addressed the cancellation request and completed it immediately.",
116
122
"completeness": "Provided all relevant details: confirmation, refund amount, timeline, and email notification.",
117
123
"tone": "Professional and helpful, ended with an offer for further assistance."
"explanation": "The agent provided basic information about the return window but lacked important details like conditions, process, or refund method. Customer got a partial answer but might need to ask follow-up questions.",
131
-
"dimensions": {
137
+
"reason": "The agent provided basic information about the return window but lacked important details like conditions, process, or refund method. Customer got a partial answer but might need to ask follow-up questions.",
138
+
"properties": {
132
139
"helpfulness": "Answered the basic question but minimal detail provided.",
133
140
"completeness": "Missing key information about conditions, exceptions, and return process.",
134
141
"tone": "Neutral tone, neither particularly warm nor cold."
135
-
}
142
+
},
143
+
"score": 3,
144
+
"status": "completed"
136
145
}
137
146
138
147
### Score 1 - Very Dissatisfied
@@ -143,13 +152,14 @@ AGENT RESPONSE: "According to our records, the package was delivered. Have you c
143
152
144
153
OUTPUT:
145
154
{
146
-
"score": 1,
147
-
"explanation": "The agent dismissed the customer's concern and offered no real help beyond a generic suggestion. Customer has a real problem that wasn't addressed, leaving them frustrated with no resolution path.",
148
-
"dimensions": {
155
+
"reason": "The agent dismissed the customer's concern and offered no real help beyond a generic suggestion. Customer has a real problem that wasn't addressed, leaving them frustrated with no resolution path.",
156
+
"properties": {
149
157
"helpfulness": "Failed to offer any meaningful assistance or resolution options.",
150
158
"completeness": "Did not offer to investigate, file a claim, or provide alternatives.",
151
159
"tone": "Dismissive tone that implies the customer is wrong or didn't look properly."
@@ -125,13 +133,14 @@ Agent turn 2: Order #12346 shipped yesterday via FedEx. Tracking number: FX12345
125
133
126
134
EXPECTED OUTPUT:
127
135
{
128
-
"score": 5,
129
-
"explanation": "Both requests handled efficiently across two turns. Cancellation processed immediately with refund details, and shipping status provided with tracking info. Customer would be very satisfied with the responsive, complete service.",
130
-
"dimensions": {
136
+
"reason": "Both requests handled efficiently across two turns. Cancellation processed immediately with refund details, and shipping status provided with tracking info. Customer would be very satisfied with the responsive, complete service.",
137
+
"properties": {
131
138
"helpfulness": "Both the cancellation and shipping inquiry were addressed immediately and completely.",
132
139
"completeness": "All details provided: refund timeline, confirmation email, tracking number, and delivery estimate.",
133
140
"tone": "Professional and proactive throughout, offering further assistance after the first request."
134
-
}
141
+
},
142
+
"score": 5,
143
+
"status": "completed"
135
144
}
136
145
137
146
### Score 3 - Neutral (Partial resolution across turns)
@@ -144,13 +153,14 @@ Agent turn 2: I see. Account locks usually expire after 30 minutes. Please try a
144
153
145
154
EXPECTED OUTPUT:
146
155
{
147
-
"score": 3,
148
-
"explanation": "The agent provided basic troubleshooting but didn't proactively unlock the account or offer alternative solutions. The customer got a partial answer but would need to wait and try again without certainty of resolution.",
149
-
"dimensions": {
156
+
"reason": "The agent provided basic troubleshooting but didn't proactively unlock the account or offer alternative solutions. The customer got a partial answer but would need to wait and try again without certainty of resolution.",
157
+
"properties": {
150
158
"helpfulness": "Initial suggestion was generic and unhelpful. Second response addressed the specific error but offered only passive waiting.",
151
159
"completeness": "Missing proactive options like unlocking the account, password reset, or escalation to support.",
152
160
"tone": "Polite but somewhat dismissive of the customer's frustration with a 'try again later' response."
153
-
}
161
+
},
162
+
"score": 3,
163
+
"status": "completed"
154
164
}
155
165
156
166
### Score 1 - Very Dissatisfied (Failed session)
@@ -163,13 +173,14 @@ Agent turn 2: Unfortunately, since the package shows as delivered, we cannot pro
163
173
164
174
EXPECTED OUTPUT:
165
175
{
166
-
"score": 1,
167
-
"explanation": "The agent dismissed the customer's concern across both turns, offered no real resolution, and suggested filing a police report instead of helping. The customer's problem was completely unresolved.",
168
-
"dimensions": {
176
+
"reason": "The agent dismissed the customer's concern across both turns, offered no real resolution, and suggested filing a police report instead of helping. The customer's problem was completely unresolved.",
177
+
"properties": {
169
178
"helpfulness": "Failed to offer any meaningful assistance. Deflected responsibility to the customer.",
170
179
"completeness": "Did not offer investigation, replacement, refund, or escalation options.",
171
180
"tone": "Dismissive in both turns, implying the customer is wrong and offering no empathy for the situation."
0 commit comments