microsoft-foundry
diff --git a/‎.github/ISSUE_TEMPLATE/bug_report.yml‎
Lines changed: 124 additions & 0 deletions b/‎.github/ISSUE_TEMPLATE/bug_report.yml‎
Lines changed: 124 additions & 0 deletions
diff --git a/‎.github/ISSUE_TEMPLATE/config.yml‎
Lines changed: 11 additions & 0 deletions b/‎.github/ISSUE_TEMPLATE/config.yml‎
Lines changed: 11 additions & 0 deletions
diff --git a/‎.github/ISSUE_TEMPLATE/documentation.yml‎
Lines changed: 47 additions & 0 deletions b/‎.github/ISSUE_TEMPLATE/documentation.yml‎
Lines changed: 47 additions & 0 deletions
diff --git a/‎.github/ISSUE_TEMPLATE/feature_request.yml‎
Lines changed: 75 additions & 0 deletions b/‎.github/ISSUE_TEMPLATE/feature_request.yml‎
Lines changed: 75 additions & 0 deletions
diff --git a/‎.github/ISSUE_TEMPLATE/feedback.yml‎
Lines changed: 39 additions & 0 deletions b/‎.github/ISSUE_TEMPLATE/feedback.yml‎
Lines changed: 39 additions & 0 deletions
@@ -0,0 +1,124 @@
+name: 🐛 Bug report
+description: Something is broken or behaving unexpectedly in the evaluation toolkit.
+title: "[Bug]: "
+labels: ["bug", "needs-triage"]
+body:
+  - type: markdown
+    attributes:
+      value: |
+        Thanks for taking the time to file a bug report! Please fill in as much of the form below as you can — the more detail you provide, the faster we can help.
+
+        > **Before you submit:** Please search [existing issues](https://github.com/microsoft-foundry/Model-Router-Auto-Evaluation/issues?q=is%3Aissue) to avoid duplicates, and check the [FAQ](../blob/main/docs/faq.md) for known issues.
+
+  - type: checkboxes
+    id: prereqs
+    attributes:
+      label: Pre-flight checklist
+      description: Please confirm the following before submitting.
+      options:
+        - label: I have searched existing issues and this is not a duplicate.
+          required: true
+        - label: I have read the [FAQ](../blob/main/docs/faq.md).
+          required: true
+        - label: I have removed any API keys, endpoints, or other secrets from logs and config snippets I paste below.
+          required: true
+
+  - type: textarea
+    id: summary
+    attributes:
+      label: Summary
+      description: A clear, concise description of the bug.
+      placeholder: When I run `python scripts/run_eval.py --dry-run`, the script crashes with a KeyError.
+    validations:
+      required: true
+
+  - type: textarea
+    id: reproduce
+    attributes:
+      label: Steps to reproduce
+      description: Exact commands and inputs we can run to see the bug ourselves.
+      placeholder: |
+        1. Clone the repo at commit <SHA>
+        2. Create `.env` with the following variables (values redacted): ...
+        3. Run `python scripts/run_eval.py --dataset datasets/sample_custom.jsonl --sample-size 5`
+        4. See the error
+    validations:
+      required: true
+
+  - type: textarea
+    id: expected
+    attributes:
+      label: Expected behaviour
+      description: What did you expect to happen?
+    validations:
+      required: true
+
+  - type: textarea
+    id: actual
+    attributes:
+      label: Actual behaviour
+      description: What actually happened? Include error messages, stack traces, or unexpected output. Use code fences (```) for readability.
+    validations:
+      required: true
+
+  - type: dropdown
+    id: pipeline
+    attributes:
+      label: Which part of the pipeline is affected?
+      options:
+        - Local evaluation (scripts/run_eval.py)
+        - Foundry cloud evaluation (scripts/run_foundry_eval.py)
+        - Comparison or export scripts (compare_results.py / export_results.py)
+        - WALKTHROUGH.ipynb (Jupyter notebook)
+        - Configuration (configs/*.yaml)
+        - Dataset loading (JSONL / CSV / SQL)
+        - Reporting / dashboard / charts
+        - Tests (pytest)
+        - Documentation
+        - Other / not sure
+    validations:
+      required: true
+
+  - type: input
+    id: python-version
+    attributes:
+      label: Python version
+      description: Output of `python --version`
+      placeholder: "3.11.7"
+    validations:
+      required: true
+
+  - type: input
+    id: os
+    attributes:
+      label: Operating system
+      placeholder: "Windows 11 / macOS 14.4 / Ubuntu 22.04"
+    validations:
+      required: true
+
+  - type: input
+    id: package-version
+    attributes:
+      label: Repo commit or release
+      description: Output of `git rev-parse --short HEAD` or the release tag you are using.
+      placeholder: "abc1234 or v1.0.0"
+
+  - type: textarea
+    id: config
+    attributes:
+      label: Relevant configuration
+      description: A redacted snippet of the YAML config or environment variables involved. **Do not include API keys.**
+      render: yaml
+
+  - type: textarea
+    id: logs
+    attributes:
+      label: Logs and screenshots
+      description: Paste any relevant terminal output, stack traces, or screenshots. Redact secrets first.
+      render: shell
+
+  - type: textarea
+    id: additional
+    attributes:
+      label: Additional context
+      description: Anything else we should know — workarounds tried, related issues, hypotheses, etc.
@@ -0,0 +1,11 @@
+blank_issues_enabled: false
+contact_links:
+  - name: Question or general feedback
+    url: https://aka.ms/foundry/discord
+    about: For questions about usage, sharing experience, or general feedback please use GitHub Discussions.
+  - name: Microsoft Foundry documentation
+    url: https://learn.microsoft.com/azure/ai-foundry/
+    about: For questions about the Microsoft Foundry product itself (not this evaluation tool), see the official docs.
+  - name: Security vulnerabilities
+    url: https://www.microsoft.com/msrc
+    about: Please report security vulnerabilities privately via the Microsoft Security Response Center, not as public issues.
@@ -0,0 +1,47 @@
+name: 📚 Documentation issue
+description: Something in the README, QUICKSTART, docs/, or notebook is wrong, missing, or unclear.
+title: "[Docs]: "
+labels: ["documentation", "needs-triage"]
+body:
+  - type: markdown
+    attributes:
+      value: |
+        Thanks for helping improve the documentation. Clear docs are a feature — please tell us what tripped you up.
+
+  - type: input
+    id: page
+    attributes:
+      label: Which page or file?
+      description: Path or URL of the doc that needs attention.
+      placeholder: "docs/how-to-run-live-eval.md, README.md, WALKTHROUGH.ipynb cell 5, ..."
+    validations:
+      required: true
+
+  - type: textarea
+    id: issue
+    attributes:
+      label: What is wrong, missing, or unclear?
+      placeholder: |
+        - The instructions assume `az login` already works, but I had no Azure account yet.
+        - The example config refers to a model deployment name that doesn't exist in the default config.
+    validations:
+      required: true
+
+  - type: textarea
+    id: suggestion
+    attributes:
+      label: Suggested improvement
+      description: Optional — if you have wording in mind, drop it here.
+
+  - type: dropdown
+    id: audience
+    attributes:
+      label: Reader audience this affects
+      options:
+        - First-time / beginner user
+        - Developer extending the toolkit
+        - Operator running large-scale evaluations
+        - Foundry / cloud-eval user
+        - Other
+    validations:
+      required: true
@@ -0,0 +1,75 @@
+name: ✨ Feature request
+description: Suggest a new capability, dataset format, grader, or improvement.
+title: "[Feature]: "
+labels: ["enhancement", "needs-triage"]
+body:
+  - type: markdown
+    attributes:
+      value: |
+        Thanks for suggesting an improvement! Please describe both the **problem** you're trying to solve and the **outcome** you'd like, so we can consider alternative solutions too.
+
+  - type: checkboxes
+    id: prereqs
+    attributes:
+      label: Pre-flight checklist
+      options:
+        - label: I have searched existing issues and discussions for similar requests.
+          required: true
+        - label: This request is about the evaluation toolkit, not about Microsoft Foundry as a product.
+          required: true
+
+  - type: textarea
+    id: problem
+    attributes:
+      label: What problem are you trying to solve?
+      description: Describe the use case or pain point. Don't lead with the proposed solution.
+      placeholder: I want to evaluate Model Router on a 50,000-prompt dataset stored in BigQuery, but the current loader only supports JSONL/CSV/SQLAlchemy URLs.
+    validations:
+      required: true
+
+  - type: textarea
+    id: proposal
+    attributes:
+      label: Proposed solution
+      description: What would you like the toolkit to do? Be as specific as you can — CLI flags, config keys, output format, etc.
+    validations:
+      required: true
+
+  - type: textarea
+    id: alternatives
+    attributes:
+      label: Alternatives considered
+      description: Other approaches you thought about, and why they don't quite fit.
+
+  - type: dropdown
+    id: area
+    attributes:
+      label: Area of the toolkit
+      multiple: true
+      options:
+        - Local evaluation pipeline
+        - Foundry cloud evaluation
+        - Dataset loading / format support
+        - Judge / grader prompts
+        - Cost or latency methodology
+        - Reporting / dashboard / charts
+        - CLI / configuration
+        - Documentation / tutorials
+        - Tests / CI
+        - Other
+    validations:
+      required: true
+
+  - type: textarea
+    id: additional
+    attributes:
+      label: Additional context
+      description: Examples, mock outputs, links to related issues or external docs.
+
+  - type: checkboxes
+    id: contribute
+    attributes:
+      label: Contribution
+      options:
+        - label: I would be willing to contribute a pull request for this feature.
+        - label: I'd like to discuss the design first before any implementation work.
@@ -0,0 +1,39 @@
+name: 💬 Feedback
+description: Share your experience using the toolkit — what worked, what didn't, what surprised you.
+title: "[Feedback]: "
+labels: ["feedback"]
+body:
+  - type: markdown
+    attributes:
+      value: |
+        Thanks for sharing! Feedback helps us understand how the toolkit is used in the real world. There are no required fields — fill in whatever is useful.
+
+        > For free-form questions, please use [Microsoft Foundry Discord](https://aka.ms/foundry/discord) instead.
+
+  - type: textarea
+    id: use-case
+    attributes:
+      label: How are you using the toolkit?
+      placeholder: "Comparing Model Router against gpt-5 on ~500 internal customer-support prompts."
+
+  - type: textarea
+    id: worked
+    attributes:
+      label: What worked well?
+
+  - type: textarea
+    id: friction
+    attributes:
+      label: Where did you hit friction?
+      description: Confusing docs, broken behaviour, missing features, surprising results — anything.
+
+  - type: textarea
+    id: outcome
+    attributes:
+      label: Did the evaluation help you make a decision?
+      description: e.g. adopted Model Router, kept the baseline, found a quality regression, identified a cost saving.
+
+  - type: textarea
+    id: additional
+    attributes:
+      label: Anything else?