[DEV-14703] Create NL Search Django Model #4645
Conversation
… and ensures spark is installed in the environment
…d award type field to lookup mapping
…hub.com/fedspendingtransparency/usaspending-api into ftr/dev-14642-fix-sorting-on-award-type
zachflanders-frb left a comment
Looking good! I am requesting changes to remove the LLMSearchQuery model and to add the migrations file to the commit.
Can we add Nova Pro? (amazon.nova-pro-v1:0)
```python
        db_table = "ai_model"
        ordering = ["-id"]


class Prompts(models.Model):
```
I wonder about adding a name field to this model in order to have an easier way to get prompts than using the id or the full description?
```python
    system_prompt = models.ForeignKey(Prompts, on_delete=models.SET_NULL, null=True, related_name="sessions")
    started_at = models.DateTimeField(auto_now_add=True)
    ended_at = models.DateTimeField(null=True, blank=True)
    feedback = models.BooleanField(default=None, null=True, blank=True, help_text="positive=True, negative=False")
```
Based on the UX mocks, it looks like we are going to have a short survey when someone gives feedback. This makes me think feedback could be its own model with an is_positive field, plus either a survey JSON field to hold the questions and answers, or SurveyQuestion and SurveyResponse models to fully model out the survey. On the other hand, the survey might not use the API at all and instead collect feedback through some other tool.
```python
    created_at = models.DateTimeField(auto_now_add=True)
    input_tokens = models.IntegerField(default=0)
    output_tokens = models.IntegerField(default=0)
    total_tokens = models.IntegerField(default=0)
```
In my initial testing, total_tokens might not be that important to keep track of, since input tokens and output tokens are priced differently and this is primarily to keep track of usage and cost.
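To illustrate why the split matters for cost tracking, a minimal sketch (the per-1K-token prices below are placeholders, not actual Bedrock rates):

```python
# Placeholder per-1,000-token prices; real Bedrock pricing varies by model
# and region, so look the current rates up rather than trusting these.
PRICES_PER_1K = {
    "amazon.nova-pro-v1:0": {"input": 0.0008, "output": 0.0032},
}


def estimate_cost(model_id: str, input_tokens: int, output_tokens: int) -> float:
    """Estimate the dollar cost of one call.

    Input and output tokens are billed at different rates, so a single
    total_tokens count could not support this calculation.
    """
    price = PRICES_PER_1K[model_id]
    cost = (input_tokens / 1000) * price["input"] + (output_tokens / 1000) * price["output"]
    return round(cost, 6)
```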
```python
class LLMSearchQuery(models.Model):
    user_query = models.TextField()
    session = models.ForeignKey(Session, on_delete=models.CASCADE, related_name="search_queries")
    created_at = models.DateTimeField(auto_now_add=True)

    def __str__(self):
        preview = self.user_query[:75] + "..." if len(self.user_query) > 75 else self.user_query
        return f"Query {self.id}: {preview}"

    class Meta:
        db_table = "llm_search_query"
        indexes = [
            models.Index(fields=["-created_at"]),
        ]
```
I found that this model is redundant because the first message in the session will be a user message that includes the user query, so I would recommend that we do not keep this model.
```python
    selectedRecipientLocations: dict[str, Any] = Field(default_factory=dict)
    awardType: list[str] = Field(default_factory=list)
    selectedAwardIDs: dict[str, Any] = Field(default_factory=dict)
    awardAmounts: dict[str, list[int]] = Field(default_factory=dict)
```
I added this to the PoC branch to give the LLM more context and understanding of the awardAmounts field.
Suggested change:

```python
    awardAmounts: dict[str, list[int | None]] = Field(
        default_factory=dict,
        description=(
            "Dictionary of award amount ranges for filtering. "
            "Each value is a two-element list: [min_amount, max_amount]. "
            "Use `None` for unbounded ranges.\n\n"
            "TWO MUTUALLY EXCLUSIVE MODES:\n\n"
            "MODE 1 - STANDARD RANGES (can select multiple):\n"
            "- 'range-0': [None, 1000000] - Awards up to $1M\n"
            "- 'range-1': [1000000, 25000000] - Awards $1M to $25M\n"
            "- 'range-2': [25000000, 100000000] - Awards $25M to $100M\n"
            "- 'range-3': [100000000, 500000000] - Awards $100M to $500M\n"
            "- 'range-4': [500000000, None] - Awards over $500M\n\n"
            "MODE 2 - SPECIFIC RANGE (must be alone):\n"
            "- 'specific': [min, max] - Specify exact dollar amounts\n\n"
            "CRITICAL RULES:\n"
            "1. You can use multiple standard ranges together (range-0 through range-4)\n"
            "2. You can use ONE specific range with specific min/max values\n"
            "3. NEVER mix standard ranges with specific range\n"
            "4. When using 'specific', it must be the ONLY key in the dictionary"
        ),
        json_schema_extra={
            "examples": [
                # Example 1: Multiple standard ranges
                {"range-0": [None, 1000000], "range-2": [25000000, 100000000]},
                # Example 2: Single standard range
                {"range-3": [100000000, 500000000]},
                # Example 3: Custom range with both bounds
                {"specific": [5000000, 50000000]},
                # Example 4: Custom range unbounded above
                {"specific": [10000000, None]},
                # Example 5: Custom range unbounded below
                {"specific": [None, 75000000]},
            ]
        },
    )
```
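Since the description only asks the model to follow those rules, a small validator could enforce them server-side. This is a sketch under the rules stated in the description (it could be wired in as a Pydantic validator, but is shown here as a plain function):

```python
# The five standard range keys named in the field description.
STANDARD_RANGES = {f"range-{i}" for i in range(5)}


def validate_award_amounts(award_amounts: dict) -> dict:
    """Enforce the two mutually exclusive awardAmounts modes."""
    # Rule 4: 'specific' must be the only key when present.
    if "specific" in award_amounts and len(award_amounts) > 1:
        raise ValueError("'specific' must be the only key in awardAmounts")
    # Reject keys outside the documented vocabulary.
    unknown = set(award_amounts) - STANDARD_RANGES - {"specific"}
    if unknown:
        raise ValueError(f"unknown awardAmounts keys: {sorted(unknown)}")
    # Every value must be a two-element [min_amount, max_amount] list.
    for key, bounds in award_amounts.items():
        if len(bounds) != 2:
            raise ValueError(f"{key}: expected [min_amount, max_amount]")
    return award_amounts
```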
Can we add the migrations file to this PR?
Description:
Create the Django models and the corresponding migrations, ensuring the correct tables and relationships are created in Postgres, related to NL search assistant design and work.
Technical Details:
```shell
python manage.py load_llm_fixtures
```

Requirements for PR Merge:
Explain N/A in above checklist: