Add CLI support for task prompt synchronization#346
Conversation
8fc7f89 to
e1a3a05
Compare
| ${"Piece-wise options for `dev`:"} | ||
| ${cDim('--grade')} Run/calibrate grader | ||
| ${cDim('--test-grader')} Check grader calibration (demo + negative-demo) | ||
| ${cDim('--sync-task')} Force update task prompt from prompts.md |
There was a problem hiding this comment.
maybe we can just make the default for gd dev to update existing tasks (just the prompt)? and remove this sync-task option?
There was a problem hiding this comment.
If we make it the default without a flag, would it be a concern that task descriptions got unintentionally overwritten? or maybe gd dev itself (without any flag) is already intentional enough..
There was a problem hiding this comment.
By default, if a dev runs gd dev without the sync-task option, it would still check if there's any outdated tasks. And instead of sync the prompts, it would warn the dev for any outdated tasks
There was a problem hiding this comment.
I think it is fine to have it overwrite. for now we can keep the prompts consistent between the two locations.
|
i'd be interested in solving this problem a slightly different way.. essentially move task.md into the usecase folder. delete prompt. use task for everything we used prompt for. this also implies synced prompts, naturally. But I don't currently recall a reason why we need these two things to be separate. especially now with the choice of empty vs daily-grind. @micahjo7 do you know a reason why we'd want a prompt and task to have diff content? IIRC we chatted about this briefly last week. I think on a call. |
that would also work and simplify things. I don't think we need them to have different content. a couple things we would need to address though:
maybe that's a lot.. @paulirish let me know what you think, if we want to go ahead with this, I can talk thru these things in more detail with you @formgit |
|
I like these ideas and think we should do it. 👍 @formgit i think there's still a bit of nuance here that you'll have to figure out.. or at least raise as questions we need to answer... like... how do we want to handle multiple prompts? and some of the other things micah raised. but i'm excited to do this to clarify our eval pipeline and workflow. this prompt/task overlap has been a code smell that's primarily my fault. |
This is my high level understanding and it's clear to me atm. Will raise more questions as they surface during implementation. |
gd gen-task-suitecommand to batch update task files with the latest prompts--sync-taskoption togd devto force update a task promptgd devwhen an existing task prompt drifts fromprompts.md