Users input their BAML prompt in AI Studio/online. optional ability submit upload to the benchmark system, find similar BAML category and difficulty, and suggest potential models for the user to try. (don't want to re run the whole benchmark again every time one is submitted).
This gives us more benchmark material and added value to the user. (could be explored as potential for monetization after x amount free model suggestions per day?)
Users input their BAML prompt in AI Studio/online. optional ability submit upload to the benchmark system, find similar BAML category and difficulty, and suggest potential models for the user to try. (don't want to re run the whole benchmark again every time one is submitted).
This gives us more benchmark material and added value to the user. (could be explored as potential for monetization after x amount free model suggestions per day?)