Evaluation is an essential step to evolve your cooking assistant from an agent that can conduct a conversation, to an agent that many users can have a conversation with. Robustness is the keyword here. By testing, you can find out the deficiencies of the agent that need to be addressed so that it will not break down easily. This is typically done in a cyclical manner, where rounds of tests and improvements follow one another. You will follow two stages in the current project: system testing and a user study, which we will explain in further detail in the following subsections. The two stages will be reported on by your group in the end report.
Evaluating Your BotData Collection with Your Bot
Add Comment