
In the small user study at the end of the project, you will test an agent from another team, and another team will test yours! As you can see in the Course Schedule, a Wednesday in the last week of the project is reserved for running the user study.

...

Setting up your user study: Your team will run the user study for your own agent and collect the data you consider relevant for analyzing its performance. Consider the metrics already explained in the Agent Testing section: effectiveness, efficiency, robustness, and user satisfaction. Decide what you want to evaluate and how you will measure it. Keep in mind that the greatest advantage of having another team test your agent is that they will likely interact with it differently and have conversations you have not seen in your own agent testing (that’s why it makes sense to already involve people from outside your team during agent testing).

Reporting: In your Final Report you should (very) briefly report on the setup of your user study, but focus mainly on the analysis of the data that you collected. You should have collected data for close to 20 conversational interactions with your agent. For these conversations, you can report some of the more interesting basic descriptive statistics (https://en.wikipedia.org/wiki/Descriptive_statistics). Descriptive statistics provide simple summaries about the sample (your participants). You do not need to provide figures or tables, since you don’t have the space in your report; if you want to provide details, you can add them in an appendix. Focus instead on the more interesting findings. Most importantly, briefly discuss and interpret the data that you collected to explain what it can tell us about the performance of your agent.
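As an illustration of what basic descriptive statistics could look like for your conversation data, here is a minimal Python sketch using only the standard library. The record fields (turns, completed, satisfaction) and all values are hypothetical placeholders, not a prescribed data format and not real results; replace them with whatever your team actually logged.

```python
from statistics import mean, median, stdev


def summarize(values):
    """Return basic descriptive statistics for a list of numbers."""
    return {
        "n": len(values),
        "mean": mean(values),
        "median": median(values),
        # The sample standard deviation needs at least two values.
        "stdev": stdev(values) if len(values) > 1 else 0.0,
        "min": min(values),
        "max": max(values),
    }


# Hypothetical placeholder records, one per conversation (not real data).
conversations = [
    {"turns": 12, "completed": True, "satisfaction": 4},
    {"turns": 20, "completed": False, "satisfaction": 2},
    {"turns": 9, "completed": True, "satisfaction": 5},
    {"turns": 15, "completed": True, "satisfaction": 3},
]

# Efficiency: how many turns did a conversation take?
turn_stats = summarize([c["turns"] for c in conversations])

# User satisfaction: assumed here to be a 1-5 rating given by the tester.
satisfaction_stats = summarize([c["satisfaction"] for c in conversations])

# Effectiveness: fraction of conversations in which the task was completed.
completion_rate = sum(c["completed"] for c in conversations) / len(conversations)

print("Turns per conversation:", turn_stats)
print("Satisfaction rating:", satisfaction_stats)
print("Completion rate:", completion_rate)
```

With only about 20 conversations, a handful of such numbers (for example the median number of turns and the completion rate) is usually enough; the report should spend its space on interpreting them.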

...
