Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.
Table of Contents

Info

The information provided here should be sufficient for you to complete the project. For those of you who are interested and want to learn more about conversational patterns and the related coding scheme that we use here (e.g. C4, etc.), see: Moore, R. J., Arar, R. (2019), Conversational UX Design: A Practitioner's Guide to the Natural Conversation Framework. ACM.

Background Knowledge Preparation

Before beginning your project, we would like you all to familiarize yourself with some background knowledge about Git and Dialogflow, which are needed to successfully complete your project.

Everything we think you need to know can be found here: Getting Started: What to Know About Git and DialogFlow

To test your knowledge of these subjects there are two Canvas quizzes on the subjects that must be completed.

Additionally, on the page Getting Started: The Tools Used we explain and help you set up the infrastructure needed to complete this project.

Please complete the instructions on both of these pages before beginning your project.

The Project

In this project, you will be developing a conversational cooking assistant that uses speech to interact and is able to conduct a conversation for selecting a recipe to cook. Automatic Speech Recognition (ASR) has matured to a level where it is possible to categorize a user's utterances adequately to make sense out of them. Also, Speech Synthesis (TTS) can be used to produce well-pronounced spoken utterances from written text. Yet, conversational agents have not become mainstream, and whoever has used a home assistant (Google Home or Apple Siri) has experienced being misunderstood. These assistants are typically able to perform well on basic Question-Answering (QA) interactions, which most of the time consist of just two conversational turns: a question and an answer. However, conducting longer conversations tends to be more challenging. This is because a longer conversation can take (too) many directions and the chance that a user says something unexpected significantly increases. We will investigate this challenge in this project.

In this project, you will develop a cooking assistant that conversationally assists a user to select a recipe based on a variety of filters, whereas the instruction of the recipe itself is not in the scope of the Project MAS course. We chose to focus on the recipe selection activity of cooking support, since several challenging elements of building a conversational agent come together in this segment. First, there are many different ways in which this conversation may conducted, with different aspects of recipes that may be specified by the user to come to a selection (type of ingredients, cooking duration, course, etc.), and the option to mention multiple of those elements in one utterance. Second, the agent has to cover a broad knowledge space to understand what the user is looking for. Third, the agent has to reason over its database of recipes to filter for recipes that fit the user’s preferences.

You are provided with a prolog knowledge base of close to 1,000 recipes and their components, that still requires an effort of your group to make it usable by the cooking assistant. In the instructions we walk you through the procedure by pinpointing aspects of the agent that need to be altered or filled in. If you fill in these blanks the agent should work, but it will be pretty basic…

...

The project does not stop there. We challenge you to extend this basic bot with your own flare and ideas, so as to earn a higher grade. Read a bit more about this in our Extensions section at the end of your project.

During development and after finishing the agent, it is essential to do some testing! This will also involve having another team conversing with your agent, and checking out another team’s agent yourself. You can read about this in our System testing and User study section.

Apart from a working and tested agent, your project work will result in an end report in which you describe its different features and its performance according to the testing, among others. More information on that can be found in the 2023: End Report Section.

For now, let’s get started by dividing work among teammates.

Team Forming and Initial Set Up

...

Assigning Roles Within Your Team

At the start of the course, you are expected to have formed a team of 6.One of the first things to think about as a team is how to divide the three main tasks or roles amongst your team members:

  1. Dialogflow and Filters (2 team members);

  2. Visual Support (2 team members); and

  3. Patterns and Responses (2 team members).

The two team members with a similar role should work together in pairs. You can successfully do pair programming in person or remotely. As you will work together, but the three teams of two need to communicate well to make sure those three aspects of the agent are integrated.

Instructions for the tools and setup can be found here:

...

Team Roles

all need to work on one and the same agent, however, it will also be essential for the three pair programming subteams to communicate well with each other to make sure the three programming tasks are adequately integrated and work well together. Another task that you will have as a team that will be essential to develop an effective and robust agent is to continuously evaluate and test your bot once it is up and running (even if it is still only minimally functioning!). For this, make sure you also read the Agent testing page.

At the end of this project, you and your team should have a fully functioning conversational agent that is able to assist users with selecting a recipe they would like to cook!

...

The This agent is will be composed of MARBEL modules and Prolog files which you can develop and edit in Eclipse and Dialogflow which which will interact with a Dialogflow agent that you need to help finish. The goal of this project is to use Dialogflow, MARBEL, and Prolog to develop a conversational agent with the knowledge to assist us in a given task. In this case, we have a large recipe database (recipe_database.pl) that the agent draws upon to help the user choose what to eatcreate and further develop. As a start, we will use a large recipe database that the agent can use to suggest recipes to users. The user should be able to tell the agent certain criteria what they would like, and the agent should present try to find a fitting matching recipe (if of course, assuming that one is available in the database).

To help you decide which team member fits best to which role, the three roles are described briefly below. Make sure you also read the System testing page as you should be continuously testing your bot once it is minimally functioning.

Role 1: Dialogflow and Filters (D&F)

Do you want to get to know a very cool Google Cloud tool, and like to optimize the agent’s understanding of what the user says? In the introductory part of the course, you should have learned a bit about Dialogflow. If that sounded super cool to you, then this could be the section for you! You will create Intents, Entities, and Prolog Filter Rules in order to provide the agent with the vocabulary comprehension and filtering abilities it needs to converse about recipes.

For more information, go to the Dialogflow and Filters Section:

...

Role 2: Visual Support (VS)

Do you want to get creative? Your agent will not only have a conversational component, but a visual one. The program uses dynamic webpages to provide the user with visual support in their conversation. This not only includes subtitles to the conversation, but also information in support of what is talked about at any moment - such as the recipes that fit the preferences voiced by the user. If you are ready to break out a bit of HTML and Prolog, you can create cool pages through rules. The Visual Support team will incorporate visuals to enrich the conversation.

For more information, go to the Visual Support Section:

...

Role 3: Patterns and Responses (P&R)

Do you like to think in more detail about the anatomy of a conversation? Then this role could be of interest to you. The patterns and responses section focuses on the conversation part of the conversational agent. Your bot will not be the most robust it can only handle some specific tasks. Thus, we create patterns of what we think conversations in the agent’s intended domain will look like so it knows what to expect and how to respond. You will encode these patterns in Prolog and add responses for every situation.

For more information go to the Pattern and Responses Section:

...

Setting up the Tools You Need

Before you actually install the tools that you need for this project, first have a brief look at this page to get a quick overview of what you will use for developing your agent:

...

Now you have some idea of what software tools you will be using, get started by following the instructions on this page for setting up the tools you will need:

...

Getting up to Speed: the Background Knowledge You Need

Before beginning your project, we would like you all to familiarize yourself with some important background knowledge about Git and Dialogflow. You will find you will need this to successfully navigate this project, and it will help you to better understand what some of the tools and one of the key components of the agent are all about. The basic stuff that we think you really need to know can be found here:

...

Note

Note that to test your knowledge of these subjects, there are Canvas quizzes that must be completed.

Info

The information provided here should be sufficient for you to complete the project. For those of you who are interested and want to learn more about conversational patterns and the related coding scheme that we use here (e.g. C4, etc.), see: Moore, R. J., Arar, R. (2019), Conversational UX Design: A Practitioner's Guide to the Natural Conversation Framework. ACM.