Dialogflow

Before you start: make sure your MARBEL agent is connected to Dialogflow.

Read more about creating intents and entities before you start on the DialogFlow: Create an Agent, Intents and Entities page.

Creating a Greeting Intent

Create an intent called greeting in your Dialogflow agent.

Add Training Phrases: Add a number of expressions (at least 10) as examples of how a user might greet your agent. Be thorough and try to cover as many phrases as you can come up with. For inspiration, you can do a Google search for greeting phrases. You may find, for example, useful phrases here: 20 Greetings in English. You can also ask ChatGPT to generate example phrases.
Under Action and parameters make sure the box above the table is filled in with the name of your intent as shown in the image below.

Don’t forget to press SAVE!

Check that your greeting intent is working by using the microphone button in the test console on the DIalogflow console page. Try various phrases also using the test console and check whether what you say is recognized as a greeting intent.

Prolog and Patterns

Greeting pattern without self-identification

We shall start by considering how a typical greeting pattern looks like. When the agent does not introduce itself, a very common greeting pattern could like look this:

Example:

A: Hello.

U: Hi!

In this example conversation, we see that agent A greets the user and the user U greets the agent in return. We will now try to capture this pattern using the intent we just defined in Dialogflow.

To capture the example pattern, we will use the Prolog predicate pattern([PatternID | Sequence]). Note that this predicate only takes a single list argument, but this list should always have the same structure: the first item in the list should be a PatternID or name of the pattern, followed by a sequence of actor-intent pairs (more on this below).

The first thing we thus always need to come up with when defining a pattern/1 fact is to specify a pattern identification code (ID). For this purpose, we use a systematic taxonomy of patterns with codes for patterns as specified in the book Conversational UX Design: A Practitioner’s Guide to the Natural by Robert J. Moore and Raphael Arar. Greetings are classified as opening patterns by Moore and Arar. A minimal greeting such as in our example is classified as C1.0 in their taxonomy. In our code, we like to avoid the use of capitals and dots too, and we will simply write c10 in our code to refer to this greeting pattern.

The second part of the list we need to add as argument when creating a pattern/1 fact consists of the sequence of dialog moves the pattern itself consists of. A dialog move has two parts: first, we need to specify who makes the move (in our case, the agent or the user); second, we need to specify the type of move made by either actor. As we will classify what agents and users say as intents, we will use intent labels for representing dialog move types. A dialog move then can be represented as a two-element list [Actor, Intent], where the first element Actor is the dialog actor doing the talking (either the user or the agent) and the second element Intent is the intent representing the dialog move type. The first move in our greeting pattern example above thus can be represented by [agent, greeting] and the second move by [user, greeting]. Combining this with the pattern ID c10, the simple and most basic greeting pattern of our example can be represented by the following fact:

pattern([c10, [agent, greeting], [user, greeting]]).

Make sure the labels (names of intents and entities) that you use in your MARBEL agent code are always exactly the same as the intents and entity entry names that you have specified in your Dialogflow agent! Small spelling mistakes (e.g., using capitals or not, or other misspellings) may cause issues later on!

We advise using Camel case for intent and entity names, starting with a lowercase letter everywhere (except for the Training phrases that you add).

As mentioned above, this greeting pattern should be used if the agent does not introduce itself by its name. If the agent has a name, we would like the agent to introduce itself. To differentiate between these two cases, we will add a condition to our c10 pattern fact which indicates that it should only be used when the agent has no name (which we will identify with the basic fact agentName(''), i.e. when the empty string is associated with the agent’s name). We then obtain the following complete rule for our basic non-self-identifying greeting pattern:

pattern([c10, [agent, greeting], [user, greeting]]) :- agentName('').

You should add this Prolog rule to the patterns.pl file.

We have used the same name or label greeting that we used for the user’s move also for the agent’s move in the pattern that we defined. This makes sense conceptually because the moves are the same type of move, i.e. a greeting from one actor to another. The same intent label, however, should be handled very differently for the agent than for a user. Dialogflow should handle the natural language understanding (NLU) of a user’s move first by (transcribing the speech and then) classifying the move as an intent. In contrast, the MARBEL agent should handle the agent intent and make sure the agent generates natural language (NLG) text as output for a speech synthesizer. A user’s intent thus can be viewed as an input label whereas agent intents can be viewed as output or response labels. Below, we will see how we can provide texts for the agent to make its move in the responses.pl file. Because the same intent label for a user and for an agent are handled in very different places, there is no harm in using the same label either, and we can keep things conceptually simple.

Specifying the agent’s greeting

In the Prolog file responses.pl we determine what exactly the agent will say to initiate a move in the conversation or how it will respond to something a user said. The basic idea is to add natural language phrases, sentences, or text for each type of move the agent can make. In other words, we need to specify phrases for all the intent labels that occur in dialog moves in all patterns that are made by the agent. If there is no phrase specified at all for one of the agent’s possible dialog moves, the agent will not be able to perform that move…!

We will use the predicate text(Intent, Text) for specifying how the agent will greet a user. The first argument Intent of the text/2 predicate should be instantiated with an intent label in a dialog move of the agent, and the second argument Text should be a string with the text you want your agent to say. For the greeting intent we thus need to specify at least one phrase by adding a text(greeting, "YOUR PHRASE HERE") fact to the responses.pl file. Now add such a text/2 fact for the intent label greeting under the % Intent: greeting comment.

If you add only one text/2 fact for an intent label to the responses.pl file, your agent will quickly become boring (if not to your users it will soon enough start to bore you and your team members when you repeatedly need to test your agent!). To enable your agent to vary in what exactly it says when it makes a particular type of dialog move, all you need to do is to add more text/2 facts for an intent. The text_generator/2 predicate defined at the top of the responses.pl file will then randomly select one of these prhases for the agent to say.

Hear your agent say its first words

To test and hear something, you still need to do one more thing: In the dialog_init.mod2g file, add the c10 pattern name also to the agenda of the agent after start that is already in it (separate them by a comma, the agenda is a list!).

You can now Run your Conversational Agent again to hear your agent say its first opening words.

Note that unless the corresponding c10 pattern and the textual response have also beed added to the pattern.pl and reponses.pl file, your code will not yet work. Also make sure you not already added a name for your agent; the basic greeting pattern c10 assumes that the agent has no name yet!

Greeting with self-identification

You and your team should think of a name for your agent. Feel free to be creative. We need to tell our agent its name somewhere. In dialog_init.mod2g, on line 40 you can add the name you came up with for your agent. Change the empty string in insert(agentName(''), for example, to insert(agentName('Bellabot'). Now your agent has a name, we would also like the agent to self-identify and be able to use the following greeting pattern:

Pattern C1.1: Opening Self-Identification (Agent)

Example:

A: Hello.

A: My name is _____*.

U: Hi!

* insert your agent’s name here

This pattern is quite similar to the C1.0 pattern above, but consists of one more dialog move made by the agent. The second dialog move that is new in the C1.1 pattern we call a self-identification move. We suggest that you use the intent label selfIdentification for this agent move. Although Moore’s classification has it that this pattern is a c11, the difference with the c10 pattern is small and can be handled easier by our dialog manager agent if we use the same c10 label again for this only slightly different pattern. You should now be able to add another c10 pattern to the patterns.pl file which adds the self-identification move of the agent as an additional actor-intent pair. Of course, for this pattern to be selected, we should add the condition (as we did before above but now) that the agent has a name: not(agentName('')).

Specifying the agent’s self-identification

We still need to specify at least one phrase for the agent’s selfIdentification intent. We can do this by simply copying the agent’s name we inserted in dialog_init.mod2g but a more generic approach is to use the agentName(Name) query to retrieve this name from the agent’s database. This approach will also show you how you can use facts stored in the agent’s database to create text responses for an intent. The basic idea is to introduce a rule defining text(selfIdentification, Txt) instead of a simple text(selfIdentification, "SOME PHRASE") fact. For the body of this Prolog rule, you need to specify a query that concatenates two strings: A string such as “My name is” and a string of the agent’s name. To specify this query, the string_concat/3 predicate will be useful. Add the rule under the comment % Intent: self-identification in the responses.pl file.

When you have added a name for your agent, and the new pattern and rule for generating a self-identifying phrase, you can now Run your Conversational Agent again to hear your agent self-identify itself.

Visuals

Welcoming Page

When a user has visited the Start page and clicked on the Start button, your agent should start by greeting its user. But we would also like to show a webpage that welcomes the user and is shown while the greeting pattern c10 is active and ongoing. A new page should be shown if only because we need to provide the user with the ability to start talking too by clicking on the microphone icon.

As before, we need to introduce a rule for generating a webpage. The head of this rule should be page(c10, _, Html). The main requirement for this page is that it shows a microphone icon that the user can use to start talking to the agent. All you need to do for this is to create a page with a header. Other ideas for this page could be to show a greeting, and introduce your agent by showing its name.

Consider the condition that needs to succeed when you generate this page (Hint: look at the start page rule body and how the first parameter of the page/3 predicate is used in that rule).
Think about what you want to be on this page. It needs somewhere to show text to introduce your agent. For this, you can use a card or an alert, for example. This will be the first atom in your HTML code that we use in Step 3. The second atom should be HTML code with text(formatted) to introduce your agent, i.e. what you want to go on your alert or card. In Step 5 we need to format the HTML code by formating the Prolog string with ‘applyTemplate’. This means that in this second atom there should be a placeholder for the variable with the name of the agent. Read the Visual Support Guide for hints on how to do that.
Use a predicate to combine the first atom and second atom you created in the previous step (check out the Visuals Guide for how to do this). What we created in HTML above is our template, and the predicate should return the variable Template.
Now we need to get the agent's name in order to add it to our text. This knowledge should be somewhere in the agent's memory. Hint: Look at line 40 in dialog_init for the applicable predicate. Retrieve the name of the agent in a variable.
Next, you should use the applyTemplate/3 predicate to add the agent’s name to our HTML code atom (which is also explained in the Visual Support Guide).
Lastly, you should use the last variable that you used in the applyTemplate/3 predicate and use the html/2 predicate to convert your code snippet into a completed HTML page. This can be done by using html(YourCompleteHTMLCodeVariableHere, Html). Html is the variable that returns the page as you can see in the head of the rule.

Run It!

Make sure you have added the c10 pattern to the agent’s agenda in the dialog_init.mod2g file. Then test your agent. When you run your conversational agent, you should:

hit your Start page,
be able to press the Start button,
and after you do so, your agent should introduce itself;
then you should be able to unmute yourself by clicking the microphone icon in the top left corner,
give Google Chrome permission to use your microphone, and
return the greeting, by saying hello or anything else you come up with that sounds like a greeting.

Check whether your Dialogflow agent understood what you said and classified what you said into a greeting intent by inspecting the terminal in which you launched the SIC server:

All done? → https://socialrobotics.atlassian.net/wiki/spaces/PM2/pages/2216001572/Building+Your+Agent#Agent-Capability-2%3A-Specify-Goal-and-Select-Recipes-by-Name

Capability 1: Greet, and Self-Identify