Available services
The social interaction cloud has many components available for you to speed up creating social interactions with the robot’s.
Service | Command | Source | Demo files | Install | Note |
---|---|---|---|---|---|
Dialogflow for creating conversational agents using google’s framework. This provides a flow chart like dialog management and speech recognition. |
| demo_desktop_microphone_dialogflow.py or demo_nao_dialogflow.py |
| Â | |
Face detection using OpenCV’s cascading classifier, which is very fast and can run on a laptop CPU |
| None, no extra dependencies are needed | Â | ||
DNN Face detection using a YOLOv7 neural network for accurate detection, and detection of small faces. |
|
| The model file used in this example can be found here: https://drive.google.com/file/d/1oIaGXFd4goyBvB1mYDK24GLof53H9ZYo/view | ||
DNN Face recognition using a resnet50 network to extract face embeddings and assign an id based on automatic clustering. |
|
| The cascade classifier file used in this example can be found here: haarcascade_frontalface_default.xml. The model file can be found here: resnet50_ft_weight.pt | ||
OpenAI ChatGPT a text based large language model that provides a very capable dialog agent. Requires a credit card. |
|
| An openai api key can be created here: https://platform.openai.com/api-keys | ||
OpenAI Whisper a powerful speech to text model, capable of running both local and in the cloud. Cloud usage requires a credit card. Start and end recognition is performed using python’s SpeechRecognition |
|
| An openai api key can be created here: https://platform.openai.com/api-keys | ||
Google Text to speech using google cloud API. Requires a credit card. (WIP) | Â | Â | Â | Â | |
Templates for creating your own components |
| Â | Â | Â |
Â