
...

Service	Command	Source	Demo files	Install	Note
Dialogflow, for creating conversational agents using Google’s framework. It provides flowchart-like dialog management and speech recognition.	`run-dialogflow`	dialogflow	demo_desktop_microphone_dialogflow.py or demo_nao_dialogflow.py	`pip install social-interaction-cloud[dialogflow]`
Face detection using OpenCV’s cascade classifier, which is very fast and can run on a laptop CPU.	`run-face-detection`	face_detection	demo_desktop_camera_facedetection.py	None, no extra dependencies are needed
DNN face detection using a YOLOv7 neural network for accurate detection, including of small faces.	`run-face-detection-dnn --model yolo7-face.pt`	face_detection_dnn	demo_desktop_camera_facedetection_dnn.py	`pip install social-interaction-cloud[face-detection-dnn]`	The model file used in this example can be found here: https://drive.google.com/file/d/1oIaGXFd4goyBvB1mYDK24GLof53H9ZYo/view
DNN face recognition using a ResNet50 network to extract face embeddings and assign an ID based on automatic clustering.	`run-face-recognition --model xxx.pt --cascadefile xxx.xml`	face_recognition_dnn	demo_desktop_camera_facerecognition.py	`pip install social-interaction-cloud[face-recognition]`	The cascade classifier file used in this example can be found here: haarcascade_frontalface_default.xml. The model file can be found here: resnet50_ft_weight.pt
OpenAI ChatGPT, a text-based large language model that provides a very capable dialog agent. Requires a credit card.	`run-gpt`	gpt	demo_openai_gpt.py	`pip install social-interaction-cloud[openai-gpt]`	An OpenAI API key can be created here: https://platform.openai.com/api-keys
OpenAI Whisper, a powerful speech-to-text model that can run both locally and in the cloud. Cloud usage requires a credit card. Detection of the start and end of speech is performed using Python’s SpeechRecognition package: https://pypi.org/project/SpeechRecognition/	`run-whisper`	whisper_speech_to_text	demo_desktop_microphone_whisper.py	`pip install social-interaction-cloud[whisper-speech-to-text]`	An OpenAI API key can be created here: https://platform.openai.com/api-keys
Google Text-to-Speech using the Google Cloud API. Requires a credit card. (WIP)		text2speech
Templates for creating your own components		templates
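
The install commands in the table use pip "extras" in square brackets, which some shells (zsh, for example) treat as glob patterns unless the argument is quoted. The following is a minimal sketch, not part of the framework, that builds a shell-safe install command for a given service command. The `EXTRAS` mapping and the `pip_install_command` helper are hypothetical names introduced here for illustration; the extras are copied from the table, and falling back to the base package when no extra is listed is an assumption.

```python
import shlex

# Hypothetical lookup table: service command -> pip extra, copied from the table above.
# None means the table lists no extra dependencies for that service.
EXTRAS = {
    "run-dialogflow": "dialogflow",
    "run-face-detection": None,
    "run-face-detection-dnn": "face-detection-dnn",
    "run-gpt": "openai-gpt",
    "run-whisper": "whisper-speech-to-text",
}

PACKAGE = "social-interaction-cloud"


def pip_install_command(service_command: str) -> str:
    """Return a shell-safe pip install command for the given service command."""
    extra = EXTRAS[service_command]
    # Assumption: with no extra listed, the base package is all that is needed.
    requirement = PACKAGE if extra is None else f"{PACKAGE}[{extra}]"
    # shlex.quote adds single quotes when the string contains shell-special
    # characters such as the square brackets around the extra.
    return "pip install " + shlex.quote(requirement)


if __name__ == "__main__":
    for command in EXTRAS:
        print(f"{command}: {pip_install_command(command)}")
```

Quoting the requirement is harmless in shells that do not expand brackets, so the quoted form can be used everywhere.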

...
