The Social Interaction Cloud provides many ready-made components that speed up creating social interactions with robots.
[WIP] Depth estimation using the stereo grayscale cameras | |
Dialogflow for creating conversational agents using Google’s framework. This provides flowchart-like dialog management and speech recognition. | https://bitbucket.org/socialroboticshub/framework/src/master/sic_framework/services/dialogflow/ |
Face detection using OpenCV’s cascade classifier, which is very fast and can run on a laptop CPU. | https://bitbucket.org/socialroboticshub/framework/src/master/sic_framework/services/face_detection/ |
DNN Face detection using a YOLOv7 neural network for accurate detection, including detection of small faces. | |
DNN Face recognition using a ResNet50 network to extract face embeddings and assign an ID based on automatic clustering. | |
[WIP] Object tracking using motpy, for general object tracking with temporal knowledge of objects. | https://bitbucket.org/socialroboticshub/framework/src/master/sic_framework/services/object_tracking/ |
OpenAI ChatGPT, a text-based large language model that provides a very capable dialog agent. Requires a credit card. | https://bitbucket.org/socialroboticshub/framework/src/master/sic_framework/services/openai_gpt/ |
OpenAI Whisper, a powerful speech-to-text model that can run both locally and in the cloud. Cloud usage requires a credit card. Detecting the start and end of speech is handled by Python’s SpeechRecognition package (https://pypi.org/project/SpeechRecognition/). | |
Google Text-to-Speech using the Google Cloud API. Requires a credit card. | https://bitbucket.org/socialroboticshub/framework/src/master/sic_framework/services/text2speech/ |
Pepper’s Tablet for displaying URLs or HTML on the tablet. | https://bitbucket.org/socialroboticshub/framework/src/master/sic_framework/services/webserver/ |
Templates for creating your own components | https://bitbucket.org/socialroboticshub/framework/src/master/sic_framework/services/templates/ |
...
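
The face recognition component listed above assigns an ID to each face by clustering embeddings. The table does not say which clustering method the service uses, so the following is only a minimal sketch of the general idea, assuming a simple incremental scheme: an embedding joins the most similar known cluster if its cosine similarity exceeds a threshold, and otherwise starts a new ID. The class name `EmbeddingIdAssigner`, the threshold of 0.8, and the 3-dimensional toy vectors are all hypothetical and not taken from the framework.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


class EmbeddingIdAssigner:
    """Hypothetical helper, not part of the framework: maps embeddings to integer IDs."""

    def __init__(self, threshold=0.8):
        self.threshold = threshold  # assumed similarity cut-off for "same person"
        self.centroids = []         # running mean embedding for each ID
        self.counts = []            # number of embeddings merged into each centroid

    def assign(self, embedding):
        # Find the most similar existing cluster, if any.
        best_id, best_sim = None, -1.0
        for idx, centroid in enumerate(self.centroids):
            sim = cosine_similarity(embedding, centroid)
            if sim > best_sim:
                best_id, best_sim = idx, sim

        # Nothing similar enough: register a new identity.
        if best_id is None or best_sim < self.threshold:
            self.centroids.append(list(embedding))
            self.counts.append(1)
            return len(self.centroids) - 1

        # Otherwise fold the embedding into the matched cluster's running mean.
        n = self.counts[best_id]
        self.centroids[best_id] = [
            (c * n + e) / (n + 1) for c, e in zip(self.centroids[best_id], embedding)
        ]
        self.counts[best_id] = n + 1
        return best_id


# Toy embeddings standing in for real face embeddings: two similar pairs.
assigner = EmbeddingIdAssigner(threshold=0.8)
toy_embeddings = [
    [1.0, 0.0, 0.1],
    [0.9, 0.1, 0.1],
    [0.0, 1.0, 0.0],
    [0.1, 0.9, 0.0],
]
ids = [assigner.assign(e) for e in toy_embeddings]
print(ids)  # [0, 0, 1, 1]
```

The threshold controls the trade-off: a higher value splits one person into several IDs more readily, while a lower value risks merging different people under one ID.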