Available services

The Social Interaction Cloud provides many ready-made components that speed up creating social interactions with robots.

[WIP] Depth estimation using the stereo grayscale cameras

https://bitbucket.org/socialroboticshub/framework/src/master/sic_framework/services/depth_estimation/
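
As background for the depth estimation entry above, here is a minimal sketch of how depth is usually recovered from a rectified stereo pair: depth equals focal length times baseline divided by disparity. This is the textbook pinhole relation, not necessarily what the service implements; the function name and the camera values are hypothetical.

```python
def depth_from_disparity(disparity_px, focal_length_px, baseline_m):
    """Depth in metres of a pixel in a rectified stereo pair (pinhole model)."""
    if disparity_px <= 0:
        # No measurable shift between the two views: treat as infinitely far.
        return float("inf")
    return focal_length_px * baseline_m / disparity_px


# Hypothetical camera: 400 px focal length, 12.5 cm between the two lenses.
print(depth_from_disparity(20.0, 400.0, 0.125))  # 2.5
```

Smaller disparities map to larger depths, so distant objects are the most sensitive to matching errors.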

Dialogflow for creating conversational agents using Google’s framework. It provides flowchart-like dialog management and speech recognition.

https://bitbucket.org/socialroboticshub/framework/src/master/sic_framework/services/dialogflow/

Face detection using OpenCV’s cascade classifier, which is very fast and can run on a laptop CPU.

https://bitbucket.org/socialroboticshub/framework/src/master/sic_framework/services/face_detection/

DNN Face detection using a YOLOv7 neural network for more accurate detection, including detection of small faces.

DNN Face recognition using a ResNet-50 network to extract face embeddings and assign an ID to each face based on automatic clustering.
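
To illustrate what assigning an ID by clustering embeddings can look like, the sketch below keeps one running-mean embedding per known face and matches new embeddings by cosine similarity. It is an assumption-laden toy, not the service's algorithm: the class name `FaceIdAssigner`, the 0.8 threshold, and the 3-dimensional vectors are all hypothetical (real face embeddings have far more dimensions).

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


class FaceIdAssigner:
    """Hypothetical online clustering: one centroid per face ID."""

    def __init__(self, threshold=0.8):
        self.threshold = threshold  # minimum similarity to reuse an ID
        self.centroids = []         # running-mean embedding per ID
        self.counts = []            # number of embeddings merged per ID

    def assign(self, embedding):
        best_id, best_sim = None, self.threshold
        for face_id, centroid in enumerate(self.centroids):
            sim = cosine_similarity(embedding, centroid)
            if sim >= best_sim:
                best_id, best_sim = face_id, sim
        if best_id is None:
            # Nothing similar enough: start a new cluster with a new ID.
            self.centroids.append(list(embedding))
            self.counts.append(1)
            return len(self.centroids) - 1
        # Fold the new embedding into the matched cluster's running mean.
        n = self.counts[best_id]
        self.centroids[best_id] = [
            (c * n + e) / (n + 1)
            for c, e in zip(self.centroids[best_id], embedding)
        ]
        self.counts[best_id] = n + 1
        return best_id
```

With this sketch, two similar embeddings such as `[1, 0, 0]` and `[0.9, 0.1, 0]` share an ID, while a dissimilar one such as `[0, 1, 0]` starts a new ID.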

[WIP] General object tracking using motpy, which adds temporal knowledge of objects across frames.
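
The idea behind temporal knowledge of objects is that a detection keeps the same ID from frame to frame. The sketch below shows that idea with greedy intersection-over-union (IoU) matching in plain Python. It does not use the motpy API and is much simpler than motpy, which also filters motion; `SimpleTracker` and the 0.3 threshold are hypothetical.

```python
def iou(a, b):
    """Intersection over union of two (x1, y1, x2, y2) boxes."""
    iw = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    ih = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = iw * ih
    union = (a[2] - a[0]) * (a[3] - a[1]) + (b[2] - b[0]) * (b[3] - b[1]) - inter
    return inter / union if union > 0 else 0.0


class SimpleTracker:
    """Hypothetical tracker: reuse an ID when boxes overlap enough."""

    def __init__(self, min_iou=0.3):
        self.min_iou = min_iou
        self.tracks = {}   # track ID -> box seen in the previous frame
        self.next_id = 0

    def step(self, boxes):
        """Match this frame's boxes to known tracks; return {track_id: box}."""
        unmatched = dict(self.tracks)
        current = {}
        for box in boxes:
            # Pick the not-yet-matched track that overlaps this box the most.
            best = max(unmatched, key=lambda t: iou(unmatched[t], box), default=None)
            if best is not None and iou(unmatched[best], box) >= self.min_iou:
                current[best] = box
                del unmatched[best]
            else:
                current[self.next_id] = box
                self.next_id += 1
        # Tracks without a match in this frame are dropped.
        self.tracks = current
        return current
```

Calling `step` once per frame with that frame's detected boxes returns the same ID for an object that moved only slightly, and a fresh ID for anything new.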

OpenAI ChatGPT, a text-based large language model that provides a very capable dialog agent. Requires a credit card.

OpenAI Whisper, a powerful speech-to-text model capable of running both locally and in the cloud. Cloud usage requires a credit card. Start and end recognition is performed using Python’s

Google Text-to-Speech using the Google Cloud API. Requires a credit card.

Pepper’s Tablet for displaying URLs or HTML on the tablet.

Templates for creating your own components