| Service | Command | Source |
|---|---|---|
| Dialogflow, for creating conversational agents with Google's framework. It provides flowchart-like dialog management and speech recognition. | run-dialogflow | dialogflow |
| Face detection using OpenCV's cascade classifier, which is very fast and can run on a laptop CPU. | run-face-detection | face_detection |
| DNN face detection using a YOLOv7 neural network for accurate detection, including of small faces. | run-face-detection-dnn | face_detection_dnn |
| DNN face recognition using a ResNet-50 network to extract face embeddings and assign an ID based on automatic clustering. | run-face-recognition --model xxx.pt --cascadefile xxx.xml | face_recognition_dnn |
| OpenAI ChatGPT, a text-based large language model that provides a very capable dialog agent. Requires a credit card. | run-gpt | gpt |
| OpenAI Whisper, a powerful speech-to-text model that can run both locally and in the cloud. Cloud usage requires a credit card. Detection of the start and end of speech is performed using Python's SpeechRecognition package (https://pypi.org/project/SpeechRecognition/). | run-whisper | whisper_speech_to_text |
| Google Text-to-Speech, using the Google Cloud API. Requires a credit card. | | text2speech |
| Templates for creating your own components. | | templates |
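
The face recognition service listed above assigns an ID to each face by clustering its embeddings. This page does not say which clustering method the component uses, so the following is only a minimal sketch, assuming greedy threshold-based clustering on cosine similarity. The names `FaceIdAssigner` and `cosine_similarity` and the 0.8 threshold are hypothetical and chosen purely for illustration; real ResNet-50 embeddings would also be far higher-dimensional than the toy vectors mentioned below.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


class FaceIdAssigner:
    """Assigns integer IDs by greedy, threshold-based online clustering.

    Hypothetical helper for illustration; not part of the framework.
    """

    def __init__(self, threshold=0.8):
        self.threshold = threshold  # assumed similarity cut-off
        self.centroids = []         # running mean embedding per ID
        self.counts = []            # number of embeddings merged per ID

    def assign(self, embedding):
        """Return the ID of the most similar cluster, or open a new one."""
        best_id, best_sim = None, -1.0
        for face_id, centroid in enumerate(self.centroids):
            sim = cosine_similarity(embedding, centroid)
            if sim > best_sim:
                best_id, best_sim = face_id, sim

        if best_id is not None and best_sim >= self.threshold:
            # Similar enough: fold the embedding into the running mean.
            n = self.counts[best_id]
            self.centroids[best_id] = [
                (c * n + e) / (n + 1)
                for c, e in zip(self.centroids[best_id], embedding)
            ]
            self.counts[best_id] = n + 1
            return best_id

        # No cluster is similar enough: treat this as a new face.
        self.centroids.append(list(embedding))
        self.counts.append(1)
        return len(self.centroids) - 1
```

With this sketch, feeding the toy embeddings `[1.0, 0.0, 0.0]`, `[0.9, 0.1, 0.0]`, and `[0.0, 1.0, 0.0]` to one `FaceIdAssigner` in that order yields the IDs 0, 0, and 1: the first two vectors point in nearly the same direction and share an ID, while the third opens a new cluster. A greedy scheme like this is order-dependent and sensitive to the threshold, which is one reason a real component might use a different clustering method.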
