Week 1
In week 1 we met up on the following days:
...
Kiran: look into the libraries useful for extracting features from the environment that can be used for Personalisation and Reinforcement Learning. Features to be extracted are:
Age
Clothing
There were problems in installing the libraries tensorflow and keras because of the MacOS and python version in the system resulting in conflicts.
Error - zsh: illegal hardware instruction python
Ran into a problem when installing mediapipe, but was resolved:
Error - No matching distribution found for mediapipe mac
Solution : Step 1 - Install Rosetta 2 ( /usr/sbin/softwareupdate --install-rosetta)
...
What did we work on? | |
---|---|
Monday 10:00 - 16:30 |
|
Tuesday 10:00 - 17:00 |
|
Thursday 9:15 - 17:30 |
|
Friday 9:15 - 15:00 |
|
Monday 31.1
Unfortuately we did not have access to the lab today, so we worked on the VU together on the presentation and stuff we could do without having the robots available.
...
Gt is the encoded feedback.
The feedback means either to:
increase duration
decrease duration
stop
If the feedback means to increase, Gt would be a positive value.
But since we want to increase the duration, we do not want any negative value while updating that would decrease the thetamu
Therefore we check if :
(where At is the prior duration action, mutheta(St) is the mean of the Age range), is positive or negative. If the difference is negative and the feedback means to increase, Gt would be a negative value(say -0.1) and if the feedback is meant to increase and if the difference is positive, Gt would be a positive value(say 0.1).
...