What Makes ChatGPT Chat? Modern AI for the layperson
Natasha Jaques
39:04
Reinforcement Learning (RL) for LLMs
Natasha Jaques
33:10
Social Reinforcement Learning talk at RLDM
Natasha Jaques
37:41
Badly trained policy after 40000 steps
Natasha Jaques
1:40
Multi-agent DQN training step 90000 trajectory video
Natasha Jaques
1:40
Multi-agent DQN training step 0 trajectory video
Natasha Jaques
1:40
Learning to grab with bell as reward
Natasha Jaques
1:14
Intel Deep Learning Community of Practice talk
Natasha Jaques
57:28
Natasha Jaques PhD Thesis Defense
Natasha Jaques
1:30:15
Personalized Multi-task Learning for Predicting Tomorrow's Mood, Stress, and Health
Natasha Jaques
1:53
VHRED Cornell baseline
Natasha Jaques
0:27
Influence agent in Harvest game
Natasha Jaques
0:34
Influence agent in Cleanup game
Natasha Jaques
0:34
A3C baseline in Harvest
Natasha Jaques
0:34
A3C baseline in Cleanup game
Natasha Jaques
0:34
Agent trained with intrinsic social influence reward - Tragedy of the Commons
Natasha Jaques
0:32
Agent trained with intrinsic social influence reward
Natasha Jaques
0:13
Influence reward in River with 1 influencer
Natasha Jaques
0:34
A3C will not free other agent trapped in a box
Natasha Jaques
0:17
Influence agent frees compatriot trapped in a box
Natasha Jaques
0:17
Note RNN
Natasha Jaques
0:06
Q
Natasha Jaques
0:08
G
Natasha Jaques
0:09
Basic LSTM
Natasha Jaques
0:08
Psi
Natasha Jaques
0:08
RL Tuner
Natasha Jaques
0:08
EDAExplorer PeakTutorial
Natasha Jaques
0:31
EDAExplorer ArtifactTutorial
Natasha Jaques
0:29
The Challenge
Natasha Jaques
0:21
Affective Computing - Spring 2015 Virtual Visit
Natasha Jaques
1:18
Eye gaze data
Natasha Jaques
0:30
5 Lego Robots Dancing to Gangnam Style
Natasha Jaques
0:34
Lego Robot Gangnam Style
Natasha Jaques
0:36