Natasha Jaques

What Makes ChatGPT Chat? Modern AI for the layperson

Natasha Jaques

What Makes ChatGPT Chat? Modern AI for the layperson

39:04

Reinforcement Learning (RL) for LLMs

Natasha Jaques

Reinforcement Learning (RL) for LLMs

33:10

Social Reinforcement Learning talk at RLDM

Natasha Jaques

Social Reinforcement Learning talk at RLDM

37:41

Badly trained policy after 40000 steps

Natasha Jaques

Badly trained policy after 40000 steps

1:40

Multi-agent DQN training step 90000 trajectory video

Natasha Jaques

Multi-agent DQN training step 90000 trajectory video

1:40

Multi-agent DQN training step 0 trajectory video

Natasha Jaques

Multi-agent DQN training step 0 trajectory video

1:40

Learning to grab with bell as reward

Natasha Jaques

Learning to grab with bell as reward

1:14

Intel Deep Learning Community of Practice talk

Natasha Jaques

Intel Deep Learning Community of Practice talk

57:28

Natasha Jaques PhD Thesis Defense

Natasha Jaques

Natasha Jaques PhD Thesis Defense

1:30:15

Personalized Multi-task Learning for Predicting Tomorrow's Mood, Stress, and Health

Natasha Jaques

Personalized Multi-task Learning for Predicting Tomorrow's Mood, Stress, and Health

1:53

VHRED Cornell baseline

Natasha Jaques

VHRED Cornell baseline

0:27

Influence agent in Harvest game

Natasha Jaques

Influence agent in Harvest game

0:34

Influence agent in Cleanup game

Natasha Jaques

Influence agent in Cleanup game

0:34

A3C baseline in Harvest

Natasha Jaques

A3C baseline in Harvest

0:34

A3C baseline in Cleanup game

Natasha Jaques

A3C baseline in Cleanup game

0:34

Agent trained with intrinsic social influence reward - Tragedy of the Commons

Natasha Jaques

Agent trained with intrinsic social influence reward - Tragedy of the Commons

0:32

Agent trained with intrinsic social influence reward

Natasha Jaques

Agent trained with intrinsic social influence reward

0:13

Influence reward in River with 1 influencer

Natasha Jaques

Influence reward in River with 1 influencer

0:34

A3C will not free other agent trapped in a box

Natasha Jaques

A3C will not free other agent trapped in a box

0:17

Influence agent frees compatriot trapped in a box

Natasha Jaques

Influence agent frees compatriot trapped in a box

0:17

Note RNN

Natasha Jaques

Note RNN

0:06

Natasha Jaques

Q

0:08

Natasha Jaques

G

0:09

Basic LSTM

Natasha Jaques

Basic LSTM

0:08

Psi

Natasha Jaques

Psi

0:08

RL Tuner

Natasha Jaques

RL Tuner

0:08

EDAExplorer PeakTutorial

Natasha Jaques

EDAExplorer PeakTutorial

0:31

EDAExplorer ArtifactTutorial

Natasha Jaques

EDAExplorer ArtifactTutorial

0:29

The Challenge

Natasha Jaques

The Challenge

0:21

Affective Computing - Spring 2015 Virtual Visit

Natasha Jaques

Affective Computing - Spring 2015 Virtual Visit

1:18

Eye gaze data

Natasha Jaques

Eye gaze data

0:30

5 Lego Robots Dancing to Gangnam Style

Natasha Jaques

5 Lego Robots Dancing to Gangnam Style

0:34

Lego Robot Gangnam Style

Natasha Jaques

Lego Robot Gangnam Style

0:36

次のページ