Differences between revisions 9 and 10
Revision 9 as of 2017-06-11 13:52:04
Size: 1983
Editor: DavidOwen
Comment: ML generating ML
Revision 10 as of 2017-07-06 04:32:38
Size: 2189
Editor: DavidOwen
Comment:
Deletions are marked like this. Additions are marked like this.
Line 11: Line 11:
 * [[https://arxiv.org/abs/1706.03741|Deep reinforcement learning from human preferences]]: Aims to minimize how much time a human must give feedback to the system for the system to train itself correctly

Papers for discussion

Papers (last edited 2019-08-04 01:39:13 by DavidOwen)