Reliability and Learnability of Human Bandit Feedback for Sequence-to-Sequence Reinforcement Learning
ACL materials are Copyright © 1963-2018 ACL; other materials are copyrighted by their respective copyright holders. Materials prior to 2016 here are licensed under the Creative Commons Attribution-NonCommercial-ShareAlike 3.0 International License. Permission is granted to make copies for the purposes of teaching and research. Materials published in or after 2016 are licensed on a Creative Commons Attribution 4.0 License.
Ingestion Queue |
Matt Post (Editor, 2019–) /
Min-Yen Kan (Editor, 2008–2018) /
Steven Bird (Editor, 2001–2007)