ML Reading Group
In October 2021 I started a reading group among my cohort in the MSc in Advanced Computer Science at Oxford University. We read papers that highlight important activity in the field of Machine Learning, going for breadth over depth. The point is to develop a spanning view of the field so that we might have good ideas on how to change it.
List
29 October 2021
- Rubin and Berant 2021: SmBoP: Semi-autoregressive Bottom-up Semantic Parsing presented by Kyle Duffy
- Vaswani et. al. 2017: Attention Is All You Need presented by Oliver Rausch
5 November 2021
- Raffel et. al. 2019: Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer (usually referred to as “T5”) presented by Kyle Duffy
- Rausch Ben-Nun et. al. 2021: A Data-Centric Optimization Framework for Machine Learning presented by Oliver Rausch
12 November 2021
- Lundberg and Lee 2017: A Unified Approach to Interpreting Model Predictions (usually referred to as “SHAP”) presented by Fernando Almansa
- Lewis et. al. 2019: BART: Denoising Sequence-to-Sequence Pre-training for Natural Language Generation, Translation, and Comprehension discussed but not presented. Today the group went to a talk called Off-Belief Learning and Zero-Shot Coordination given by Jakob Foerster
19 November 2021
- Moll et. al. 2018: Quantum optimization using variational algorithms on near-term quantum devices presented by Vasilis Ntogramatzis
- Kumar and Talukdar 2020: NILE : Natural Language Inference with Faithful Natural Language Explanations presented by Wilfried Bounsi
26 November 2021
- Komeili, Shuster, and Weston 2021: Internet-Augmented Dialogue Generation presented by Kyle Duffy
- Kerenidis, Landman, Prakash 2019: Quantum Algorithms for Deep Convolutional Neural Networks presented by Snehal Raj
21 January 2022
- Lee-Thorp et. al. 2021: FNet: Mixing Tokens with Fourier Transforms presented by Kyle Duffy
- Borgeaud et. al. 2021: Improving language models by retrieving from trillions of tokens presented by Fernando Almansa
11 February 2022
- Chen et. al. 2020: A Simple Framework for Contrastive Learning of Visual Representations presented by Oliver Rausch
- Veličković et. al. 2018: Graph Attention Networks presented by Kyle Duffy
18 February 2022
- Lester, Al-Rfou, and Constant 2021: The Power of Scale for Parameter-Efficient Prompt Tuning presented by Hunar Batra
- Li et. al. 2022: Competition-Level Code Generation with AlphaCode presented by Fernando Almansa
25 February 2022
- Rezende and Mohamed 2016: Variational Inference with Normalizing Flows presented by Marcel Rød
11 March 2022
- Nye et. al. 2021: Show Your Work: Scratchpads for Intermediate Computation with Language Models presented by Hunar Batra
- Drexler 2019: Reframing Superintelligence: Comprehensive AI Services as General Intelligence presented by Oliver Rausch